Commit Graph

266 Commits

Author SHA1 Message Date
Xingjun.Wang
7d11b77112 Fix trust_remote_code (#1462)
1. Set `trust_remote_code` to `True` by default in datasets module
2. Set `trust_remote_code` to `True` by default in PolyLM pipeline
2025-08-14 10:56:16 +08:00
Xingjun.Wang
055496c597 Fix CI 2025-08-07 19:26:32 +08:00
Xingjun.Wang
0e49156bbe fix TestTinynasDamoyoloTrainerSingleGPU (#1449) 2025-08-06 18:50:25 +08:00
co63oc
8323fc5185 Fix typos in multiple files (#1357) 2025-06-05 14:04:29 +08:00
suluyana
1fe211ffe5 fix pipeline builder when model is not supported (#1125)
* fix pipeline builder when model is not supported

* fix ci & skip
---------

Co-authored-by: suluyan.sly@alibaba-inc.com <suluyan.sly@alibaba-inc.com>
2024-12-12 19:24:38 +08:00
tastelikefeet
7579fac6f3 fix ut (#1143) 2024-12-11 10:19:29 +08:00
liuyhwangyh
82ee20f447 fix issue #845 (#861)
* fix #845

Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
2024-05-23 20:34:52 +08:00
mulin.lyh
23f1f474bf Merge branch 'master-github' into master-merge-github925
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/14164566
2023-09-26 21:15:41 +08:00
mulin.lyh
38ffb128c5 Merge branch 'master-github' into master-merge-github0901 2023-09-01 09:26:46 +08:00
XDUWQ
3b0006f52c skip tests 2023-08-31 20:46:41 +08:00
mulin.lyh
7c0f6aa5c5 Merge branch 'master-github' into master-merge-github0829 2023-08-29 21:20:54 +08:00
pingzhi.lzh
db27d5218f cones2 version 1.0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13778227

* cones2 version 1.0
2023-08-29 21:05:23 +08:00
tastelikefeet
f2e9ea4199 Replace code with swift wheel (#467) 2023-08-29 17:27:18 +08:00
Wang Qiang
de33f4dc87 Support sdxl finetune by lora method (#468)
* support sdxl finetune by lora

* remove useless imports

* support sdxl finetune

* upgrade diffusers to 0.19.0

* sdxl finetune

* fix bugs

* pre commit

* diffusers>=0.19.0
2023-08-23 11:23:34 +08:00
mulin.lyh
cba4e40bc1 fix numpy pandas compatible issue
明确受影响的模型(damo):  
ONE-PEACE-4B	ModuleNotFoundError: MyCustomPipeline: MyCustomModel: No module named 'one_peace',缺少依赖。
cv_resnet50_face-reconstruction	 不兼容tf2  
nlp_automatic_post_editing_for_translation_en2de	tf2.0兼容性问题,tf1.x需要  
cv_resnet18_ocr-detection-word-level_damo	tf2.x兼容性问题  
cv_resnet18_ocr-detection-line-level_damo	tf兼容性问题  
cv_resnet101_detection_fewshot-defrcn	模型限制必须detection0.3+torch1.11.0"  
speech_dfsmn_ans_psm_48k_causal	"librosa, numpy兼容性问题  
cv_mdm_motion-generation	"依赖numpy版本兼容性问题:   File ""/opt/conda/lib/python3.8/site-packages/smplx/body_models.py"",  
cv_resnet50_ocr-detection-vlpt	numpy兼容性问题  
cv_clip-it_video-summarization_language-guided_en	tf兼容性问题

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13744636
* numpy and pandas no version

* modify compatible issue

* fix numpy compatible issue

* modify ci

* fix lint issue

* replace Image.ANTIALIAS to Image.Resampling.LANCZOS pillow compatible

* skip uncompatible cases

* fix numpy compatible issue, skip cases that can not compatbile numpy or tensorflow2.x

* skip compatible cases

* fix clip model issue

* fix body 3d keypoints compatible issue
2023-08-22 23:04:31 +08:00
Wang Qiang
78f5e6a8bf Switching model from huggingface to modelscope hub of efficient tuning (#479) 2023-08-18 20:30:04 +08:00
suluyan.sly
1c6f5fe775 Merge branch 'master-github' into master-merge-github-230727
Conflicts:
       examples/pytorch/baichuan/finetune_baichuan.py
       examples/pytorch/chatglm6b/finetune.py
2023-07-27 17:29:27 +08:00
Wang Qiang
66cf72a75c Merge pull request #376 from XDUWQ/custom_diffusion
Custom method for finetuning stable diffusion
2023-07-27 10:41:38 +08:00
zeyinzi.jzyz
672c4899e9 add sd swift tuner
SD-Tuner base on Swift (LoRA/Adapter/Prompt)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798

* sd swift tuner

* fix pre-checker
2023-07-25 19:00:49 +08:00
XDUWQ
8e00d85317 fix bugs 2023-07-24 19:46:22 +08:00
XDUWQ
66795aa3ff change tests level 2023-07-19 09:41:21 +08:00
Wang Qiang
0b85979f2e Update diffusers version to 0.18.0 (#377)
* update diffusers to 0.18.0

* fix bugs
2023-07-14 19:02:52 +08:00
XDUWQ
34ab717393 custom_diffusion 2023-07-12 19:47:32 +08:00
XDUWQ
1caa45422c custom diffusion 2023-07-11 20:46:32 +08:00
wenmeng.zwm
0271b9c256 Merge branch 'master-github' into merge_master_github_0628 2023-06-28 20:27:34 +08:00
Wang Qiang
a018cd6107 Dreambooth method for finetuning stable diffusions (#339)
* Copyright

* dreambooth

* dreambooth test trainer

* fix bugs

* pre-commit

---------

Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>
2023-06-28 20:10:28 +08:00
yuze.zyz
8f18274f75 Add teardown for tests
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12643554

* add teardown for tests

* add teardown for dialog_modeling_trainer,document_grounded_dialog_generate_trainer,document_grounded_dialog_rerank_trainer,document_grounded_dialog_retrieval_trainer,training_args,translation_evaluation_trainer,translation_trainer
2023-06-28 09:44:44 +08:00
yuze.zyz
a58be34384 Add Lora/Adapter/Prompt and support for chatglm6B and chatglm2-6B
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12770413

* add prompt and lora

* add adapter

* add prefix

* add tests

* adapter smoke test passed

* prompt test passed

* support model id in petl

* migrate chatglm6b

* add train script for chatglm6b

* move gen_kwargs to finetune.py

* add chatglm2

* add model definination
2023-06-27 14:38:18 +08:00
xingjun.wxj
0db0ec5586 Merge code from github
1. Merge(add) daily regression from github PR (daily_regression.yaml)
2. Add lora stable diffusion from github PR
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13010802
* fix: device arg not work, rename device to ngpu (#272)

* Correcting the lora stable diffusion example script (#300)

* add vad model and punc model in README.md 

add vad model and punc model

* Merge pull request #302 from modelscope/langgz-patch-1

add vad model and punc model in README.md

* add 1.6

* modify ignore

* Merge pull request #307 from modelscope/dev_rs_16

Merge release 1.6

* undo datetime to 2099

* Merge pull request #311 from modelscope/fix_master_version

undo datetime to 2099

* add daily regression workflow

* modify workflow name

* fix cron format issue

* lora trainer

* Merge pull request #315 from liuyhwangyh/add_regression_workflow

add daily regression workflow
2023-06-21 10:22:06 +08:00
yuze.zyz
003c33648a Fix CI: test merge dataset failed
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12771691
* fix CI

* add teardown
2023-05-29 10:32:33 +08:00
yuze.zyz
febc0365de Support FlexTrain and update the structure of trainer
1. Refactor training_args
2. Refactor hooks
3. Add train_id for push_to_hub
4. Support both output_dir/output_sub_dir for checkpoint_hooks
5. Support copy when hardlink fails when checkpointing
6. Support mixed dataset config file as a CLI argument
7. Add eval txt in output folder
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12384253
* support the ignorance of file pattern
2023-05-13 12:12:04 +08:00
wanyu.wy
58df448182 Add trainer for UniTE 2023-05-11 14:41:08 +08:00
jiangnana.jnn
46072898da remove easycv codes, plugin access
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11965727

* remove easycv codes

* fix custome msdatasets import and remove metainfo

* fix pipeline imports

* fix pre-check

* fix models import

* fix pre-check

* merge master
2023-05-09 17:58:01 +08:00
bin.xue
dd4299e0c8 [to #42322933] add DFSMN model to far field KWS
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12334627
* feat: set stream mode in constructor

* fix: commit model file

* doc: add more information about DFSMN

* fix: seperation model evaluate returns None in speechbrain whose version < 0.5.12
2023-04-20 22:45:07 +08:00
hemu
672f25266f Merge branch 'master-github' into master-merge-github-0413 2023-04-13 10:32:01 +08:00
slin000111
92d7eae5b9 add token classification example and gpt3 one layer test (#268) 2023-04-13 10:21:00 +08:00
hemu
305006e18a Merge branch 'master-github' into master-merge-github-0412 2023-04-12 15:51:20 +08:00
chaojie.mcj
283517de08 add efficient tunner modules 2023-04-11 22:26:13 +08:00
ly261666
5f6cdf6814 [to #42322933] add damofd model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12261061
2023-04-11 11:08:59 +08:00
tastelikefeet
6dd94ff2bc add first case for gpt3 test (#236) 2023-04-09 21:46:48 +08:00
kangxiaoyang.kxy
d2fde9248c update image colorization metric
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12073952

* update image colorization metric

* update documentation strings
2023-03-28 15:14:10 +08:00
xingjun.wxj
4e77f654f5 backward compatible with to_task_dataset function in adaseq repo
1. backward compatible with to_task_dataset function for DefaultTrainer in adaseq repo
2. fix registry issue for RedsImageDeblurringDataset and GoproImageDeblurringDataset
3. add ut TestCustomDatasetsCompatibility
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11981956
2023-03-14 18:34:44 +08:00
lllcho.lc
38bcd54ee4 fix bug for install detecron2
从github安装库有时会因为网络问题导致失败,故改成gitee
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11945563
2023-03-10 15:14:21 +08:00
kangxiaoyang.kxy
681afac803 add finetune for ddcolor image colorization
图像上色模型添加finetune支持,本地测试已通过:python tests/test_image_colorization_trainer.py

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11874339
2023-03-10 14:00:48 +08:00
xingjun.wxj
e02a260c93 Refactor the task_datasets module
Refactor the task_datasets module:

1. Add new module modelscope.msdatasets.dataset_cls.custom_datasets.
2. Add new function: modelscope.msdatasets.ms_dataset.MsDataset.to_custom_dataset().
2. Add calling to_custom_dataset() func in MsDataset.load() to adapt new custom_datasets module.
3. Refactor the pipeline for loading custom dataset: 
	1) Only use MsDataset.load() function to load the custom datasets.
	2) Combine MsDataset.load() with class EpochBasedTrainer.
4. Add new entry func for building datasets in EpochBasedTrainer: see modelscope.trainers.trainer.EpochBasedTrainer.build_dataset()
5. Add new func to build the custom dataset from model configuration, see: modelscope.trainers.trainer.EpochBasedTrainer.build_dataset_from_cfg()
6. Add new registry function for building custom datasets, see: modelscope.msdatasets.dataset_cls.custom_datasets.builder.build_custom_dataset()
7. Refine the class SiameseUIETrainer to adapt the new custom_datasets module.
8. Add class TorchCustomDataset as a superclass for custom datasets classes.
9. To move modules/classes/functions:
	1) Move module msdatasets.audio to custom_datasets
	2) Move module msdatasets.cv to custom_datasets
	3) Move module bad_image_detecting to custom_datasets
	4) Move module damoyolo to custom_datasets
	5) Move module face_2d_keypoints to custom_datasets
	6) Move module hand_2d_keypoints to custom_datasets
	7) Move module human_wholebody_keypoint to custom_datasets
	8) Move module image_classification to custom_datasets
	9) Move module image_inpainting to custom_datasets
	10) Move module image_portrait_enhancement to custom_datasets
	11) Move module image_quality_assessment_degradation to custom_datasets
	12) Move module image_quality_assmessment_mos to custom_datasets
	13) Move class LanguageGuidedVideoSummarizationDataset to custom_datasets
	14) Move class MGeoRankingDataset to custom_datasets
	15) Move module movie_scene_segmentation custom_datasets
	16) Move module object_detection to custom_datasets
	17) Move module referring_video_object_segmentation to custom_datasets
	18) Move module sidd_image_denoising to custom_datasets
	19) Move module video_frame_interpolation to custom_datasets
	20) Move module video_stabilization to custom_datasets
	21) Move module video_super_resolution to custom_datasets
	22) Move class GoproImageDeblurringDataset to custom_datasets
	23) Move class EasyCVBaseDataset to custom_datasets
	24) Move class ImageInstanceSegmentationCocoDataset to custom_datasets
	25) Move class RedsImageDeblurringDataset to custom_datasets
	26) Move class TextRankingDataset to custom_datasets
	27) Move class VecoDataset to custom_datasets
	28) Move class VideoSummarizationDataset to custom_datasets
10. To delete modules/functions/classes:
	1) Del module task_datasets
	2) Del to_task_dataset() in EpochBasedTrainer
	3) Del build_dataset() in EpochBasedTrainer and renew a same name function.
11. Rename class Datasets to CustomDatasets in metainfo.py

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11872747
2023-03-10 09:03:32 +08:00
hemu.zp
2b1af959d5 Convert cfg during training
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11900238
2023-03-09 22:27:44 +08:00
yuze.zyz
a0bc5549a1 trainer support parallel_groups
Design doc: https://yuque.alibaba-inc.com/suluyan.sly/yh1rvu/yx0owblyebpa2b3l?singleDoc#flU3s

1. Add parallel_group field in trainer to support DP, TP, PP.
2. Move the construction of common hooks(except optimizer/lrscheduler hook) to trainer's init method to support after_init stage.
	after_init is to support DP, TP, PP's initializing
         https://aone.alibaba-inc.com/v2/workitem#viewIdentifier=1c46ee8637e0c978f115b6f7&openWorkitemIdentifier=48099986
3. Add before_eval/after_eval stage to support model wrapping.
	to solve the order problem of apex amp & ddp wrapping.
         https://aone.alibaba-inc.com/v2/workitem#viewIdentifier=1c46ee8637e0c978f115b6f7&openWorkitemIdentifier=48099986
4. Exporter supports lazy importing.
	https://aone.alibaba-inc.com/v2/workitem#viewIdentifier=1c46ee8637e0c978f115b6f7&openWorkitemIdentifier=48122780
5. Fold all megatron imports to megatron hook.
         https://aone.alibaba-inc.com/v2/workitem#viewIdentifier=1c46ee8637e0c978f115b6f7&openWorkitemIdentifier=48099986
6. Add compile method to TorchModel ,Pipeline,Trainer to support torch2.0
	https://aone.alibaba-inc.com/v2/workitem#viewIdentifier=1c46ee8637e0c978f115b6f7&openWorkitemIdentifier=46869415
7. Fix bug: Lrscheduler builder does not support torch2.0
8. Add callbacks for trainer
	https://aone.alibaba-inc.com/v2/workitem#viewIdentifier=1c46ee8637e0c978f115b6f7&openWorkitemIdentifier=48210342
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11849932
2023-03-09 21:33:35 +08:00
yuanzhi.zyz
2ce41d4974 add ocr_reco_crnntrain
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11824395
2023-03-09 21:14:09 +08:00
bin.xue
281d3c630e [to #42322933] feat: kws support continue training from a checkpoint
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11940446

* feat: kws support continue training from a checkpoint

* log: add loading model log
2023-03-09 21:07:34 +08:00
lllcho.lc
3e70ecfb37 add action detection finetune
添加action-detection模型finetune支持

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11873651
2023-03-09 16:54:19 +08:00