1104 Commits

Author SHA1 Message Date
jinmao.yk
8284d2d366 fix log format to avoid misunderstanding
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11013023
2022-12-07 19:04:09 +08:00
zhangzhicheng.zzc
92c5abb076 [to #46619305] add kwargs in init method to allow additional kwargs 2022-12-07 18:42:29 +08:00
pangda
f59f9146de fix save_pretrained & load_checkpoint bug in DDP mode
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11012439

    * fix save_pretrained & load_checkpoint bug in DDP mode
2022-12-07 18:33:00 +08:00
wanggui.hwg
3ce1866224 [to #42322933] Fix bugs for UniTE
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11011725
2022-12-07 17:23:52 +08:00
mulin.lyh
f1a7ee91c6 [to #46604161]fix: git repository initialize lfs bug
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11005012
2022-12-07 13:14:22 +08:00
wenmeng.zwm
f0d6d58b17 add project description 2022-12-07 12:37:42 +08:00
wenmeng.zwm
2ea3e2998e add project description 2022-12-07 12:33:07 +08:00
wenmeng.zwm
18130ad800 update setup.py 2022-12-07 11:50:55 +08:00
wenmeng.zwm
e2bf864f63 update audio requirements to use funasr>=0.1.4 2022-12-07 11:37:38 +08:00
wenmeng.zwm
4cba118f16 bump version to 1.1.0 2022-12-07 11:37:38 +08:00
shiyi.zxh
c3a494e46d [to #42322933]
enable finetune of ofa-mmspeech 
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10981972
2022-12-06 20:58:49 +08:00
yuze.zyz
a0d2581028 Fix gpt_neo decode
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10985686
2022-12-06 12:29:42 +08:00
pangda
5fd3e7bb43 [to #42322933] Add early stop hook 2022-12-06 10:54:47 +08:00
xuanjie.wxb
a3a942352e support prompt ner
Modify the preprocessor to add support for prompt models.
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10972542
2022-12-06 10:39:37 +08:00
ly261666
ed23d460d5 [to #42322933] Add facial landmark confidence model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10780109
2022-12-05 21:07:35 +08:00
yuze.zyz
bf97dd7501 Fix dist judgement when torch.distributed.is_available is always False
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10976015
2022-12-05 19:46:36 +08:00
ly261666
262f738460 [to #42322933] add FairFace face attribute model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10726376
2022-12-05 17:20:48 +08:00
wenmeng.zwm
d68d66f8c1 fix redundant log when using distributed training using pytorch
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10959078
2022-12-05 17:14:46 +08:00
mulin.lyh
892a237918 [to #46549395]fix: http_get_file raises an exception when headers is None
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10973267
2022-12-05 16:58:11 +08:00
suluyan.sly
c36b1cce8a [to #42322933] fix for plug
1. update pipeline for new preprocessor
2. update trainer for dist_info (remove megatron-ddp wrapper)

Link: https://code.aone.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10969594
2022-12-05 14:34:21 +08:00
ly261666
9d43823f36 [to #42322933] add TinyMogFace face detector
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10792564
2022-12-05 12:01:26 +08:00
baiguan.yt
ce0480f7ed update image-portrait-enhancement trainer
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10717891
2022-12-05 11:43:52 +08:00
hemu.zp
941dbe75cf [to #42322933] Add GPT-3 tensor parallel finetuning
Add GPT-3 tensor parallel finetuning, adjust some distributed codes to make tensor and data parallel compatible.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10949507
2022-12-05 10:01:32 +08:00
jiangyu.xzy
db7c5d1494 Unify the ASR interface; support conformer and uniasr models
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10964641

    * support new asr paraformer model

* support asr conformer model

* add new asr model tests

* fix format

* support new in params

* fix conflict

* type fix

* fix conflict
2022-12-05 09:59:55 +08:00
pangda
54d219e90b [to #42322933] add UT for NER&EL models
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10897188
2022-12-04 15:53:32 +08:00
hemu.zp
346da3d489 [to #42322933] Add mplug pretrained model
Add pre-trained models for mplug finetuning.
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10963691
2022-12-04 15:27:50 +08:00
ly261666
492aa98d9a [to #42322933] Add face mask model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10897202

    * [to #42322933] Add ArcFace face recognition model
2022-12-04 15:25:27 +08:00
wenmeng.zwm
82af080c18 update git-lfs install instruction
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10963252

    * update git-lfs install instruction
2022-12-03 17:42:20 +08:00
dingkun.ldk
000976836f 1130: add lightweight LSTM models for Chinese word segmentation / part-of-speech tagging
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10931263
2022-12-03 13:15:33 +08:00
yichang.zyc
90034236ab ofa asr support url 2022-12-03 08:49:11 +08:00
jinmao.yk
d84a1df65a add video human matting task code
add video human matting task code
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10839854
2022-12-02 19:44:01 +08:00
mulin.lyh
b293095bd0 [to #46522320]fix: fix download file timeout too short
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10961023

    * [to #46522320]fix: fix download file timeout too short
2022-12-02 19:41:59 +08:00
wanggui.hwg
e8608df930 Add support for UniTE
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10909489
2022-12-02 18:15:02 +08:00
hemu.zp
8184c86c5f [to #42322933] Fix bug for text generation task model
Fixed the bug for generate method in TaskModelForTextGeneration, which was unavailable due to the upgrade of the transformers library to version 4.24.0

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10791805
2022-12-02 17:52:19 +08:00
ly119399
2f17daa23f [to #42322933] reduce the GPU usage of dialog trainer
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10955485
2022-12-02 17:32:26 +08:00
suluyan.sly
2863a8f7fa [to #42322933] fix hook.__init__
Link: https://code.aone.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10957489

* fix hook.__init__
2022-12-02 17:09:06 +08:00
yuze.zyz
348e87e697 change sequence_length to max_length
To stay consistent with the other tokenizing args, rename sequence_length to max_length, while keeping the input args compatible with the old 'sequence_length' arg.
2022-12-02 16:57:09 +08:00
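
A minimal sketch of the kind of backward-compatible rename this commit describes, assuming the legacy argument arrives as a keyword argument. The helper name `resolve_max_length` and the deprecation warning are illustrative assumptions, not the code from the commit:

```python
import warnings


def resolve_max_length(max_length=None, **kwargs):
    """Return (max_length, remaining_kwargs), accepting the legacy name.

    Hypothetical helper: 'sequence_length' is treated as an old alias
    for 'max_length'. An explicit max_length always wins.
    """
    legacy = kwargs.pop('sequence_length', None)
    if legacy is not None:
        warnings.warn(
            "'sequence_length' is deprecated; use 'max_length' instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        if max_length is None:
            max_length = legacy
    return max_length, kwargs
```

Callers that still pass `sequence_length=128` keep working, and the alias is removed from the remaining kwargs so it is not forwarded to a tokenizer that does not know it.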
ly261666
4208d51e23 substitute face detection model in skin_retouching_pipeline.py
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10909902
2022-12-02 15:41:08 +08:00
wenmeng.zwm
c9a6b887a2 add tensorboard hook for visualization
1. add tensorboard hook to default config
2. add image visualization support to tensorboard hook and trainer
3. move evaluation logic out of single_gpu_test and multi_gpu_test to make prediction results available for further processing such as result saving and visualization.

visualization results are as follows:
![image.png](https://cn-hangzhou.oss-cdn.aliyun-inc.com/git/force/uploads/comment/29212/38448470860386707/image.png)
![image.png](https://cn-hangzhou.oss-cdn.aliyun-inc.com/git/force/uploads/comment/29212/38437794200606734/image.png)
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10894813
2022-12-02 15:13:24 +08:00
ziyuan.tw
31316b8d29 add nextvit-small_image-classification_Dailylife-labels model
Support the model newly launched in the 1130 release.
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10886253
2022-12-02 14:46:49 +08:00
ly119399
5ae1e08db6 [to #42322933] fix bug of tableQA on gpu
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10943053
2022-12-02 10:38:30 +08:00
zhangzhicheng.zzc
a318f27247 [to #42322933] speed up the ast indexing during editing
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10907357
2022-12-02 10:06:24 +08:00
yuze.zyz
0e4766f41d Fix bugs in testlevel1 & 2
1. Fix: ws regression failed.
2. Fix: label2id missing in text_classification_pipeline when preprocessor is passed in through args.
3. Fix: remove obsolete imports
4. Fix: incomplete modification
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10936431
2022-12-01 21:16:55 +08:00
rujiao.lrj
9d8eb5b0b3 support license plate detection
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10917315
2022-12-01 19:48:06 +08:00
mulin.lyh
f663f420c4 [to #46480415]feat: custom ci command supports regression cases and running all cases in a subprocess
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10936241
2022-12-01 19:33:25 +08:00
suluyan.sly
1394019102 [to #42322933] plug finetune
plug finetune: regression-tested to the best result on the DuReader-robust dataset
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10916382
2022-12-01 19:31:15 +08:00
james.wjg
9b3a92e65d Add finetune to cv/language_guided_video_summarization
Add finetune to cv/language_guided_video_summarization
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10790262
2022-12-01 19:16:56 +08:00
lllcho.lc
b8dba17543 [to #42322933] action-detection model predownload video before inference
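1. Download the video before the model processes it, so that network jitter cannot make ffmpeg fail to read the remote video and, in turn, make the model run fail
2. Improve the control parameters used at model inference time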
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10906373
2022-12-01 18:13:08 +08:00
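
The predownload step described in this commit can be sketched as follows. This is an illustration under stated assumptions, not the code from the commit: the function name `predownload`, the default suffix, and the timeout are hypothetical, and only the Python standard library is used.

```python
import os
import shutil
import tempfile
import urllib.request


def predownload(source, suffix='.mp4', timeout=30):
    """Return a local path for `source`, downloading it first if needed.

    Hypothetical helper: a path that already exists locally is returned
    unchanged; anything else is treated as a URL and copied into a
    temporary file, so the video reader only ever opens a local file.
    """
    if os.path.exists(source):
        return source
    fd, local_path = tempfile.mkstemp(suffix=suffix)
    with os.fdopen(fd, 'wb') as out:
        with urllib.request.urlopen(source, timeout=timeout) as resp:
            shutil.copyfileobj(resp, out)
    return local_path
```

With this shape, a transient network failure surfaces as a download error before inference starts, rather than as a decoder failure partway through the video. The caller is responsible for deleting the temporary file afterwards.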
mulin.lyh
7039e93c99 skip temp failed case 2022-12-01 16:50:09 +08:00
yuze.zyz
bb5512d1ab [to #42322933] Refactor NLP and fix some user feedbacks
1. Abstract keys of dicts needed by nlp metric classes into the init method
2. Add Preprocessor.save_pretrained to save preprocessor information
3. Abstract the config saving function, so that the config is saved correctly both when from_pretrained is called directly and when cfg is modified step by step during training.
4. Remove SbertTokenizer and VecoTokenizer, use transformers' tokenizers instead
5. Use model/preprocessor's from_pretrained in all nlp pipeline classes.
6. Add model_kwargs and preprocessor_kwargs in all nlp pipeline classes
7. Add base classes for fill-mask and text-classification preprocessor, as a demo for later changes
8. Fix user feedback: Re-train the model in continue training scenario
9. Fix user feedback: Too many checkpoints saved
10. Simplify the nlp-trainer
11. Fix user feedback: Split the default trainer's __init__ method, making it easier for users to override
12. Add safe_get to Config class

----------------------------  Another refactor from version 36 -------------------------

13. Name all nlp transformers' preprocessors from TaskNamePreprocessor to TaskNameTransformersPreprocessor, for example:
      TextClassificationPreprocessor -> TextClassificationTransformersPreprocessor
14. Add a base class per task for all nlp tasks' preprocessors which has at least two sub-preprocessors
15. Add output classes of nlp models
16. Refactor the logic for token-classification
17. Fix bug: checkpoint_hook does not support pytorch_model.pt
18. Fix bug: Pipeline name does not match with task name, so inference will not succeed after training
       NOTE: This is only a stopgap solution; the root cause is the unclear relationship between models and pipelines
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10723513

    * add save_pretrained to preprocessor

* save preprocessor config in hook

* refactor label-id mapping fetching logic

* test ok on sentence-similarity

* run on finetuning

* fix bug

* pre-commit passed

* fix bug

* Merge branch 'master' into feat/refactor_config

# Conflicts:
#	modelscope/preprocessors/nlp/nlp_base.py

* add params to init

* 1. support max ckpt num 2. support ignoring others but bin file in continue training 3. add arguments to some nlp metrics

* Split trainer init impls to overridable methods

* remove some obsolete tokenizers

* unfinished

* support input params in pipeline

* fix bugs

* fix ut bug

* fix bug

* fix ut bug

* fix ut bug

* fix ut bug

* add base class for some preprocessors

* Merge commit '379867739548f394d0fa349ba07afe04adf4c8b6' into feat/refactor_config

* compatible with old code

* fix ut bug

* fix ut bugs

* fix bug

* add some comments

* fix ut bug

* add a requirement

* fix pre-commit

* Merge commit '0451b3d3cb2bebfef92ec2c227b2a3dd8d01dc6a' into feat/refactor_config

* fixbug

* Support function type in registry

* fix ut bug

* fix bug

* Merge commit '5f719e542b963f0d35457e5359df879a5eb80b82' into feat/refactor_config

# Conflicts:
#	modelscope/pipelines/nlp/multilingual_word_segmentation_pipeline.py
#	modelscope/pipelines/nlp/named_entity_recognition_pipeline.py
#	modelscope/pipelines/nlp/word_segmentation_pipeline.py
#	modelscope/utils/hub.py

* remove obsolete file

* rename init args

* rename params

* fix merge bug

* add default preprocessor config for ner-model

* move a method to a util file

* remove unused config

* Fix a bug in pbar

* bestckptsaver:change default ckpt numbers to 1

* 1. Add assert to max_epoch 2. split init_dist and get_device 3. change cmp func name

* Fix bug

* fix bug

* fix bug

* unfinished refactoring

* unfinished

* uw

* uw

* uw

* uw

* Merge branch 'feat/refactor_config' into feat/refactor_trainer

# Conflicts:
#	modelscope/preprocessors/nlp/document_segmentation_preprocessor.py
#	modelscope/preprocessors/nlp/faq_question_answering_preprocessor.py
#	modelscope/preprocessors/nlp/relation_extraction_preprocessor.py
#	modelscope/preprocessors/nlp/text_generation_preprocessor.py

* uw

* uw

* unify nlp task outputs

* uw

* uw

* uw

* uw

* change the order of text cls pipeline

* refactor t5

* refactor tg task preprocessor

* fix

* unfinished

* temp

* refactor code

* unfinished

* unfinished

* unfinished

* unfinished

* uw

* Merge branch 'feat/refactor_config' into feat/refactor_trainer

* smoke test pass

* ut testing

* pre-commit passed

* Merge branch 'master' into feat/refactor_config

# Conflicts:
#	modelscope/models/nlp/bert/document_segmentation.py
#	modelscope/pipelines/nlp/__init__.py
#	modelscope/pipelines/nlp/document_segmentation_pipeline.py

* merge master

* unfinished

* Merge branch 'feat/fix_bug_pipeline_name' into feat/refactor_config

* fix bug

* fix ut bug

* support ner batch inference

* fix ut bug

* fix bug

* support batch inference on three nlp tasks

* unfinished

* fix bug

* fix bug

* Merge branch 'master' into feat/refactor_config

# Conflicts:
#	modelscope/models/base/base_model.py
#	modelscope/pipelines/nlp/conversational_text_to_sql_pipeline.py
#	modelscope/pipelines/nlp/dialog_intent_prediction_pipeline.py
#	modelscope/pipelines/nlp/dialog_modeling_pipeline.py
#	modelscope/pipelines/nlp/dialog_state_tracking_pipeline.py
#	modelscope/pipelines/nlp/document_segmentation_pipeline.py
#	modelscope/pipelines/nlp/faq_question_answering_pipeline.py
#	modelscope/pipelines/nlp/feature_extraction_pipeline.py
#	modelscope/pipelines/nlp/fill_mask_pipeline.py
#	modelscope/pipelines/nlp/information_extraction_pipeline.py
#	modelscope/pipelines/nlp/named_entity_recognition_pipeline.py
#	modelscope/pipelines/nlp/sentence_embedding_pipeline.py
#	modelscope/pipelines/nlp/summarization_pipeline.py
#	modelscope/pipelines/nlp/table_question_answering_pipeline.py
#	modelscope/pipelines/nlp/text2text_generation_pipeline.py
#	modelscope/pipelines/nlp/text_classification_pipeline.py
#	modelscope/pipelines/nlp/text_error_correction_pipeline.py
#	modelscope/pipelines/nlp/text_generation_pipeline.py
#	modelscope/pipelines/nlp/text_ranking_pipeline.py
#	modelscope/pipelines/nlp/token_classification_pipeline.py
#	modelscope/pipelines/nlp/word_segmentation_pipeline.py
#	modelscope/pipelines/nlp/zero_shot_classification_pipeline.py
#	modelscope/trainers/nlp_trainer.py

* pre-commit passed

* fix bug

* Merge branch 'master' into feat/refactor_config

# Conflicts:
#	modelscope/preprocessors/__init__.py

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fixbug

* pre-commit passed

* fix bug

* fixbug

* fix bug

* fix bug

* fix bug

* fix bug

* self review done

* fixbug

* fix bug

* fix bug

* fix bugs

* remove sub-token offset mapping

* fix name bug

* add some tests

* 1. support batch inference of text-generation,text2text-generation,token-classification,text-classification 2. add corresponding UTs

* add old logic back

* tmp save

* add tokenize by words logic back

* move outputs file back

* revert veco token-classification back

* fix typo

* Fix description

* Merge commit '4dd99b8f6e4e7aefe047c68a1bedd95d3ec596d6' into feat/refactor_config

* Merge branch 'master' into feat/refactor_config

# Conflicts:
#	modelscope/pipelines/builder.py
2022-11-30 23:52:17 +08:00