98 Commits

Author SHA1 Message Date
liugao.lg
40b6770956 [to #42322933]fix ocr prepreocess & conflict
修复ocr预处理逻辑不一致问题
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10581697
2022-11-01 10:22:11 +08:00
zhangzhicheng.zzc
0d3b7b0df2 [to #42322933]fix bugs relate to token cls
1.修复token classification preprocessor finetune结果错误问题
2.修复word segmentation output 无用属性
3. 修复nlp preprocessor传use_fast错误
4. 修复torch model exporter bug
5. 修复文档撰写过程中发现trainer相关bug
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10573269
2022-10-31 20:52:27 +08:00
yichang.zyc
e2d35fbb14 [to #42322933]clip支持finetune
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10572842
2022-10-30 21:51:11 +08:00
Yingda Chen
9f7b8b86a3 [to #42322933] disble 2dkeypoints training since face_2d_keypoints_dataset is set to be private 2022-10-30 13:59:12 +08:00
yuze.zyz
4b7e8e89aa [to #42322933] Fix some bugs when downgrade the version of some dependencies
1. Fix bug in model exporting
2. Skip some long trainings in test level 2
3. Refine some comments
4. Fix a bug that mode is not correct when saving checkpoints
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10564716
2022-10-28 21:44:33 +08:00
Yingda Chen
46cfa177aa [to #42322933]skip timeconsuming test 2022-10-28 09:34:29 +08:00
Yingda Chen
374fd3090e [to #42322933]skip referring video tests since model is private 2022-10-27 20:23:51 +08:00
yuze.zyz
212cf53318 [to #42322933] Fix some bugs
1. Add F1 score to sequence classification metric
2. Fix a bug that the evaluate method in trainer does not support a pure pytorch_model.bin
3. Fix a bug in evaluation of veco trainer 
4. Add some tips if lr_scheduler in the trainer needs a higher version torch
5. Add some comments
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10532230
2022-10-27 19:49:21 +08:00
shuying.shu
ddcb57440d [to #42322933]add fine-tune code for referring video object segmentation
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10539423
2022-10-27 19:43:54 +08:00
eniac.xcw
8886c3c1ae [to #42322933]fine tune team on caltech-101
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10525413
2022-10-27 12:00:14 +08:00
yingda.chen
de708dd518 add basic remap column wrapper
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10539917

    * add basic remap column wrapper
2022-10-27 10:12:05 +08:00
hemu.zp
69104c0f8a [to #42322933] Refactor text generation model outputs and fix some bugs
1. 将 single_gpu_test 与 multi_gpu_test 中的 model.forward 部分分离为 EpochBasedTrainer 中的 evaluation_step,为部分 evaluation 阶段不调用 forward 的模型提供更好的灵活性
2. 重构代码将文本生成模型 Model 层的输入输出统一为 Tensor,Tensor 到 str 的 decode 过程移动到 pipeline 中完成
3. pipeline 后处理添加对中文和中文标点与英文混杂时空格的处理,使 decode 后中英文混杂输出正确
4. 添加 TextGenerationTrainer 修复了部分模型 evaluation 过程 forward 输出单个 token 计算 metrics 的问题
5. 修复了 rouge 无法接收空字符串的问题
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10473768
2022-10-27 09:52:05 +08:00
liugao.lg
0605376135 [to #42322933]add ofa finetune
新增ofa的finetune能力
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10540701
2022-10-27 09:29:06 +08:00
zhangyanzhao.zyz
781fe49d63 [to #42322933]修正finetune text ranking bugs
之前的finetune代码当dataset最后长度不足制定batch size时会出错,现已修正
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10524066
2022-10-26 09:44:25 +08:00
yuze.zyz
c2da44b371 [to #42322933] remove dev model inference and fix some bugs
1. Change structbert dev revision to master revision
2. Fix bug:  Sample code failed because the updating of model configuration
3. Fix bug: Continue training regression failed
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10519992
2022-10-25 22:38:49 +08:00
yichang.zyc
6ddafb3218 [to #42322933]caption finetune done, add belu metric
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10318299
2022-10-25 12:55:41 +08:00
yuze.zyz
605cd7f44a [to #42322933] NLP 1030 Refactor
Features:
1. Refactor the directory structure of nlp models. All model files are placed into either the model folder or the task_model folder
2. Refactor all the comments to google style
3. Add detail comments to important tasks and nlp models, to list the description of the model, and its preprocessor&trainer
4. Model Exporting now supports a direct all to TorchModelExporter(no need to derive from it)
5. Refactor model save_pretrained method to support direct running(independent from trainer)
6. Remove the judgement of Model in the pipeline base class, to support outer register models running in our pipelines
7. Nlp trainer now has a NLPTrainingArguments class , user can pass arguments into the dataclass, and use it as a normal cfg_modify_fn, to simplify the operation of modify cfg.
8. Merge the BACKBONES and the MODELS, so user can get a backbone with the Model.from_pretrained call
9. Model.from_pretrained now support a task argument, so user can use a backbone and load it with a specific task class.
10. Support Preprocessor.from_pretrained method
11. Add standard return classes to important nlp tasks, so some of the pipelines and the models are independent now, the return values of the models will always be tensors, and the pipelines will take care of the conversion to numpy and the following stuffs.
12. Split the file of the nlp preprocessors, to make the dir structure more clear.

Bugs Fixing:
1. Fix a bug that lr_scheduler can be called earlier than the optimizer's step
2. Fix a bug that the direct call of Pipelines (not from pipeline(xxx)) throws error
3. Fix a bug that the trainer will not call the correct TaskDataset class
4. Fix a bug that the internal loading of dataset will throws error in the trainer class
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10490585
2022-10-25 12:26:25 +08:00
bin.xue
525fa3ea89 [to #42322933]test: use 'master' branch in training test
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10509580
2022-10-25 12:10:07 +08:00
huizheng.hz
a1738690c9 [to #42322933]test_image_denoise_trainer
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10465138
2022-10-25 10:08:57 +08:00
yingda.chen
de7b6a06e9 [to #42322933] remove revision usage for face detection
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10507910

    * [to #42322933] remove revision usage for face detection
2022-10-25 09:28:01 +08:00
zhangyanzhao.zyz
c4dbb69d65 [to #42322933]增加对text-ranking任务中文模型的单元测试,以方便得到官方模型打标。
增加对text-ranking任务中文模型的单元测试,以方便得到官方模型打标。
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10492754
2022-10-24 23:41:20 +08:00
yuze.zyz
707cbef013 [to #42322933]Fix bug in daily UT
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10491891
2022-10-22 23:25:18 +08:00
zhangyanzhao.zyz
824ee8232c [to #42322933]更新语义相关性任务英文名称为text ranking,修改对应变量名和类名
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10491951
2022-10-22 17:12:48 +08:00
Yingda Chen
dee93c40e2 [to #42322933] force download dataset for portraint enhancement 2022-10-22 16:28:30 +08:00
baiguan.yt
533ab3df63 [to #42322933]update msdatasets for image-portrait-enhancement training
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10453584
2022-10-21 14:54:24 +08:00
yuze.zyz
acba1786b0 [to #42322933] Fix bug in UT daily
1. Fix bugs in daily test
2. Fix a bug that the updating of lr is before the first time of updating of optimizer
    TODO this will still cause warnings when GA is above 1
3. Remove the judgement of mode in text-classification's preprocessor to fit the base trainer(Bug)
     Update some regression bins to fit the preprocessor
4. Update the regression tool to let outer code modify atol and rtol
5. Add the default metric for text-classification task
6. Remove the useless ckpt conversion method in bert to avoid the requirement of tf when loading modeling_bert
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10430764
2022-10-20 15:29:34 +08:00
shouzhou.bx
01d521dd78 [to #42322933]add face 2d keypoints finetune test case
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10421808

    * add face 2d keypoints & human wholebody keypoint finrtune test case
2022-10-20 11:51:52 +08:00
Yingda Chen
089cadab4b [to #42322933] disable unstable trainer test 2022-10-20 08:51:25 +08:00
huizheng.hz
c0b546a96e [to #42322933]add subset_name when loading dataset (NAFNet image denoising)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10427797
2022-10-18 14:34:26 +08:00
hanyuan.chy
2d50c812df [to #42322933] support finetune on cv/hand_2d_keypoints
添加2d手部关键点检测finetune功能
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10371710
2022-10-13 13:48:11 +08:00
bin.xue
3863efc14d [to #42322933] add far field KWS trainer
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10275823
2022-10-13 10:15:33 +08:00
yuxiang.tyx
2989492bc0 [to #42322933]更新face_detection_scrfd模型并支持finetune, 新增card_detection模型
1. 调整face_detection的文件层级(scrfd与其余新增face_detection方法平级);
2. 增加极大脸/旋转脸的检测方法,更新了新模型;
3. 支持读入数据集并finetune和eval;
4. 新增card_detection模型,支持读入datasethub数据集并finetune
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10244540
2022-10-12 19:58:50 +08:00
huizheng.hz
922f4c589b [to #42322933]图像去噪using msdataset to load dataset
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10338265
2022-10-11 22:46:30 +08:00
ashui.cbh
69da8f91ac [to #42322933]suport image inpainting
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10111615
2022-10-11 20:49:13 +08:00
laiyin.lyc
09d2296f36 [to #44847108] add sparsity hook (pst algorithm)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10198228

    * [to #44847108] add sparsity hook (pst algorithm)
2022-10-11 16:05:20 +08:00
yuze.zyz
357a233ee3 [to #42322933] fix bug: checkpoint hook and bestckpthook exists at the same time
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10227608
2022-09-27 23:03:00 +08:00
jiangnana.jnn
1794e08af7 fix dist training
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10185634

    * fix dist training
2022-09-21 17:47:50 +08:00
wenmeng.zwm
6808e9a301 [to #44902099] add license for framework files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10189613
2022-09-20 17:49:31 +08:00
yuze.zyz
4cdd0c23eb [to #42322933] Refactor and fix some bugs
1. Fix a bug in trainer's progress bar
2. Fix a bug that trainer does not support dataset in config file
3. Add feature: support go on training via checkpoint file
4. Add feature: support fixed filename when saving best checkpoint
5. Fix a bug that no id2label in config file after finetune of nlp models
6. Fix some other bugs
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10138906
2022-09-19 17:05:35 +08:00
wenmeng.zwm
7fb25d7bbb [to #42322933]fix UT error for 830 version
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10142442
2022-09-16 22:42:39 +08:00
jiangnana.jnn
adee5d10ae update EasyCV MsDataset
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10103248

    * update EasyCV MSDataset
2022-09-15 18:11:03 +08:00
dingkun.ldk
54e1a6d88b [to #42322933]830NLP 篇章排序/文本表示模型代码check
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9856179
2022-09-10 15:59:56 +08:00
ly119399
b41b10f897 [to #42322933] space finetune on generation task
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10061562
2022-09-09 14:27:08 +08:00
jiangnana.jnn
5e176da3a1 adapt to msdataset for EasyCV
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9935664

    * adapt to msdataset for EasyCV
2022-09-09 10:01:51 +08:00
cyc385202
d4759e4c24 [to #42322933] 加入space模型在banking数据集上的finetune代码
加入space模型在banking数据集上的微调代码
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10006792
2022-09-08 13:45:14 +08:00
yuze.zyz
01e768503c [to #42322933] Fix random seed for trainer
1. Fix random seed for trainer and init it at the first line of init
2. Add a regress test for fixed training
3. Change the dataset 'dureader_robust_qg' to 'DuReader_robust-QG'
4. Change some datasets from loading hf.datasets to loading msdataset.load
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10029509
2022-09-06 15:13:14 +08:00
hemu.zp
3d3f9b4537 [to #42322933] fix checkpoint format
1. 修复 palm,gpt3,mplug 模型存在的 finetune 后保存 checkpoint 与原有 checkpoint key 字段存在区别无法使用 from_pretrained 导入的问题
2. 调整 test_finetune_mplug.py 为只保存训练结束时的 checkpoint,减少 ci 耗时
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10016517
2022-09-05 17:51:22 +08:00
bin.xue
b870e4eed5 [to #42322933] test: use custom config to reduce test time
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10011826
2022-09-05 13:26:30 +08:00
yuze.zyz
a9c14e4ead [to #42322933] Support saving the best checkpoint for inference
1. Support saving the best checkpoint for inference
2. Fix a bug that _max_iters field does not exist in trainer
3. Fix a bug that function in lambda_lr field cannot be saved to file
4. Fix a bug that save_pretrained would not be called by iterating
5. Fix a bug that interval is not passed from BestCkptHook's init
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9972765
2022-09-05 11:07:48 +08:00
Yingda Chen
39a309b655 [to #42322933] reduce train epoch from 3 to w 2022-09-03 12:18:52 +08:00