Commit Graph

1195 Commits

Author SHA1 Message Date
suluyana
9ece90ee84 skip plugin test case 2023-07-29 21:35:21 +08:00
suluyan.sly
05e1357c32 Merge branch 'master-github' into master-merge-github-230728 2023-07-28 16:40:34 +08:00
wenmeng.zwm
3b485d5835 fix plugin python module missing files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749
* fix plugin python module missing files
2023-07-28 16:14:56 +08:00
frozoul
2566d028cd cv/cv nerf 3d reconstruction 4k nerf damo (#389)
* add 4k-nerf core files

* update configure file

* update dataloader and model path

* update unittest

* Delete test_4k.py

* update unittest

* update unittest

* update pre-commit

* update dataloader

* update cuda code path

* check with pre-commit

---------

Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com>
Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>
2023-07-28 10:37:13 +08:00
tongmu.wh
475924a421 correct language recognition taks name
modelscope平台同学最终定下语种识别任务名为speech-language-recognition,对应进行代码中的相关改动
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517
* correct language recognition taks name
2023-07-27 21:03:04 +08:00
suluyan.sly
1c6f5fe775 Merge branch 'master-github' into master-merge-github-230727
Conflicts:
       examples/pytorch/baichuan/finetune_baichuan.py
       examples/pytorch/chatglm6b/finetune.py
2023-07-27 17:29:27 +08:00
mengyang.fmy
18f998a85c add text-to-360pano-image pipeline, mod cv requirements
7月份计划上线的360全景图生成模型,自研

模型权重文件地址https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary


#### 依赖项说明

##### 由于要使用xformers,torch版本最好使用1.13.1
```
pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
```
##### 对应的diffusers和xformers版本如下
```
pip install -U diffusers==0.18.0
pip install xformers==0.0.16
pip install triton, accelerate, transformers
```

##### ModelScope Library 需要使用cv
```
pip install modelscope
pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
```

##### 此外,还需要安装第三方的一个库,Real-ESRGAN, 安装方法如下
```
# Install basicsr - https://github.com/xinntao/BasicSR
# We use BasicSR for both training and inference
pip install basicsr
# facexlib and gfpgan are for face enhancement
pip install facexlib
pip install gfpgan
pip install Pillow
pip install tqdm
pip install realesrgan==0.3.0
```
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430
* add text-to-360pano-image pipeline

* add text-to-360pano-image pipeline, mod cv requirements

* rm redundant files and cv requirements; add standard input and output definations

* fix diffusers==0.18.0 and run test

* fix diffusers==0.18.0 in multi-modal and run test again

* add model_revision='v1.0.0'

* fix yapf

* add trycatch for enabling xformers

* fix key error

* add install xformers in test/setup

* skip highres.fix in ci

* feat: Fix conflict, auto commit by WebIDE
2023-07-27 11:33:39 +08:00
Wang Qiang
66cf72a75c Merge pull request #376 from XDUWQ/custom_diffusion
Custom method for finetuning stable diffusion
2023-07-27 10:41:38 +08:00
Zackary Shen
ba4db97507 upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407)
* add vq_compression model

* add vq_compression model

* check pre-commit for lint test

* fix by flake8

* update

* update

* update

* the last update

* the laast update

* update test_level>=0

---------

Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>
2023-07-26 17:20:13 +08:00
tongmu.wh
ba1a333ba6 add language recognition pipelines and models
新增语种识别pipeline和model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083
* add language recognition pipelines and models

* add a clustering method for speaker diarization

* define input and output type for language recognition
2023-07-25 21:07:56 +08:00
zeyinzi.jzyz
672c4899e9 add sd swift tuner
SD-Tuner base on Swift (LoRA/Adapter/Prompt)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798

* sd swift tuner

* fix pre-checker
2023-07-25 19:00:49 +08:00
shuli.cly
526e1371f5 Merge the speaker-turn-detection codes, local test finished
# Speaker Diarization Speaker-Turn Detection CR

和Dialogue-Detection一样,本模型是Speaker Diarization(`audio/speaker diarization`,语音/说话人日志)任务下的一个子模块。

本次提交的是基于文本进行判断的模型,本地模型的初始模型基于huggingface训练的,此提交中复用了部分 `nlp/token-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦,在model、pipeline以及preprocessor中 **单独** 创建了相应模块并重新register。
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720
* std first commit

* local test pass for speaker-turn-detection

* update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts
2023-07-25 18:57:47 +08:00
hemu.zp
80f76ca475 Support stream output for transformers model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136
* support stream for transformers model

* set test_level >= 2

* support hf model and chatglm2

* remove streaming_output for chatglm2
2023-07-25 17:41:32 +08:00
wenmeng zhou
64203e89ee Compatibility for huggingface transformers (#391) 2023-07-24 20:53:27 +08:00
XDUWQ
8e00d85317 fix bugs 2023-07-24 19:46:22 +08:00
lylalala
f805d86aed llama2 support chat (#404)
* support chat

* update llama2 chat testcase

* add gen kwargs and devices

* update unittest and support max_length in multi-turn dialogue
2023-07-24 15:38:01 +08:00
tingwei.gtw
d16522723a [to #42322933] add files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565
* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* update test data

* [to #42322933] add files

* Merge remote-tracking branch 'origin' into feature/sal_try_on

* [to #42322933] add files

* Merge remote-tracking branch 'origin' into feature/sal_try_on
2023-07-24 10:16:29 +08:00
mushenL
f77237b049 add llama2 pipeline (#399)
* Modify the parameter passing of the text_generation_pipeline class

* add llama2 pipeline

* add llama pipeline v1.1

* add llama pipeline v1.2

* add llama pipeline v1.3

* add llama pipeline v1.0.4
2023-07-22 21:53:04 +08:00
shuli.cly
13e345f6d9 add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection
# Speaker Diarization Dialogue Detection CR

本模型是Speaker Diarization(`audio/speaker diarization`,语音/说话人日志)任务下的一个子模块。

本次提交的是基于文本进行判断的模型,其IO和中间过程和 `nlp/text-classification` 很像,且本地模型的初始模型也是基于huggingface训练的,因此此提交中复用了部分 `nlp/text-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦,在model、pipeline以及preprocessor中 **单独** 创建了相应模块并重新register。
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649
* start to add speaker_diarization_dialogue_detection files; Need to change constant and test

* add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection

* update test case

* add comments for speaker diarization dialogue detection pipelines

* add outputs type and inputs type for speaker_diarization_dialogue_detection
2023-07-20 19:29:59 +08:00
shenweichao.swc
05c65ba225 add s2net for panorama_depth_estimation
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819
* add s2net codes

* fix sphdecoder

* Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation

* revise comments in the pipeline

* revise the code
2023-07-20 19:28:03 +08:00
xiangpeng.wxp
4085d821f3 [to #42322933] add polylm, a polyglot large language model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595
2023-07-20 18:21:07 +08:00
XDUWQ
66795aa3ff change tests level 2023-07-19 09:41:21 +08:00
Wang Qiang
0b85979f2e Update diffusers version to 0.18.0 (#377)
* update diffusers to 0.18.0

* fix bugs
2023-07-14 19:02:52 +08:00
baiguan.yt
ceac129c6b add parameters height and width for text-to-video
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907
2023-07-14 16:22:10 +08:00
XDUWQ
34ab717393 custom_diffusion 2023-07-12 19:47:32 +08:00
XDUWQ
1caa45422c custom diffusion 2023-07-11 20:46:32 +08:00
yeqinghao.yqh
41cbb8e393 mPLUG-Owl 生成长度Bug修复
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284
2023-07-10 18:56:12 +08:00
tongmu.wh
a7f7a67855 fix details of speaker models
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011
* fix details of speaker models
2023-07-10 18:54:26 +08:00
chenyafeng.cyf
543d03e32b 3dspeaker
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180
2023-07-07 15:00:35 +08:00
wenmeng.zwm
0271b9c256 Merge branch 'master-github' into merge_master_github_0628 2023-06-28 20:27:34 +08:00
Wang Qiang
a018cd6107 Dreambooth method for finetuning stable diffusions (#339)
* Copyright

* dreambooth

* dreambooth test trainer

* fix bugs

* pre-commit

---------

Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>
2023-06-28 20:10:28 +08:00
mulin.lyh
1ea9b58447 fix torch2.0 compatible issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13086361

* fix face-aligment compatible

* fix torch2.0 compatible issue
2023-06-28 14:15:48 +08:00
Wang Qiang
6942144ad7 Stable Diffusion model checkpoint export to onnx. (#340)
* stable diffusion export onnx

* fix pre commit bugs

* fix bugs

* safety checker support

* test export stable diffusion
2023-06-28 13:26:19 +08:00
yuze.zyz
8f18274f75 Add teardown for tests
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12643554

* add teardown for tests

* add teardown for dialog_modeling_trainer,document_grounded_dialog_generate_trainer,document_grounded_dialog_rerank_trainer,document_grounded_dialog_retrieval_trainer,training_args,translation_evaluation_trainer,translation_trainer
2023-06-28 09:44:44 +08:00
mulin.lyh
eb0f0216c6 fix torch 2.x compatible issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13045011

* fix torch 2.x compatible issue

* fix torch 2.x compatible issue

* fix complex-valued input tensor matching the output from stft with return_complex=True.

* skip plugin test temporarily for modify torch version

* fix test_speech_signal_process.py compatible issue

* fix lint issue

* upgrade funasr to 0.6.5
2023-06-27 14:40:51 +08:00
yuze.zyz
a58be34384 Add Lora/Adapter/Prompt and support for chatglm6B and chatglm2-6B
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12770413

* add prompt and lora

* add adapter

* add prefix

* add tests

* adapter smoke test passed

* prompt test passed

* support model id in petl

* migrate chatglm6b

* add train script for chatglm6b

* move gen_kwargs to finetune.py

* add chatglm2

* add model definination
2023-06-27 14:38:18 +08:00
xingjun.wxj
1dbff6cb48 Support jsonl format in meta data
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13071970

* support jsonl in meta


* add UT and refine fetch_meta_files_from_url
2023-06-27 11:58:19 +08:00
wenmeng.zwm
9e51920fdb Merge branch 'master-github' into merge_master_github_0626 2023-06-26 21:04:05 +08:00
wenmeng zhou
6dea1d5646 Fix/citest timeout (#308)
* timeout for citest set to 240min

* update docker image

* fix ci template not packed in whl

* update docker image version to 1.6.1 and add python3.8 support

* randome choose a model for controlnet to avoid oom
2023-06-26 11:23:10 +08:00
mulin.lyh
698c794070 [to #50537864]fix: fix select case issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13055098
2023-06-25 22:44:29 +08:00
chenyafeng.cyf
29062d9f94 eres2net_aug v2
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13046524

* eres2net_aug v2
2023-06-25 18:07:04 +08:00
tongmu.wh
f03c93cda5 add speaker diarization pipeline and improve some speaker pipelines
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12891685

* add new speaker diarization pipeline. improve the capability of speaker pipelines
2023-06-21 17:56:05 +08:00
xingjun.wxj
0db0ec5586 Merge code from github
1. Merge(add) daily regression from github PR (daily_regression.yaml)
2. Add lora stable diffusion from github PR
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13010802
* fix: device arg not work, rename device to ngpu (#272)

* Correcting the lora stable diffusion example script (#300)

* add vad model and punc model in README.md 

add vad model and punc model

* Merge pull request #302 from modelscope/langgz-patch-1

add vad model and punc model in README.md

* add 1.6

* modify ignore

* Merge pull request #307 from modelscope/dev_rs_16

Merge release 1.6

* undo datetime to 2099

* Merge pull request #311 from modelscope/fix_master_version

undo datetime to 2099

* add daily regression workflow

* modify workflow name

* fix cron format issue

* lora trainer

* Merge pull request #315 from liuyhwangyh/add_regression_workflow

add daily regression workflow
2023-06-21 10:22:06 +08:00
xingjun.wxj
cc3c384d5e Fix issues for downloading mplug-youku dataset
1. Optimize downloading meta-csv files for large-scale dataset like mPLUG-youku (> 1GB for meta csv mapping)
2. Add head and overall progress bar for NativeIterableDataset
3. Modify the try-catch info for oss_utils
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12952842
2023-06-15 15:42:21 +08:00
hemu.zp
96c2d42f09 Add StreamingMixin
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12445731
* StreamingMixin poc

* update design

* Merge branch 'master' into feat/StreamingMixin

* add dicstr

* make postprocessor input consistent
2023-06-08 19:40:14 +08:00
mulin.lyh
25a7398aea docker file py38 and py37 compatible merge
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12722824
* debug

* add missing deps

* add stanza

* fix numpy issue

* numpy version

* tf case

* numpy <= 1.22.0

* fix tf 1.15.5 case

* add python38 and python37 compatible docker file

* cv add fvcore

* add easycv

* remove debug code

* fix style issue

* fix python3.7 python3.8 compatible issue

* comment

* modify run config

* scipy<=1.7.3

* revert scipy<=1.7.3

* fix compatible bug

* fix compatible bug

* add easycv

* update version: 1.6.1rc0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942
fix python3.8 and python3.8 compatible issue

* numpy version change

* numpy version

* numpy version

* fix numpy version

* fix numpy version

* update version: 1.6.1rc0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942
fix python3.8 and python3.8 compatible issue

* restore setup.py

* restore setup.py

* add build base image

* fix style

* numpy version fix

* optimize build image, split base and modelscope image

* fix style issue

* modify example path

* modify examples folder

* add SETUPTOOLS_USE_DISTUTILS=stdlib comments
2023-06-08 10:01:58 +08:00
xixing.tj
1b7e0f50f4 add ocr detection new model db-nas
新增5M的DB-NAS ocr detection 文字检测模型
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12760623
* add ocr detection new model db-nas

* add comment
2023-05-31 21:32:46 +08:00
yuanzhi.zyz
10c39b5ce1 add new ocr recognition model (LightweightEdge) and some functions
1. 增加了新轻量化端侧识别模型 LightweightEdge,并把原来CRNN和ConvNextViT的代码整理了
2. 增加batch inference支持
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12787905
2023-05-31 21:16:22 +08:00
yuze.zyz
003c33648a Fix CI: test merge dataset failed
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12771691
* fix CI

* add teardown
2023-05-29 10:32:33 +08:00
xingjun.wxj
5b99bb24fe update version: 1.6.1rc0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942
2023-05-25 20:28:36 +08:00