Commit Graph

1014 Commits

Author SHA1 Message Date
Wang Qiang
78f5e6a8bf Switching model from huggingface to modelscope hub of efficient tuning (#479) 2023-08-18 20:30:04 +08:00
Wang Qiang
f321804ab2 Merge pull request #472 from kangzhao2/baishao_test
Add image2video
2023-08-18 20:29:36 +08:00
kangzhao2
2643d985dc update test_image2video.py 2023-08-18 20:13:20 +08:00
kangzhao2
2605935797 fix pre-commit 2023-08-18 11:47:45 +08:00
kangzhao2
4ca76b0a85 fix comments again 2023-08-17 20:34:43 +08:00
kangzhao2
b8c76a426f fix comments 2023-08-17 20:03:05 +08:00
Wang Qiang
4ed1111d70 Fix bugs of configs file path and duration (#476)
* fix bugs of configs file path and duration

* pre commit

* delete configs

* test videocomposer model version
2023-08-16 21:03:11 +08:00
kangzhao2
90f7a5c6c0 update files 2023-08-16 11:35:15 +08:00
kangzhao2
037e73fe6e baishao 2023-08-15 21:32:30 +08:00
Wang Qiang
ee8afd2d62 VideoComposer: Compositional Video Synthesis with Motion Controllability (#431)
* VideoComposer: Compositional Video Synthesis with Motion Controllability

* videocomposer pipeline

* pre commit

* delete xformers
2023-08-15 12:01:03 +08:00
Jintao
18d33a4825 fix copytree python37 bug (#464)
* fix copytree python37 bug

* add copytree_py37 function
2023-08-14 11:45:33 +08:00
wenmeng zhou
74d8317bb0 fix pipeline check error (#455)
* fix pipeline check error

* update
2023-08-11 15:52:53 +08:00
Ran Zhou
026a9ef227 Add machine reading comprehension model, preprocessor and pipeline (#451)
* Add machine reading comprehension model, preprocessor and pipeline

* fix precommit errors

* Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr

---------

Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>
2023-08-11 13:47:26 +08:00
liuyhwangyh
75a14a36ba get github diff files (#446)
* get github diff files

* add github environment

---------

Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
2023-08-10 16:22:37 +08:00
wenmeng.zwm
725521a2af skip test_text_to_360panorama_image test 2023-08-08 16:41:19 +08:00
lukeming.lkm
b3a61ef6f4 update LICENSE
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119
2023-08-03 11:07:34 +08:00
lukeming.lkm
bd2f70a6eb add quantization in qwen pipelines and relevant unittests
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600

* add quant features

* resolve import

* resolve format

* fix save vocab
2023-08-02 14:05:13 +08:00
lukeming.lkm
33bd74a7be add qwen 7b base and chat
添加QWen 7b base模型和chat模型及相关pipelines
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235

* add qwen 7b base and chat

* fix logger

* update examples, lint test

* add unittest for qwen base and chat

* rename qwen to qwen-7b

* resolve imports and add a registry to text-generation

* reset load model from pretrained

* fix precheck

* skip qwen test case now

* remove strange file
2023-08-02 09:25:21 +08:00
suluyana
b68b90ba15 skip plugin 2023-07-30 00:30:30 +08:00
suluyana
9ece90ee84 skip plugin test case 2023-07-29 21:35:21 +08:00
suluyan.sly
05e1357c32 Merge branch 'master-github' into master-merge-github-230728 2023-07-28 16:40:34 +08:00
wenmeng.zwm
3b485d5835 fix plugin python module missing files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749
* fix plugin python module missing files
2023-07-28 16:14:56 +08:00
frozoul
2566d028cd cv/cv nerf 3d reconstruction 4k nerf damo (#389)
* add 4k-nerf core files

* update configure file

* update dataloader and model path

* update unittest

* Delete test_4k.py

* update unittest

* update unittest

* update pre-commit

* update dataloader

* update cuda code path

* check with pre-commit

---------

Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com>
Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>
2023-07-28 10:37:13 +08:00
tongmu.wh
475924a421 correct language recognition taks name
modelscope平台同学最终定下语种识别任务名为speech-language-recognition,对应进行代码中的相关改动
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517
* correct language recognition taks name
2023-07-27 21:03:04 +08:00
suluyan.sly
1c6f5fe775 Merge branch 'master-github' into master-merge-github-230727
Conflicts:
       examples/pytorch/baichuan/finetune_baichuan.py
       examples/pytorch/chatglm6b/finetune.py
2023-07-27 17:29:27 +08:00
mengyang.fmy
18f998a85c add text-to-360pano-image pipeline, mod cv requirements
7月份计划上线的360全景图生成模型,自研

模型权重文件地址https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary


#### 依赖项说明

##### 由于要使用xformers,torch版本最好使用1.13.1
```
pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
```
##### 对应的diffusers和xformers版本如下
```
pip install -U diffusers==0.18.0
pip install xformers==0.0.16
pip install triton, accelerate, transformers
```

##### ModelScope Library 需要使用cv
```
pip install modelscope
pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
```

##### 此外,还需要安装第三方的一个库,Real-ESRGAN, 安装方法如下
```
# Install basicsr - https://github.com/xinntao/BasicSR
# We use BasicSR for both training and inference
pip install basicsr
# facexlib and gfpgan are for face enhancement
pip install facexlib
pip install gfpgan
pip install Pillow
pip install tqdm
pip install realesrgan==0.3.0
```
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430
* add text-to-360pano-image pipeline

* add text-to-360pano-image pipeline, mod cv requirements

* rm redundant files and cv requirements; add standard input and output definations

* fix diffusers==0.18.0 and run test

* fix diffusers==0.18.0 in multi-modal and run test again

* add model_revision='v1.0.0'

* fix yapf

* add trycatch for enabling xformers

* fix key error

* add install xformers in test/setup

* skip highres.fix in ci

* feat: Fix conflict, auto commit by WebIDE
2023-07-27 11:33:39 +08:00
Wang Qiang
66cf72a75c Merge pull request #376 from XDUWQ/custom_diffusion
Custom method for finetuning stable diffusion
2023-07-27 10:41:38 +08:00
Zackary Shen
ba4db97507 upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407)
* add vq_compression model

* add vq_compression model

* check pre-commit for lint test

* fix by flake8

* update

* update

* update

* the last update

* the laast update

* update test_level>=0

---------

Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>
2023-07-26 17:20:13 +08:00
tongmu.wh
ba1a333ba6 add language recognition pipelines and models
新增语种识别pipeline和model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083
* add language recognition pipelines and models

* add a clustering method for speaker diarization

* define input and output type for language recognition
2023-07-25 21:07:56 +08:00
zeyinzi.jzyz
672c4899e9 add sd swift tuner
SD-Tuner base on Swift (LoRA/Adapter/Prompt)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798

* sd swift tuner

* fix pre-checker
2023-07-25 19:00:49 +08:00
shuli.cly
526e1371f5 Merge the speaker-turn-detection codes, local test finished
# Speaker Diarization Speaker-Turn Detection CR

和Dialogue-Detection一样,本模型是Speaker Diarization(`audio/speaker diarization`,语音/说话人日志)任务下的一个子模块。

本次提交的是基于文本进行判断的模型,本地模型的初始模型基于huggingface训练的,此提交中复用了部分 `nlp/token-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦,在model、pipeline以及preprocessor中 **单独** 创建了相应模块并重新register。
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720
* std first commit

* local test pass for speaker-turn-detection

* update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts
2023-07-25 18:57:47 +08:00
hemu.zp
80f76ca475 Support stream output for transformers model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136
* support stream for transformers model

* set test_level >= 2

* support hf model and chatglm2

* remove streaming_output for chatglm2
2023-07-25 17:41:32 +08:00
wenmeng zhou
64203e89ee Compatibility for huggingface transformers (#391) 2023-07-24 20:53:27 +08:00
XDUWQ
8e00d85317 fix bugs 2023-07-24 19:46:22 +08:00
lylalala
f805d86aed llama2 support chat (#404)
* support chat

* update llama2 chat testcase

* add gen kwargs and devices

* update unittest and support max_length in multi-turn dialogue
2023-07-24 15:38:01 +08:00
tingwei.gtw
d16522723a [to #42322933] add files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565
* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* update test data

* [to #42322933] add files

* Merge remote-tracking branch 'origin' into feature/sal_try_on

* [to #42322933] add files

* Merge remote-tracking branch 'origin' into feature/sal_try_on
2023-07-24 10:16:29 +08:00
mushenL
f77237b049 add llama2 pipeline (#399)
* Modify the parameter passing of the text_generation_pipeline class

* add llama2 pipeline

* add llama pipeline v1.1

* add llama pipeline v1.2

* add llama pipeline v1.3

* add llama pipeline v1.0.4
2023-07-22 21:53:04 +08:00
shuli.cly
13e345f6d9 add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection
# Speaker Diarization Dialogue Detection CR

本模型是Speaker Diarization(`audio/speaker diarization`,语音/说话人日志)任务下的一个子模块。

本次提交的是基于文本进行判断的模型,其IO和中间过程和 `nlp/text-classification` 很像,且本地模型的初始模型也是基于huggingface训练的,因此此提交中复用了部分 `nlp/text-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦,在model、pipeline以及preprocessor中 **单独** 创建了相应模块并重新register。
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649
* start to add speaker_diarization_dialogue_detection files; Need to change constant and test

* add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection

* update test case

* add comments for speaker diarization dialogue detection pipelines

* add outputs type and inputs type for speaker_diarization_dialogue_detection
2023-07-20 19:29:59 +08:00
shenweichao.swc
05c65ba225 add s2net for panorama_depth_estimation
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819
* add s2net codes

* fix sphdecoder

* Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation

* revise comments in the pipeline

* revise the code
2023-07-20 19:28:03 +08:00
xiangpeng.wxp
4085d821f3 [to #42322933] add polylm, a polyglot large language model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595
2023-07-20 18:21:07 +08:00
XDUWQ
66795aa3ff change tests level 2023-07-19 09:41:21 +08:00
Wang Qiang
0b85979f2e Update diffusers version to 0.18.0 (#377)
* update diffusers to 0.18.0

* fix bugs
2023-07-14 19:02:52 +08:00
baiguan.yt
ceac129c6b add parameters height and width for text-to-video
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907
2023-07-14 16:22:10 +08:00
XDUWQ
34ab717393 custom_diffusion 2023-07-12 19:47:32 +08:00
XDUWQ
1caa45422c custom diffusion 2023-07-11 20:46:32 +08:00
yeqinghao.yqh
41cbb8e393 mPLUG-Owl 生成长度Bug修复
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284
2023-07-10 18:56:12 +08:00
tongmu.wh
a7f7a67855 fix details of speaker models
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011
* fix details of speaker models
2023-07-10 18:54:26 +08:00
chenyafeng.cyf
543d03e32b 3dspeaker
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180
2023-07-07 15:00:35 +08:00
wenmeng.zwm
0271b9c256 Merge branch 'master-github' into merge_master_github_0628 2023-06-28 20:27:34 +08:00
Wang Qiang
a018cd6107 Dreambooth method for finetuning stable diffusions (#339)
* Copyright

* dreambooth

* dreambooth test trainer

* fix bugs

* pre-commit

---------

Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>
2023-06-28 20:10:28 +08:00