1158 Commits

Author SHA1 Message Date
Wang Qiang
ee8afd2d62 VideoComposer: Compositional Video Synthesis with Motion Controllability (#431)
* VideoComposer: Compositional Video Synthesis with Motion Controllability

* videocomposer pipeline

* pre commit

* delete xformers
2023-08-15 12:01:03 +08:00
Jintao
18d33a4825 fix copytree python37 bug (#464)
* fix copytree python37 bug

* add copytree_py37 function
2023-08-14 11:45:33 +08:00
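The commit body does not say which bug was hit. One known Python 3.7 limitation is that `shutil.copytree` has no `dirs_exist_ok` parameter (it was added in 3.8), so the call fails when the destination directory already exists. A minimal sketch of a compatible helper, assuming that is the issue; the body of `copytree_py37` below is illustrative and is not taken from the repository:

```python
import os
import shutil


def copytree_py37(src, dst):
    """Recursively copy src into dst, tolerating an existing dst.

    Illustrative stand-in for shutil.copytree(..., dirs_exist_ok=True),
    which is only available from Python 3.8 onward.
    """
    os.makedirs(dst, exist_ok=True)
    for name in os.listdir(src):
        src_path = os.path.join(src, name)
        dst_path = os.path.join(dst, name)
        if os.path.isdir(src_path):
            # Descend into subdirectories, creating them as needed
            copytree_py37(src_path, dst_path)
        else:
            # copy2 preserves file metadata such as modification time
            shutil.copy2(src_path, dst_path)
    return dst
```

Calling the helper twice on the same destination overwrites files in place instead of raising `FileExistsError`.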
wenmeng zhou
74d8317bb0 fix pipeline check error (#455)
* fix pipeline check error

* update
2023-08-11 15:52:53 +08:00
Ran Zhou
026a9ef227 Add machine reading comprehension model, preprocessor and pipeline (#451)
* Add machine reading comprehension model, preprocessor and pipeline

* fix precommit errors

* Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr

---------

Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>
2023-08-11 13:47:26 +08:00
chenyafeng.cyf
33605de759 eres2net_lre_v2
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13602081

* add eres2net_base_large_lre

* eres2net_language_identification

* eres2net_lre_v2
2023-08-10 17:44:43 +08:00
liuyhwangyh
75a14a36ba get github diff files (#446)
* get github diff files

* add github environment

---------

Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
2023-08-10 16:22:37 +08:00
wenmeng.zwm
725521a2af skip test_text_to_360panorama_image test
2023-08-08 16:41:19 +08:00
zsl01670416
b0699fd8e2 support llama2 inputs to device in function generate
Fix an error raised when the inputs and the model were not on the same device. If they differ, the inputs are now moved to the model's device.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13546989
* support llama inputs to device in function generate

* modify test qwen text generation according to github code
2023-08-07 15:41:28 +08:00
lukeming.lkm
9e033104af change use_fast_att and fix bfloat loading
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13522792

* check that flash att is installed even if use_fast_att is set True, and fix bfloat loading

* skip pipeline model placement for quantization

* update unittest for qwen
2023-08-03 15:59:51 +08:00
lukeming.lkm
b3a61ef6f4 update LICENSE
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119
2023-08-03 11:07:34 +08:00
lukeming.lkm
bd2f70a6eb add quantization in qwen pipelines and relevant unittests
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600

* add quant features

* resolve import

* resolve format

* fix save vocab
2023-08-02 14:05:13 +08:00
lukeming.lkm
33bd74a7be add qwen 7b base and chat
Add the QWen 7b base and chat models and the related pipelines
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235

* add qwen 7b base and chat

* fix logger

* update examples, lint test

* add unittest for qwen base and chat

* rename qwen to qwen-7b

* resolve imports and add a registry to text-generation

* reset load model from pretrained

* fix precheck

* skip qwen test case now

* remove strange file
2023-08-02 09:25:21 +08:00
suluyana
b68b90ba15 skip plugin
2023-07-30 00:30:30 +08:00
suluyana
9ece90ee84 skip plugin test case
2023-07-29 21:35:21 +08:00
suluyan.sly
05e1357c32 Merge branch 'master-github' into master-merge-github-230728
2023-07-28 16:40:34 +08:00
wenmeng.zwm
3b485d5835 fix plugin python module missing files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749
* fix plugin python module missing files
2023-07-28 16:14:56 +08:00
frozoul
2566d028cd cv/cv nerf 3d reconstruction 4k nerf damo (#389)
* add 4k-nerf core files

* update configure file

* update dataloader and model path

* update unittest

* Delete test_4k.py

* update unittest

* update unittest

* update pre-commit

* update dataloader

* update cuda code path

* check with pre-commit

---------

Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com>
Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>
2023-07-28 10:37:13 +08:00
tongmu.wh
475924a421 correct language recognition task name
The ModelScope platform team settled on speech-language-recognition as the name of the language recognition task; the code is updated accordingly.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517
* correct language recognition task name
2023-07-27 21:03:04 +08:00
suluyan.sly
1c6f5fe775 Merge branch 'master-github' into master-merge-github-230727
Conflicts:
       examples/pytorch/baichuan/finetune_baichuan.py
       examples/pytorch/chatglm6b/finetune.py
2023-07-27 17:29:27 +08:00
mengyang.fmy
18f998a85c add text-to-360pano-image pipeline, mod cv requirements
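Self-developed 360 panorama image generation model, scheduled for release in July.

Model weights: https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary


#### Dependency notes

##### Because xformers is required, torch 1.13.1 is recommended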
```
pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
```
##### The matching diffusers and xformers versions are as follows
```
pip install -U diffusers==0.18.0
pip install xformers==0.0.16
pip install triton accelerate transformers
```

##### The ModelScope Library requires the cv dependencies
```
pip install modelscope
pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
```

##### In addition, a third-party library, Real-ESRGAN, must be installed as follows
```
# Install basicsr - https://github.com/xinntao/BasicSR
# We use BasicSR for both training and inference
pip install basicsr
# facexlib and gfpgan are for face enhancement
pip install facexlib
pip install gfpgan
pip install Pillow
pip install tqdm
pip install realesrgan==0.3.0
```
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430
* add text-to-360pano-image pipeline

* add text-to-360pano-image pipeline, mod cv requirements

* rm redundant files and cv requirements; add standard input and output definitions

* fix diffusers==0.18.0 and run test

* fix diffusers==0.18.0 in multi-modal and run test again

* add model_revision='v1.0.0'

* fix yapf

* add trycatch for enabling xformers

* fix key error

* add install xformers in test/setup

* skip highres.fix in ci

* feat: Fix conflict, auto commit by WebIDE
2023-07-27 11:33:39 +08:00
Wang Qiang
66cf72a75c Merge pull request #376 from XDUWQ/custom_diffusion
Custom method for finetuning stable diffusion
2023-07-27 10:41:38 +08:00
Zackary Shen
ba4db97507 upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407)
* add vq_compression model

* add vq_compression model

* check pre-commit for lint test

* fix by flake8

* update

* update

* update

* the last update

* the laast update

* update test_level>=0

---------

Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>
2023-07-26 17:20:13 +08:00
tongmu.wh
ba1a333ba6 add language recognition pipelines and models
Add language recognition pipeline and model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083
* add language recognition pipelines and models

* add a clustering method for speaker diarization

* define input and output type for language recognition
2023-07-25 21:07:56 +08:00
zeyinzi.jzyz
672c4899e9 add sd swift tuner
SD-Tuner based on Swift (LoRA/Adapter/Prompt)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798

* sd swift tuner

* fix pre-checker
2023-07-25 19:00:49 +08:00
shuli.cly
526e1371f5 Merge the speaker-turn-detection codes, local test finished
# Speaker Diarization Speaker-Turn Detection CR

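Like Dialogue-Detection, this model is a submodule of the Speaker Diarization task (`audio/speaker diarization`).

This commit adds a model that makes its decision from text. The initial local model was trained with huggingface, and this commit reuses part of the `nlp/token-classification` model code. To ease later maintenance and decouple it from the nlp code, the corresponding modules were created **separately** in model, pipeline, and preprocessor and registered again.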
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720
* std first commit

* local test pass for speaker-turn-detection

* update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts
2023-07-25 18:57:47 +08:00
hemu.zp
80f76ca475 Support stream output for transformers model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136
* support stream for transformers model

* set test_level >= 2

* support hf model and chatglm2

* remove streaming_output for chatglm2
2023-07-25 17:41:32 +08:00
wenmeng zhou
64203e89ee Compatibility for huggingface transformers (#391)
2023-07-24 20:53:27 +08:00
XDUWQ
8e00d85317 fix bugs
2023-07-24 19:46:22 +08:00
lylalala
f805d86aed llama2 support chat (#404)
* support chat

* update llama2 chat testcase

* add gen kwargs and devices

* update unittest and support max_length in multi-turn dialogue
2023-07-24 15:38:01 +08:00
tingwei.gtw
d16522723a [to #42322933] add files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565
* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* [to #42322933] add files

* update test data

* [to #42322933] add files

* Merge remote-tracking branch 'origin' into feature/sal_try_on

* [to #42322933] add files

* Merge remote-tracking branch 'origin' into feature/sal_try_on
2023-07-24 10:16:29 +08:00
mushenL
f77237b049 add llama2 pipeline (#399)
* Modify the parameter passing of the text_generation_pipeline class

* add llama2 pipeline

* add llama pipeline v1.1

* add llama pipeline v1.2

* add llama pipeline v1.3

* add llama pipeline v1.0.4
2023-07-22 21:53:04 +08:00
shuli.cly
13e345f6d9 add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection
# Speaker Diarization Dialogue Detection CR

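This model is a submodule of the Speaker Diarization task (`audio/speaker diarization`).

This commit adds a model that makes its decision from text. Its IO and intermediate steps closely resemble `nlp/text-classification`, and the initial local model was also trained with huggingface, so this commit reuses part of the `nlp/text-classification` model code. To ease later maintenance and decouple it from the nlp code, the corresponding modules were created **separately** in model, pipeline, and preprocessor and registered again.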
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649
* start to add speaker_diarization_dialogue_detection files; Need to change constant and test

* add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection

* update test case

* add comments for speaker diarization dialogue detection pipelines

* add outputs type and inputs type for speaker_diarization_dialogue_detection
2023-07-20 19:29:59 +08:00
shenweichao.swc
05c65ba225 add s2net for panorama_depth_estimation
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819
* add s2net codes

* fix sphdecoder

* Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation

* revise comments in the pipeline

* revise the code
2023-07-20 19:28:03 +08:00
xiangpeng.wxp
4085d821f3 [to #42322933] add polylm, a polyglot large language model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595
2023-07-20 18:21:07 +08:00
XDUWQ
66795aa3ff change tests level
2023-07-19 09:41:21 +08:00
Wang Qiang
0b85979f2e Update diffusers version to 0.18.0 (#377)
* update diffusers to 0.18.0

* fix bugs
2023-07-14 19:02:52 +08:00
baiguan.yt
ceac129c6b add parameters height and width for text-to-video
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907
2023-07-14 16:22:10 +08:00
XDUWQ
34ab717393 custom_diffusion
2023-07-12 19:47:32 +08:00
XDUWQ
1caa45422c custom diffusion
2023-07-11 20:46:32 +08:00
yeqinghao.yqh
41cbb8e393 mPLUG-Owl: fix generation length bug
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284
2023-07-10 18:56:12 +08:00
tongmu.wh
a7f7a67855 fix details of speaker models
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011
* fix details of speaker models
2023-07-10 18:54:26 +08:00
chenyafeng.cyf
543d03e32b 3dspeaker
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180
2023-07-07 15:00:35 +08:00
wenmeng.zwm
0271b9c256 Merge branch 'master-github' into merge_master_github_0628
2023-06-28 20:27:34 +08:00
Wang Qiang
a018cd6107 Dreambooth method for finetuning stable diffusions (#339)
* Copyright

* dreambooth

* dreambooth test trainer

* fix bugs

* pre-commit

---------

Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>
2023-06-28 20:10:28 +08:00
mulin.lyh
1ea9b58447 fix torch2.0 compatibility issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13086361

* fix face-alignment compatibility

* fix torch2.0 compatibility issue
2023-06-28 14:15:48 +08:00
Wang Qiang
6942144ad7 Stable Diffusion model checkpoint export to onnx. (#340)
* stable diffusion export onnx

* fix pre commit bugs

* fix bugs

* safety checker support

* test export stable diffusion
2023-06-28 13:26:19 +08:00
yuze.zyz
8f18274f75 Add teardown for tests
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12643554

* add teardown for tests

* add teardown for dialog_modeling_trainer,document_grounded_dialog_generate_trainer,document_grounded_dialog_rerank_trainer,document_grounded_dialog_retrieval_trainer,training_args,translation_evaluation_trainer,translation_trainer
2023-06-28 09:44:44 +08:00
mulin.lyh
eb0f0216c6 fix torch 2.x compatibility issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13045011

* fix torch 2.x compatibility issue

* fix torch 2.x compatibility issue

* fix complex-valued input tensor matching the output from stft with return_complex=True.

* skip plugin test temporarily while changing the torch version

* fix test_speech_signal_process.py compatibility issue

* fix lint issue

* upgrade funasr to 0.6.5
2023-06-27 14:40:51 +08:00
yuze.zyz
a58be34384 Add Lora/Adapter/Prompt and support for chatglm6B and chatglm2-6B
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12770413

* add prompt and lora

* add adapter

* add prefix

* add tests

* adapter smoke test passed

* prompt test passed

* support model id in petl

* migrate chatglm6b

* add train script for chatglm6b

* move gen_kwargs to finetune.py

* add chatglm2

* add model definition
2023-06-27 14:38:18 +08:00
xingjun.wxj
1dbff6cb48 Support jsonl format in meta data
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13071970

* support jsonl in meta


* add UT and refine fetch_meta_files_from_url
2023-06-27 11:58:19 +08:00
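For reference, the JSONL format stores one JSON object per line. A minimal sketch of parsing such meta data with the standard library; `read_jsonl` is a hypothetical helper for illustration and is not the repository's `fetch_meta_files_from_url`:

```python
import json


def read_jsonl(text):
    """Parse JSONL text into a list of objects, skipping blank lines."""
    records = []
    # Split on "\n" only, so unusual separators inside JSON strings are left alone
    for line in text.split("\n"):
        line = line.strip()
        if line:
            records.append(json.loads(line))
    return records
```

Each non-empty line is decoded independently, so a large meta file could also be processed line by line rather than loaded at once.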