Wang Qiang
ee8afd2d62
VideoComposer: Compositional Video Synthesis with Motion Controllability (#431)
...
* VideoComposer: Compositional Video Synthesis with Motion Controllability
* videocomposer pipeline
* pre commit
* delete xformers
2023-08-15 12:01:03 +08:00
Jintao
18d33a4825
fix copytree python37 bug (#464)
...
* fix copytree python37 bug
* add copytree_py37 function
2023-08-14 11:45:33 +08:00
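For context, `shutil.copytree` only accepts `dirs_exist_ok` from Python 3.8 onward, so on Python 3.7 copying into an existing directory raises `FileExistsError`. The following is a minimal sketch of a fallback, assuming this is the limitation the commit works around. `copytree_compat` is a hypothetical name and is not the `copytree_py37` function added in this commit.

```python
import os
import shutil


def copytree_compat(src, dst):
    """Copy the tree at src into dst, tolerating an existing dst.

    Hypothetical helper for illustration; not the repository's copytree_py37.
    """
    os.makedirs(dst, exist_ok=True)
    for root, _dirs, files in os.walk(src):
        # Mirror the directory layout of src under dst.
        rel = os.path.relpath(root, src)
        target_dir = dst if rel == "." else os.path.join(dst, rel)
        os.makedirs(target_dir, exist_ok=True)
        for name in files:
            # copy2 also preserves file metadata, as copytree does by default.
            shutil.copy2(os.path.join(root, name),
                         os.path.join(target_dir, name))
    return dst
```

Files that already exist in `dst` and are not present in `src` are left in place, which is roughly what `dirs_exist_ok=True` gives on newer Python versions.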
wenmeng zhou
74d8317bb0
fix pipeline check error (#455)
...
* fix pipeline check error
* update
2023-08-11 15:52:53 +08:00
Ran Zhou
026a9ef227
Add machine reading comprehension model, preprocessor and pipeline (#451)
...
* Add machine reading comprehension model, preprocessor and pipeline
* fix precommit errors
* Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr
---------
Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>
2023-08-11 13:47:26 +08:00
chenyafeng.cyf
33605de759
eres2net_lre_v2
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13602081
* add eres2net_base_large_lre
* eres2net_language_identification
* eres2net_lre_v2
2023-08-10 17:44:43 +08:00
liuyhwangyh
75a14a36ba
get github diff files (#446)
...
* get github diff files
* add github environment
---------
Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
2023-08-10 16:22:37 +08:00
wenmeng.zwm
725521a2af
skip test_text_to_360panorama_image test
2023-08-08 16:41:19 +08:00
zsl01670416
b0699fd8e2
support llama2 inputs to device in function generate
...
Fix the error raised when the inputs and the model are not on the same device: if the devices differ, the inputs are moved to the model's device.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13546989
* support llama inputs to device in function generate
* modify test qwen text generation according to github code
2023-08-07 15:41:28 +08:00
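The fix described in this commit can be sketched as follows: before generation, every tensor-like input is moved to the device the model lives on. This is a minimal, framework-agnostic sketch rather than the repository's code. `move_to_device` is a hypothetical helper, and it only assumes that tensors expose a `.to(device)` method, as PyTorch tensors do.

```python
def move_to_device(inputs, device):
    """Recursively move tensor-like values to the given device.

    Hypothetical helper: anything with a callable .to() is treated as a tensor.
    """
    if callable(getattr(inputs, "to", None)):
        return inputs.to(device)
    if isinstance(inputs, dict):
        return {k: move_to_device(v, device) for k, v in inputs.items()}
    if isinstance(inputs, (list, tuple)):
        return type(inputs)(move_to_device(v, device) for v in inputs)
    # Plain Python values (ints, strings, None) are returned unchanged.
    return inputs
```

With PyTorch, the target device could be read from the model itself, for example `next(model.parameters()).device`, and the result passed to `generate`.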
lukeming.lkm
9e033104af
change use_fast_att and fix bfloat loading
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13522792
* check flash att is installed even if use_fast_att is set True and fix bfloat loading
* skip pipeline model placement for quantization
* update unittest for qwen
2023-08-03 15:59:51 +08:00
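The first bullet of this commit amounts to treating `use_fast_att` as a request rather than a guarantee. A minimal sketch of such a check follows, assuming the optional package is importable as `flash_attn` (an assumption, since the commit does not name the module) and using a hypothetical helper name.

```python
import importlib.util


def resolve_use_flash_attn(requested, module_name="flash_attn"):
    """Return True only if fast attention was requested and the module is installed.

    Hypothetical helper; module_name is an assumed top-level import name.
    """
    if not requested:
        return False
    # find_spec returns None when a top-level module cannot be found.
    return importlib.util.find_spec(module_name) is not None
```

Falling back silently keeps the pipeline usable on machines without the optional dependency; logging a warning at the fallback point would be a reasonable addition.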
lukeming.lkm
b3a61ef6f4
update LICENSE
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119
2023-08-03 11:07:34 +08:00
lukeming.lkm
bd2f70a6eb
add quantization in qwen pipelines and relevant unittests
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600
* add quant features
* resolve import
* resolve format
* fix save vocab
2023-08-02 14:05:13 +08:00
lukeming.lkm
33bd74a7be
add qwen 7b base and chat
...
Add the QWen 7b base and chat models and the related pipelines
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235
* add qwen 7b base and chat
* fix logger
* update examples, lint test
* add unittest for qwen base and chat
* rename qwen to qwen-7b
* resolve imports and add a registry to text-generation
* reset load model from pretrained
* fix precheck
* skip qwen test case now
* remove strange file
2023-08-02 09:25:21 +08:00
suluyana
b68b90ba15
skip plugin
2023-07-30 00:30:30 +08:00
suluyana
9ece90ee84
skip plugin test case
2023-07-29 21:35:21 +08:00
suluyan.sly
05e1357c32
Merge branch 'master-github' into master-merge-github-230728
2023-07-28 16:40:34 +08:00
wenmeng.zwm
3b485d5835
fix plugin python module missing files
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749
* fix plugin python module missing files
2023-07-28 16:14:56 +08:00
frozoul
2566d028cd
cv/cv nerf 3d reconstruction 4k nerf damo (#389)
...
* add 4k-nerf core files
* update configure file
* update dataloader and model path
* update unittest
* Delete test_4k.py
* update unittest
* update unittest
* update pre-commit
* update dataloader
* update cuda code path
* check with pre-commit
---------
Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com>
Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>
2023-07-28 10:37:13 +08:00
tongmu.wh
475924a421
correct language recognition task name
...
The ModelScope platform team settled on speech-language-recognition as the name of the language recognition task; the code is updated accordingly.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517
* correct language recognition task name
2023-07-27 21:03:04 +08:00
suluyan.sly
1c6f5fe775
Merge branch 'master-github' into master-merge-github-230727
...
Conflicts:
examples/pytorch/baichuan/finetune_baichuan.py
examples/pytorch/chatglm6b/finetune.py
2023-07-27 17:29:27 +08:00
mengyang.fmy
18f998a85c
add text-to-360pano-image pipeline, mod cv requirements
...
In-house 360 panorama image generation model, scheduled for release in July.
Model weights: https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary
#### Dependencies
##### Because xformers is required, torch 1.13.1 is recommended
```
pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
```
##### The matching diffusers and xformers versions are as follows
```
pip install -U diffusers==0.18.0
pip install xformers==0.0.16
pip install triton accelerate transformers
```
##### The ModelScope library needs the cv extra
```
pip install modelscope
pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
```
##### In addition, the third-party library Real-ESRGAN must be installed as follows
```
# Install basicsr - https://github.com/xinntao/BasicSR
# We use BasicSR for both training and inference
pip install basicsr
# facexlib and gfpgan are for face enhancement
pip install facexlib
pip install gfpgan
pip install Pillow
pip install tqdm
pip install realesrgan==0.3.0
```
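Since the notes above pin specific versions, it can help to verify the environment before running the pipeline. The following is a minimal sketch using only the standard library (`importlib.metadata`, Python 3.8+). `check_pins` is a hypothetical helper, and the pin table simply restates versions listed above.

```python
from importlib import metadata

# Versions restated from the install notes above.
PINS = {"diffusers": "0.18.0", "xformers": "0.0.16", "realesrgan": "0.3.0"}


def check_pins(pins, get_version=metadata.version):
    """Return {package: (expected, found)} for every mismatched or missing pin."""
    problems = {}
    for name, expected in pins.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed at all
        if found != expected:
            problems[name] = (expected, found)
    return problems
```

An empty result from `check_pins(PINS)` means every pinned package is installed at the expected version.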
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430
* add text-to-360pano-image pipeline
* add text-to-360pano-image pipeline, mod cv requirements
* rm redundant files and cv requirements; add standard input and output definitions
* fix diffusers==0.18.0 and run test
* fix diffusers==0.18.0 in multi-modal and run test again
* add model_revision='v1.0.0'
* fix yapf
* add trycatch for enabling xformers
* fix key error
* add install xformers in test/setup
* skip highres.fix in ci
* feat: Fix conflict, auto commit by WebIDE
2023-07-27 11:33:39 +08:00
Wang Qiang
66cf72a75c
Merge pull request #376 from XDUWQ/custom_diffusion
...
Custom method for finetuning stable diffusion
2023-07-27 10:41:38 +08:00
Zackary Shen
ba4db97507
upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407)
...
* add vq_compression model
* add vq_compression model
* check pre-commit for lint test
* fix by flake8
* update
* update
* update
* the last update
* the last update
* update test_level>=0
---------
Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>
2023-07-26 17:20:13 +08:00
tongmu.wh
ba1a333ba6
add language recognition pipelines and models
...
Add language recognition pipeline and model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083
* add language recognition pipelines and models
* add a clustering method for speaker diarization
* define input and output type for language recognition
2023-07-25 21:07:56 +08:00
zeyinzi.jzyz
672c4899e9
add sd swift tuner
...
SD-Tuner based on Swift (LoRA/Adapter/Prompt)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798
* sd swift tuner
* fix pre-checker
2023-07-25 19:00:49 +08:00
shuli.cly
526e1371f5
Merge the speaker-turn-detection codes, local test finished
...
# Speaker Diarization Speaker-Turn Detection CR
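Like Dialogue-Detection, this model is a submodule of the Speaker Diarization task (`audio/speaker diarization`).
This commit adds a model that makes its decision from text. The initial local model was trained with huggingface, and this commit reuses part of the `nlp/token-classification` model code. To ease later maintenance and decouple it from the nlp code, the corresponding modules are created **separately** under model, pipeline, and preprocessor and registered again.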
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720
* std first commit
* local test pass for speaker-turn-detection
* update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts
2023-07-25 18:57:47 +08:00
hemu.zp
80f76ca475
Support stream output for transformers model
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136
* support stream for transformers model
* set test_level >= 2
* support hf model and chatglm2
* remove streaming_output for chatglm2
2023-07-25 17:41:32 +08:00
wenmeng zhou
64203e89ee
Compatibility for huggingface transformers (#391)
2023-07-24 20:53:27 +08:00
XDUWQ
8e00d85317
fix bugs
2023-07-24 19:46:22 +08:00
lylalala
f805d86aed
llama2 support chat (#404)
...
* support chat
* update llama2 chat testcase
* add gen kwargs and devices
* update unittest and support max_length in multi-turn dialogue
2023-07-24 15:38:01 +08:00
tingwei.gtw
d16522723a
[to #42322933] add files
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565
* [to #42322933] add files
* [to #42322933] add files
* [to #42322933] add files
* [to #42322933] add files
* [to #42322933] add files
* update test data
* [to #42322933] add files
* Merge remote-tracking branch 'origin' into feature/sal_try_on
* [to #42322933] add files
* Merge remote-tracking branch 'origin' into feature/sal_try_on
2023-07-24 10:16:29 +08:00
mushenL
f77237b049
add llama2 pipeline (#399)
...
* Modify the parameter passing of the text_generation_pipeline class
* add llama2 pipeline
* add llama pipeline v1.1
* add llama pipeline v1.2
* add llama pipeline v1.3
* add llama pipeline v1.0.4
2023-07-22 21:53:04 +08:00
shuli.cly
13e345f6d9
add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection
...
# Speaker Diarization Dialogue Detection CR
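This model is a submodule of the Speaker Diarization task (`audio/speaker diarization`).
This commit adds a model that makes its decision from text. Its I/O and intermediate steps closely resemble `nlp/text-classification`, and the initial local model was also trained with huggingface, so this commit reuses part of the `nlp/text-classification` model code. To ease later maintenance and decouple it from the nlp code, the corresponding modules are created **separately** under model, pipeline, and preprocessor and registered again.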
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649
* start to add speaker_diarization_dialogue_detection files; Need to change constant and test
* add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection
* update test case
* add comments for speaker diarization dialogue detection pipelines
* add outputs type and inputs type for speaker_diarization_dialogue_detection
2023-07-20 19:29:59 +08:00
shenweichao.swc
05c65ba225
add s2net for panorama_depth_estimation
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819
* add s2net codes
* fix sphdecoder
* Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation
* revise comments in the pipeline
* revise the code
2023-07-20 19:28:03 +08:00
xiangpeng.wxp
4085d821f3
[to #42322933] add polylm, a polyglot large language model
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595
2023-07-20 18:21:07 +08:00
XDUWQ
66795aa3ff
change tests level
2023-07-19 09:41:21 +08:00
Wang Qiang
0b85979f2e
Update diffusers version to 0.18.0 (#377)
...
* update diffusers to 0.18.0
* fix bugs
2023-07-14 19:02:52 +08:00
baiguan.yt
ceac129c6b
add parameters height and width for text-to-video
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907
2023-07-14 16:22:10 +08:00
XDUWQ
34ab717393
custom_diffusion
2023-07-12 19:47:32 +08:00
XDUWQ
1caa45422c
custom diffusion
2023-07-11 20:46:32 +08:00
yeqinghao.yqh
41cbb8e393
Fix mPLUG-Owl generation length bug
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284
2023-07-10 18:56:12 +08:00
tongmu.wh
a7f7a67855
fix details of speaker models
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011
* fix details of speaker models
2023-07-10 18:54:26 +08:00
chenyafeng.cyf
543d03e32b
3dspeaker
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180
2023-07-07 15:00:35 +08:00
wenmeng.zwm
0271b9c256
Merge branch 'master-github' into merge_master_github_0628
2023-06-28 20:27:34 +08:00
Wang Qiang
a018cd6107
Dreambooth method for finetuning stable diffusions (#339)
...
* Copyright
* dreambooth
* dreambooth test trainer
* fix bugs
* pre-commit
---------
Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>
2023-06-28 20:10:28 +08:00
mulin.lyh
1ea9b58447
fix torch2.0 compatibility issue
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13086361
* fix face-alignment compatibility
* fix torch2.0 compatibility issue
2023-06-28 14:15:48 +08:00
Wang Qiang
6942144ad7
Stable Diffusion model checkpoint export to onnx. (#340)
...
* stable diffusion export onnx
* fix pre commit bugs
* fix bugs
* safety checker support
* test export stable diffusion
2023-06-28 13:26:19 +08:00
yuze.zyz
8f18274f75
Add teardown for tests
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12643554
* add teardown for tests
* add teardown for dialog_modeling_trainer,document_grounded_dialog_generate_trainer,document_grounded_dialog_rerank_trainer,document_grounded_dialog_retrieval_trainer,training_args,translation_evaluation_trainer,translation_trainer
2023-06-28 09:44:44 +08:00
mulin.lyh
eb0f0216c6
fix torch 2.x compatibility issue
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13045011
* fix torch 2.x compatibility issue
* fix torch 2.x compatibility issue
* fix complex-valued input tensor matching the output from stft with return_complex=True.
* skip plugin test temporarily while the torch version is being changed
* fix test_speech_signal_process.py compatibility issue
* fix lint issue
* upgrade funasr to 0.6.5
2023-06-27 14:40:51 +08:00
yuze.zyz
a58be34384
Add Lora/Adapter/Prompt and support for chatglm6B and chatglm2-6B
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12770413
* add prompt and lora
* add adapter
* add prefix
* add tests
* adapter smoke test passed
* prompt test passed
* support model id in petl
* migrate chatglm6b
* add train script for chatglm6b
* move gen_kwargs to finetune.py
* add chatglm2
* add model definition
2023-06-27 14:38:18 +08:00
xingjun.wxj
1dbff6cb48
Support jsonl format in meta data
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13071970
* support jsonl in meta
* add UT and refine fetch_meta_files_from_url
2023-06-27 11:58:19 +08:00
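For reference, JSON Lines stores one JSON object per line, which makes it convenient for metadata files that are appended to or streamed. A minimal sketch of reading such a file follows. `read_jsonl` is a hypothetical helper and is not the loader added in this commit.

```python
import json


def read_jsonl(path):
    """Parse a JSON Lines file into a list of objects, skipping blank lines."""
    records = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records
```

Each returned element corresponds to one line of the file, in file order.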