Commit Graph

1135 Commits

Author SHA1 Message Date
mulin.lyh
28405071b8 Merge branch 'master-github' of https://github.com/modelscope/modelscope into master-github 2023-08-30 15:21:54 +08:00
mulin.lyh
7c0f6aa5c5 Merge branch 'master-github' into master-merge-github0829 2023-08-29 21:20:54 +08:00
pingzhi.lzh
db27d5218f cones2 version 1.0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13778227

* cones2 version 1.0
2023-08-29 21:05:23 +08:00
tastelikefeet
39922160ae fix pre commit (#510) 2023-08-29 17:55:01 +08:00
Xiaobin
8ce3b4de72 Feature/seq gpt (#507) 2023-08-29 17:47:59 +08:00
tastelikefeet
f2e9ea4199 Replace code with swift wheel (#467) 2023-08-29 17:27:18 +08:00
Jintao
2ee5ebaf35 new Feat/0817 (#504) 2023-08-29 16:43:36 +08:00
myf272609
0a6658c375 [to #42322933] add 3D surface reconstruction model
Add inference code for the 3D surface reconstruction model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13807071
* test

* update

* update

* fix issues

* fix mesh export
2023-08-29 15:41:59 +08:00
huizheng.hz
7aef73a761 [to #42322933] add image editing model masactrl
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13671142
* add image editing model MasaCtrl

* add image editing model MasaCtrl

* Merge remote-tracking branch 'origin/master' into cv/image-editing-masactrl
2023-08-24 21:20:47 +08:00
lipandeng.lpd
7db8248dfb add prost text video retrieval model
https://www.modelscope.cn/models/damo/multi_modal_clip_vtretrieval_prost/summary

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13487540
2023-08-24 21:06:34 +08:00
bin.xue
6682fb6052 Add speech separation model MossFormer2 2023-08-24 16:49:22 +08:00
Wang Qiang
de33f4dc87 Support sdxl finetune by lora method (#468)
* support sdxl finetune by lora

* remove useless imports

* support sdxl finetune

* upgrade diffusers to 0.19.0

* sdxl finetune

* fix bugs

* pre commit

* diffusers>=0.19.0
2023-08-23 11:23:34 +08:00
mulin.lyh
cba4e40bc1 fix numpy pandas compatible issue
Models confirmed to be affected (damo):  
ONE-PEACE-4B	ModuleNotFoundError: MyCustomPipeline: MyCustomModel: No module named 'one_peace' (missing dependency).
cv_resnet50_face-reconstruction	not compatible with tf2  
nlp_automatic_post_editing_for_translation_en2de	tf2.0 compatibility issue; requires tf1.x  
cv_resnet18_ocr-detection-word-level_damo	tf2.x compatibility issue  
cv_resnet18_ocr-detection-line-level_damo	tf compatibility issue  
cv_resnet101_detection_fewshot-defrcn	model requires detection0.3+torch1.11.0  
speech_dfsmn_ans_psm_48k_causal	librosa, numpy compatibility issue  
cv_mdm_motion-generation	numpy version compatibility issue in a dependency: File "/opt/conda/lib/python3.8/site-packages/smplx/body_models.py",  
cv_resnet50_ocr-detection-vlpt	numpy compatibility issue  
cv_clip-it_video-summarization_language-guided_en	tf compatibility issue

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13744636
* numpy and pandas no version

* modify compatible issue

* fix numpy compatible issue

* modify ci

* fix lint issue

* replace Image.ANTIALIAS with Image.Resampling.LANCZOS for pillow compatibility

* skip incompatible cases

* fix numpy compatible issue, skip cases that are not compatible with numpy or tensorflow2.x

* skip compatible cases

* fix clip model issue

* fix body 3d keypoints compatible issue
2023-08-22 23:04:31 +08:00
Kang
cccd502fa2 fix video output of image2video (#488)
* fix video output

* fix logger.error

* fix log error
2023-08-22 10:57:23 +08:00
Kang
040698e201 add video2video (#486)
* add video2video

* fix bugs of pre-commit

* update some files

* fix video write module

* fix max_frames
2023-08-21 18:44:14 +08:00
yanyi.ys
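be26a62c48 Human image generation code review
This CR submits code for human image generation, which takes a given portrait image and the corresponding target pose data and generates an image in that pose.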

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13715612
2023-08-21 18:42:24 +08:00
wenmeng.zwm
7f7573f15d fix ut error
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13716099
* fix ut error
2023-08-19 21:16:07 +08:00
Wang Qiang
78f5e6a8bf Switching model from huggingface to modelscope hub of efficient tuning (#479) 2023-08-18 20:30:04 +08:00
Wang Qiang
f321804ab2 Merge pull request #472 from kangzhao2/baishao_test
Add image2video
2023-08-18 20:29:36 +08:00
kangzhao2
2643d985dc update test_image2video.py 2023-08-18 20:13:20 +08:00
kangzhao2
2605935797 fix pre-commit 2023-08-18 11:47:45 +08:00
kangzhao2
4ca76b0a85 fix comments again 2023-08-17 20:34:43 +08:00
kangzhao2
b8c76a426f fix comments 2023-08-17 20:03:05 +08:00
Wang Qiang
4ed1111d70 Fix bugs of configs file path and duration (#476)
* fix bugs of configs file path and duration

* pre commit

* delete configs

* test videocomposer model version
2023-08-16 21:03:11 +08:00
kangzhao2
90f7a5c6c0 update files 2023-08-16 11:35:15 +08:00
kangzhao2
037e73fe6e baishao 2023-08-15 21:32:30 +08:00
wenmeng.zwm
8db088d2cb fix virtual tryon ut error 2023-08-15 16:49:15 +08:00
Wang Qiang
ee8afd2d62 VideoComposer: Compositional Video Synthesis with Motion Controllability (#431)
* VideoComposer: Compositional Video Synthesis with Motion Controllability

* videocomposer pipeline

* pre commit

* delete xformers
2023-08-15 12:01:03 +08:00
Jintao
18d33a4825 fix copytree python37 bug (#464)
* fix copytree python37 bug

* add copytree_py37 function
2023-08-14 11:45:33 +08:00
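Note on 18d33a4825: the log does not say what the Python 3.7 copytree bug was. One common cause is that `shutil.copytree` only accepts the `dirs_exist_ok` argument from Python 3.8 onward. A minimal sketch of a fallback under that assumption follows; `copytree_compat` is a hypothetical name used for illustration and is not the repository's `copytree_py37`.

```python
import os
import shutil
import sys


def copytree_compat(src, dst):
    """Copy the tree at src into dst, tolerating an already existing dst.

    Hypothetical helper: assumes the Python 3.7 problem is the missing
    dirs_exist_ok argument, which the commit log does not confirm.
    """
    if sys.version_info >= (3, 8):
        # dirs_exist_ok was added to shutil.copytree in Python 3.8.
        return shutil.copytree(src, dst, dirs_exist_ok=True)
    # Python 3.7 fallback: create dst if needed and copy entry by entry.
    os.makedirs(dst, exist_ok=True)
    for name in os.listdir(src):
        src_path = os.path.join(src, name)
        dst_path = os.path.join(dst, name)
        if os.path.isdir(src_path):
            copytree_compat(src_path, dst_path)
        else:
            # copy2 preserves file metadata such as modification time.
            shutil.copy2(src_path, dst_path)
    return dst
```

Calling `copytree_compat(src, dst)` twice in a row succeeds on both interpreter versions, whereas a plain `shutil.copytree(src, dst)` raises `FileExistsError` when `dst` already exists.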
wenmeng zhou
74d8317bb0 fix pipeline check error (#455)
* fix pipeline check error

* update
2023-08-11 15:52:53 +08:00
Ran Zhou
026a9ef227 Add machine reading comprehension model, preprocessor and pipeline (#451)
* Add machine reading comprehension model, preprocessor and pipeline

* fix precommit errors

* Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr

---------

Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>
2023-08-11 13:47:26 +08:00
chenyafeng.cyf
33605de759 eres2net_lre_v2
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13602081

* add eres2net_base_large_lre

* eres2net_language_identification

* eres2net_lre_v2
2023-08-10 17:44:43 +08:00
liuyhwangyh
75a14a36ba get github diff files (#446)
* get github diff files

* add github environment

---------

Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
2023-08-10 16:22:37 +08:00
wenmeng.zwm
725521a2af skip test_text_to_360panorama_image test 2023-08-08 16:41:19 +08:00
zsl01670416
b0699fd8e2 support llama2 inputs to device in function generate
Fix an error raised when the inputs and the model were not on the same device. If they are on different devices, the inputs are now moved to the model's device.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13546989
* support llama inputs to device in function generate

* modify test qwen text generation according to github code
2023-08-07 15:41:28 +08:00
lukeming.lkm
9e033104af change use_fast_att and fix bfloat loading
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13522792

* check that flash att is installed even if use_fast_att is set to True, and fix bfloat loading

* skip pipeline model placement for quantization

* update unittest for qwen
2023-08-03 15:59:51 +08:00
lukeming.lkm
b3a61ef6f4 update LICENSE
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119
2023-08-03 11:07:34 +08:00
lukeming.lkm
bd2f70a6eb add quantization in qwen pipelines and relevant unittests
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600

* add quant features

* resolve import

* resolve format

* fix save vocab
2023-08-02 14:05:13 +08:00
lukeming.lkm
33bd74a7be add qwen 7b base and chat
Add the QWen 7b base and chat models and the related pipelines
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235

* add qwen 7b base and chat

* fix logger

* update examples, lint test

* add unittest for qwen base and chat

* rename qwen to qwen-7b

* resolve imports and add a registry to text-generation

* reset load model from pretrained

* fix precheck

* skip qwen test case now

* remove strange file
2023-08-02 09:25:21 +08:00
suluyana
b68b90ba15 skip plugin 2023-07-30 00:30:30 +08:00
suluyana
9ece90ee84 skip plugin test case 2023-07-29 21:35:21 +08:00
suluyan.sly
05e1357c32 Merge branch 'master-github' into master-merge-github-230728 2023-07-28 16:40:34 +08:00
wenmeng.zwm
3b485d5835 fix plugin python module missing files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749
* fix plugin python module missing files
2023-07-28 16:14:56 +08:00
frozoul
2566d028cd cv/cv nerf 3d reconstruction 4k nerf damo (#389)
* add 4k-nerf core files

* update configure file

* update dataloader and model path

* update unittest

* Delete test_4k.py

* update unittest

* update unittest

* update pre-commit

* update dataloader

* update cuda code path

* check with pre-commit

---------

Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com>
Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>
2023-07-28 10:37:13 +08:00
tongmu.wh
475924a421 correct language recognition task name
The ModelScope platform team settled on speech-language-recognition as the name of the language recognition task; the code is updated accordingly.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517
* correct language recognition task name
2023-07-27 21:03:04 +08:00
suluyan.sly
1c6f5fe775 Merge branch 'master-github' into master-merge-github-230727
Conflicts:
       examples/pytorch/baichuan/finetune_baichuan.py
       examples/pytorch/chatglm6b/finetune.py
2023-07-27 17:29:27 +08:00
mengyang.fmy
18f998a85c add text-to-360pano-image pipeline, mod cv requirements
In-house 360 panorama image generation model, planned for release in July

Model weights: https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary


#### Dependency notes

##### Because xformers is used, torch 1.13.1 is recommended
```
pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
```
##### The corresponding diffusers and xformers versions are as follows
```
pip install -U diffusers==0.18.0
pip install xformers==0.0.16
pip install triton accelerate transformers
```

##### The ModelScope Library must be installed with the cv extra
```
pip install modelscope
pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
```

##### In addition, a third-party library, Real-ESRGAN, must be installed as follows
```
# Install basicsr - https://github.com/xinntao/BasicSR
# We use BasicSR for both training and inference
pip install basicsr
# facexlib and gfpgan are for face enhancement
pip install facexlib
pip install gfpgan
pip install Pillow
pip install tqdm
pip install realesrgan==0.3.0
```
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430
* add text-to-360pano-image pipeline

* add text-to-360pano-image pipeline, mod cv requirements

* rm redundant files and cv requirements; add standard input and output definitions

* fix diffusers==0.18.0 and run test

* fix diffusers==0.18.0 in multi-modal and run test again

* add model_revision='v1.0.0'

* fix yapf

* add trycatch for enabling xformers

* fix key error

* add install xformers in test/setup

* skip highres.fix in ci

* feat: Fix conflict, auto commit by WebIDE
2023-07-27 11:33:39 +08:00
Wang Qiang
66cf72a75c Merge pull request #376 from XDUWQ/custom_diffusion
Custom method for finetuning stable diffusion
2023-07-27 10:41:38 +08:00
Zackary Shen
ba4db97507 upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407)
* add vq_compression model

* add vq_compression model

* check pre-commit for lint test

* fix by flake8

* update

* update

* update

* the last update

* the laast update

* update test_level>=0

---------

Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>
2023-07-26 17:20:13 +08:00
tongmu.wh
ba1a333ba6 add language recognition pipelines and models
Add language recognition pipeline and model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083
* add language recognition pipelines and models

* add a clustering method for speaker diarization

* define input and output type for language recognition
2023-07-25 21:07:56 +08:00