modelscope

mirror of https://github.com/modelscope/modelscope.git synced 2026-07-10 04:22:33 +02:00

Author	SHA1	Message	Date
Wang Qiang	78f5e6a8bf	Switching model from huggingface to modelscope hub of efficient tuning (#479 )	2023-08-18 20:30:04 +08:00
Wang Qiang	f321804ab2	Merge pull request #472 from kangzhao2/baishao_test Add image2video	2023-08-18 20:29:36 +08:00
kangzhao2	2643d985dc	update test_image2video.py	2023-08-18 20:13:20 +08:00
kangzhao2	2605935797	fix pre-commit	2023-08-18 11:47:45 +08:00
kangzhao2	4ca76b0a85	fix comments again	2023-08-17 20:34:43 +08:00
kangzhao2	b8c76a426f	fix comments	2023-08-17 20:03:05 +08:00
Wang Qiang	4ed1111d70	Fix bugs of configs file path and duration (#476 ) * fix bugs of configs file path and duration * pre commit * delete configs * test videocomposer model version	2023-08-16 21:03:11 +08:00
kangzhao2	90f7a5c6c0	update files	2023-08-16 11:35:15 +08:00
kangzhao2	037e73fe6e	baishao	2023-08-15 21:32:30 +08:00
Wang Qiang	ee8afd2d62	VideoComposer: Compositional Video Synthesis with Motion Controllability (#431 ) * VideoComposer: Compositional Video Synthesis with Motion Controllability * videocomposer pipeline * pre commit * delete xformers	2023-08-15 12:01:03 +08:00
Jintao	18d33a4825	fix copytree python37 bug (#464 ) * fix copytree python37 bug * add copytree_py37 function	2023-08-14 11:45:33 +08:00
wenmeng zhou	74d8317bb0	fix pipeline check error (#455 ) * fix pipeline check error * update	2023-08-11 15:52:53 +08:00
Ran Zhou	026a9ef227	Add machine reading comprehension model, preprocessor and pipeline (#451 ) * Add machine reading comprehension model, preprocessor and pipeline * fix precommit errors * Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr --------- Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>	2023-08-11 13:47:26 +08:00
liuyhwangyh	75a14a36ba	get github diff files (#446 ) * get github diff files * add github environment --------- Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>	2023-08-10 16:22:37 +08:00
wenmeng.zwm	725521a2af	skip test_text_to_360panorama_image test	2023-08-08 16:41:19 +08:00
lukeming.lkm	b3a61ef6f4	update LICENSE Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119	2023-08-03 11:07:34 +08:00
lukeming.lkm	bd2f70a6eb	add quantization in qwen pipelines and relevant unittests Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600 * add quant features * resolve import * resolve format * fix save vocab	2023-08-02 14:05:13 +08:00
lukeming.lkm	33bd74a7be	add qwen 7b base and chat Add the QWen 7b base model, the chat model, and the related pipelines Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235 * add qwen 7b base and chat * fix logger * update examples, lint test * add unittest for qwen base and chat * rename qwen to qwen-7b * resolve imports and add a registry to text-generation * reset load model from pretrained * fix precheck * skip qwen test case now * remove strange file	2023-08-02 09:25:21 +08:00
suluyana	b68b90ba15	skip plugin	2023-07-30 00:30:30 +08:00
suluyana	9ece90ee84	skip plugin test case	2023-07-29 21:35:21 +08:00
suluyan.sly	05e1357c32	Merge branch 'master-github' into master-merge-github-230728	2023-07-28 16:40:34 +08:00
wenmeng.zwm	3b485d5835	fix plugin python module missing files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749 * fix plugin python module missing files	2023-07-28 16:14:56 +08:00
frozoul	2566d028cd	cv/cv nerf 3d reconstruction 4k nerf damo (#389 ) * add 4k-nerf core files * update configure file * update dataloader and model path * update unittest * Delete test_4k.py * update unittest * update unittest * update pre-commit * update dataloader * update cuda code path * check with pre-commit --------- Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com> Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>	2023-07-28 10:37:13 +08:00
tongmu.wh	475924a421	correct language recognition task name The ModelScope platform team settled on speech-language-recognition as the name of the language recognition task; the code is updated accordingly Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517 * correct language recognition task name	2023-07-27 21:03:04 +08:00
suluyan.sly	1c6f5fe775	Merge branch 'master-github' into master-merge-github-230727 Conflicts: examples/pytorch/baichuan/finetune_baichuan.py examples/pytorch/chatglm6b/finetune.py	2023-07-27 17:29:27 +08:00
mengyang.fmy	18f998a85c	add text-to-360pano-image pipeline, mod cv requirements A 360 panorama image generation model planned for release in July; the in-house model weights are at https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary #### Dependencies ##### Because xformers is used, torch 1.13.1 is recommended ``` pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116 ``` ##### The matching diffusers and xformers versions are as follows ``` pip install -U diffusers==0.18.0 pip install xformers==0.0.16 pip install triton accelerate transformers ``` ##### The ModelScope Library needs the cv extra ``` pip install modelscope pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html ``` ##### In addition, a third-party library, Real-ESRGAN, must be installed as follows ``` # Install basicsr - https://github.com/xinntao/BasicSR # We use BasicSR for both training and inference pip install basicsr # facexlib and gfpgan are for face enhancement pip install facexlib pip install gfpgan pip install Pillow pip install tqdm pip install realesrgan==0.3.0 ``` Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430 * add text-to-360pano-image pipeline * add text-to-360pano-image pipeline, mod cv requirements * rm redundant files and cv requirements; add standard input and output definitions * fix diffusers==0.18.0 and run test * fix diffusers==0.18.0 in multi-modal and run test again * add model_revision='v1.0.0' * fix yapf * add trycatch for enabling xformers * fix key error * add install xformers in test/setup * skip highres.fix in ci * feat: Fix conflict, auto commit by WebIDE	2023-07-27 11:33:39 +08:00
Wang Qiang	66cf72a75c	Merge pull request #376 from XDUWQ/custom_diffusion Custom method for finetuning stable diffusion	2023-07-27 10:41:38 +08:00
Zackary Shen	ba4db97507	upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407 ) * add vq_compression model * add vq_compression model * check pre-commit for lint test * fix by flake8 * update * update * update * the last update * the laast update * update test_level>=0 --------- Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>	2023-07-26 17:20:13 +08:00
tongmu.wh	ba1a333ba6	add language recognition pipelines and models Add a language recognition pipeline and model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083 * add language recognition pipelines and models * add a clustering method for speaker diarization * define input and output type for language recognition	2023-07-25 21:07:56 +08:00
zeyinzi.jzyz	672c4899e9	add sd swift tuner SD-Tuner base on Swift (LoRA/Adapter/Prompt) Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798 * sd swift tuner * fix pre-checker	2023-07-25 19:00:49 +08:00
shuli.cly	526e1371f5	Merge the speaker-turn-detection codes, local test finished # Speaker Diarization Speaker-Turn Detection CR Like Dialogue-Detection, this model is a submodule of the Speaker Diarization (`audio/speaker diarization`) task. This commit adds a model that makes its decision from text. Its initial local model was trained with huggingface, and the commit reuses part of the `nlp/token-classification` model code. To ease later maintenance and decouple it from the nlp code, dedicated modules were created under model, pipeline, and preprocessor and registered again. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720 * std first commit * local test pass for speaker-turn-detection * update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts	2023-07-25 18:57:47 +08:00
hemu.zp	80f76ca475	Support stream output for transformers model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136 * support stream for transformers model * set test_level >= 2 * support hf model and chatglm2 * remove streaming_output for chatglm2	2023-07-25 17:41:32 +08:00
wenmeng zhou	64203e89ee	Compatibility for huggingface transformers (#391 )	2023-07-24 20:53:27 +08:00
XDUWQ	8e00d85317	fix bugs	2023-07-24 19:46:22 +08:00
lylalala	f805d86aed	llama2 support chat (#404 ) * support chat * update llama2 chat testcase * add gen kwargs and devices * update unittest and support max_length in multi-turn dialogue	2023-07-24 15:38:01 +08:00
tingwei.gtw	d16522723a	[to #42322933 ] add files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565 * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * update test data * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on	2023-07-24 10:16:29 +08:00
mushenL	f77237b049	add llama2 pipeline (#399 ) * Modify the parameter passing of the text_generation_pipeline class * add llama2 pipeline * add llama pipeline v1.1 * add llama pipeline v1.2 * add llama pipeline v1.3 * add llama pipeline v1.0.4	2023-07-22 21:53:04 +08:00
shuli.cly	13e345f6d9	add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection # Speaker Diarization Dialogue Detection CR This model is a submodule of the Speaker Diarization (`audio/speaker diarization`) task. This commit adds a model that makes its decision from text. Its IO and intermediate steps closely resemble `nlp/text-classification`, and its initial local model was also trained with huggingface, so the commit reuses part of the `nlp/text-classification` model code. To ease later maintenance and decouple it from the nlp code, dedicated modules were created under model, pipeline, and preprocessor and registered again. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649 * start to add speaker_diarization_dialogue_detection files; Need to change constant and test * add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection * update test case * add comments for speaker diarization dialogue detection pipelines * add outputs type and inputs type for speaker_diarization_dialogue_detection	2023-07-20 19:29:59 +08:00
shenweichao.swc	05c65ba225	add s2net for panorama_depth_estimation Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819 * add s2net codes * fix sphdecoder * Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation * revise comments in the pipeline * revise the code	2023-07-20 19:28:03 +08:00
xiangpeng.wxp	4085d821f3	[to #42322933 ] add polylm, a polyglot large language model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595	2023-07-20 18:21:07 +08:00
XDUWQ	66795aa3ff	change tests level	2023-07-19 09:41:21 +08:00
Wang Qiang	0b85979f2e	Update diffusers version to 0.18.0 (#377 ) * update diffusers to 0.18.0 * fix bugs	2023-07-14 19:02:52 +08:00
baiguan.yt	ceac129c6b	add parameters height and width for text-to-video Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907	2023-07-14 16:22:10 +08:00
XDUWQ	34ab717393	custom_diffusion	2023-07-12 19:47:32 +08:00
XDUWQ	1caa45422c	custom diffusion	2023-07-11 20:46:32 +08:00
yeqinghao.yqh	41cbb8e393	Fix mPLUG-Owl generation length bug Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284	2023-07-10 18:56:12 +08:00
tongmu.wh	a7f7a67855	fix details of speaker models Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011 * fix details of speaker models	2023-07-10 18:54:26 +08:00
chenyafeng.cyf	543d03e32b	3dspeaker Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180	2023-07-07 15:00:35 +08:00
wenmeng.zwm	0271b9c256	Merge branch 'master-github' into merge_master_github_0628	2023-06-28 20:27:34 +08:00
Wang Qiang	a018cd6107	Dreambooth method for finetuning stable diffusions (#339 ) * Copyright * dreambooth * dreambooth test trainer * fix bugs * pre-commit --------- Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>	2023-06-28 20:10:28 +08:00

1014 Commits