modelscope

mirror of https://github.com/modelscope/modelscope.git synced 2026-02-24 20:19:51 +01:00

Author	SHA1	Message	Date
mulin.lyh	28405071b8	Merge branch 'master-github' of https://github.com/modelscope/modelscope into master-github	2023-08-30 15:21:54 +08:00
mulin.lyh	7c0f6aa5c5	Merge branch 'master-github' into master-merge-github0829	2023-08-29 21:20:54 +08:00
pingzhi.lzh	db27d5218f	cones2 version 1.0 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13778227 * cones2 version 1.0	2023-08-29 21:05:23 +08:00
tastelikefeet	39922160ae	fix pre commit (#510)	2023-08-29 17:55:01 +08:00
Xiaobin	8ce3b4de72	Feature/seq gpt (#507)	2023-08-29 17:47:59 +08:00
tastelikefeet	f2e9ea4199	Replace code with swift wheel (#467)	2023-08-29 17:27:18 +08:00
Jintao	2ee5ebaf35	new Feat/0817 (#504)	2023-08-29 16:43:36 +08:00
myf272609	0a6658c375	[to #42322933] add 3D surface reconstruction model (adds inference code for the 3D surface reconstruction model) Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13807071 * test * update * update * fix issues * fix mesh export	2023-08-29 15:41:59 +08:00
huizheng.hz	7aef73a761	[to #42322933] add image editing model masactrl Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13671142 * add image editing model MasaCtrl * add image editing model MasaCtrl * Merge remote-tracking branch 'origin/master' into cv/image-editing-masactrl	2023-08-24 21:20:47 +08:00
lipandeng.lpd	7db8248dfb	add prost text video retrival model https://www.modelscope.cn/models/damo/multi_modal_clip_vtretrieval_prost/summary Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13487540	2023-08-24 21:06:34 +08:00
bin.xue	6682fb6052	Add speech separation model MossFormer2	2023-08-24 16:49:22 +08:00
Wang Qiang	de33f4dc87	Support sdxl finetune by lora method (#468) * support sdxl finetune by lora * remove useless imports * support sdxl finetune * upgrade diffusers to 0.19.0 * sdxl finetune * fix bugs * pre commit * diffusers>=0.19.0	2023-08-23 11:23:34 +08:00
mulin.lyh	cba4e40bc1	fix numpy pandas compatible issue Models confirmed to be affected (damo): ONE-PEACE-4B: ModuleNotFoundError: MyCustomPipeline: MyCustomModel: No module named 'one_peace' (missing dependency); cv_resnet50_face-reconstruction: incompatible with tf2; nlp_automatic_post_editing_for_translation_en2de: tf2.0 compatibility issue, requires tf1.x; cv_resnet18_ocr-detection-word-level_damo: tf2.x compatibility issue; cv_resnet18_ocr-detection-line-level_damo: tf compatibility issue; cv_resnet101_detection_fewshot-defrcn: the model requires detection0.3+torch1.11.0; speech_dfsmn_ans_psm_48k_causal: librosa and numpy compatibility issue; cv_mdm_motion-generation: compatibility issue with the numpy version it depends on: File "/opt/conda/lib/python3.8/site-packages/smplx/body_models.py"; cv_resnet50_ocr-detection-vlpt: numpy compatibility issue; cv_clip-it_video-summarization_language-guided_en: tf compatibility issue. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13744636 * numpy and pandas no version * modify compatible issue * fix numpy compatible issue * modify ci * fix lint issue * replace Image.ANTIALIAS to Image.Resampling.LANCZOS pillow compatible * skip uncompatible cases * fix numpy compatible issue, skip cases that can not compatbile numpy or tensorflow2.x * skip compatible cases * fix clip model issue * fix body 3d keypoints compatible issue	2023-08-22 23:04:31 +08:00
Kang	cccd502fa2	fix video output of image2video (#488) * fix video output * fix logger.error * fix log error	2023-08-22 10:57:23 +08:00
Kang	040698e201	add video2video (#486) * add video2video * fix bugs of pre-commit * update some files * fix video write module * fix max_frames	2023-08-21 18:44:14 +08:00
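yanyi.ys	be26a62c48	Portrait generation code review. This CR submits code for portrait generation, which can generate an image in the corresponding pose given a portrait image and the corresponding target pose data. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13715612	2023-08-21 18:42:24 +08:00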
wenmeng.zwm	7f7573f15d	fix ut error Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13716099 * fix ut error	2023-08-19 21:16:07 +08:00
Wang Qiang	78f5e6a8bf	Switching model from huggingface to modelscope hub of efficient tuning (#479)	2023-08-18 20:30:04 +08:00
Wang Qiang	f321804ab2	Merge pull request #472 from kangzhao2/baishao_test Add image2video	2023-08-18 20:29:36 +08:00
kangzhao2	2643d985dc	update test_image2video.py	2023-08-18 20:13:20 +08:00
kangzhao2	2605935797	fix pre-commit	2023-08-18 11:47:45 +08:00
kangzhao2	4ca76b0a85	fix comments again	2023-08-17 20:34:43 +08:00
kangzhao2	b8c76a426f	fix comments	2023-08-17 20:03:05 +08:00
Wang Qiang	4ed1111d70	Fix bugs of configs file path and duration (#476) * fix bugs of configs file path and duration * pre commit * delete configs * test videocomposer model version	2023-08-16 21:03:11 +08:00
kangzhao2	90f7a5c6c0	update files	2023-08-16 11:35:15 +08:00
kangzhao2	037e73fe6e	baishao	2023-08-15 21:32:30 +08:00
wenmeng.zwm	8db088d2cb	fix virtural tryon ut error	2023-08-15 16:49:15 +08:00
Wang Qiang	ee8afd2d62	VideoComposer: Compositional Video Synthesis with Motion Controllability (#431) * VideoComposer: Compositional Video Synthesis with Motion Controllability * videocomposer pipeline * pre commit * delete xformers	2023-08-15 12:01:03 +08:00
Jintao	18d33a4825	fix copytree python37 bug (#464) * fix copytree python37 bug * add copytree_py37 function	2023-08-14 11:45:33 +08:00
wenmeng zhou	74d8317bb0	fix pipeline check error (#455) * fix pipeline check error * update	2023-08-11 15:52:53 +08:00
Ran Zhou	026a9ef227	Add machine reading comprehension model, preprocessor and pipeline (#451) * Add machine reading comprehension model, preprocessor and pipeline * fix precommit errors * Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr --------- Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>	2023-08-11 13:47:26 +08:00
chenyafeng.cyf	33605de759	eres2net_lre_v2 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13602081 * add eres2net_base_large_lre * eres2net_language_identification * eres2net_lre_v2	2023-08-10 17:44:43 +08:00
liuyhwangyh	75a14a36ba	get github diff files (#446) * get github diff files * add github environment --------- Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>	2023-08-10 16:22:37 +08:00
wenmeng.zwm	725521a2af	skip test_text_to_360panorama_image test	2023-08-08 16:41:19 +08:00
zsl01670416	b0699fd8e2	support llama2 inputs to device in function generate fix error inputs and model were not on the same device. if they are not on the same device, inputs will be implemented function to model device. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13546989 * support llama inputs to device in function generate * modify test qwen text generation according to github code	2023-08-07 15:41:28 +08:00
lukeming.lkm	9e033104af	change use_fast_att and fix bfloat loading Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13522792 * check flash att is installed even if use_fast_att is set True and fix bfloat loading * skip pipeline model placement for quantization * update unittest for qwen	2023-08-03 15:59:51 +08:00
lukeming.lkm	b3a61ef6f4	update LICENSE Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119	2023-08-03 11:07:34 +08:00
lukeming.lkm	bd2f70a6eb	add quantization in qwen pipelines and relevant unittests Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600 * add quant features * resolve import * resolve format * fix save vocab	2023-08-02 14:05:13 +08:00
lukeming.lkm	33bd74a7be	add qwen 7b base and chat (adds the QWen 7b base and chat models and the related pipelines) Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235 * add qwen 7b base and chat * fix logger * update examples, lint test * add unittest for qwen base and chat * rename qwen to qwen-7b * resolve imports and add a registry to text-generation * reset load model from pretrained * fix precheck * skip qwen test case now * remove strange file	2023-08-02 09:25:21 +08:00
suluyana	b68b90ba15	skip plugin	2023-07-30 00:30:30 +08:00
suluyana	9ece90ee84	skip plugin test case	2023-07-29 21:35:21 +08:00
suluyan.sly	05e1357c32	Merge branch 'master-github' into master-merge-github-230728	2023-07-28 16:40:34 +08:00
wenmeng.zwm	3b485d5835	fix plugin python module missing files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749 * fix plugin python module missing files	2023-07-28 16:14:56 +08:00
frozoul	2566d028cd	cv/cv nerf 3d reconstruction 4k nerf damo (#389) * add 4k-nerf core files * update configure file * update dataloader and model path * update unittest * Delete test_4k.py * update unittest * update unittest * update pre-commit * update dataloader * update cuda code path * check with pre-commit --------- Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com> Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>	2023-07-28 10:37:13 +08:00
tongmu.wh	475924a421	correct language recognition taks name The ModelScope platform team finalized the task name for language recognition as speech-language-recognition; the corresponding changes are made in the code. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517 * correct language recognition taks name	2023-07-27 21:03:04 +08:00
suluyan.sly	1c6f5fe775	Merge branch 'master-github' into master-merge-github-230727 Conflicts: examples/pytorch/baichuan/finetune_baichuan.py examples/pytorch/chatglm6b/finetune.py	2023-07-27 17:29:27 +08:00
mengyang.fmy	18f998a85c	add text-to-360pano-image pipeline, mod cv requirements 360 panorama image generation model planned for release in July; the weights of the in-house model are at https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary #### Dependency notes ##### Because xformers is used, torch 1.13.1 is recommended ``` pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116 ``` ##### The corresponding diffusers and xformers versions are as follows ``` pip install -U diffusers==0.18.0 pip install xformers==0.0.16 pip install triton, accelerate, transformers ``` ##### The ModelScope Library requires the cv extra ``` pip install modelscope pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html ``` ##### In addition, a third-party library, Real-ESRGAN, needs to be installed as follows ``` # Install basicsr - https://github.com/xinntao/BasicSR # We use BasicSR for both training and inference pip install basicsr # facexlib and gfpgan are for face enhancement pip install facexlib pip install gfpgan pip install Pillow pip install tqdm pip install realesrgan==0.3.0 ``` Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430 * add text-to-360pano-image pipeline * add text-to-360pano-image pipeline, mod cv requirements * rm redundant files and cv requirements; add standard input and output definations * fix diffusers==0.18.0 and run test * fix diffusers==0.18.0 in multi-modal and run test again * add model_revision='v1.0.0' * fix yapf * add trycatch for enabling xformers * fix key error * add install xformers in test/setup * skip highres.fix in ci * feat: Fix conflict, auto commit by WebIDE	2023-07-27 11:33:39 +08:00
Wang Qiang	66cf72a75c	Merge pull request #376 from XDUWQ/custom_diffusion Custom method for finetuning stable diffusion	2023-07-27 10:41:38 +08:00
Zackary Shen	ba4db97507	upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407) * add vq_compression model * add vq_compression model * check pre-commit for lint test * fix by flake8 * update * update * update * the last update * the laast update * update test_level>=0 --------- Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>	2023-07-26 17:20:13 +08:00
tongmu.wh	ba1a333ba6	add language recognition pipelines and models (new language recognition pipeline and model) Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083 * add language recognition pipelines and models * add a clustering method for speaker diarization * define input and output type for language recognition	2023-07-25 21:07:56 +08:00


1135 Commits