mirror of
https://github.com/modelscope/modelscope.git
synced 2026-02-24 20:19:51 +01:00
7月份计划上线的360全景图生成模型,自研 模型权重文件地址https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary #### 依赖项说明 ##### 由于要使用xformers,torch版本最好使用1.13.1 ``` pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116 ``` ##### 对应的diffusers和xformers版本如下 ``` pip install -U diffusers==0.18.0 pip install xformers==0.0.16 pip install triton, accelerate, transformers ``` ##### ModelScope Library 需要使用cv ``` pip install modelscope pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html ``` ##### 此外,还需要安装第三方的一个库,Real-ESRGAN, 安装方法如下 ``` # Install basicsr - https://github.com/xinntao/BasicSR # We use BasicSR for both training and inference pip install basicsr # facexlib and gfpgan are for face enhancement pip install facexlib pip install gfpgan pip install Pillow pip install tqdm pip install realesrgan==0.3.0 ``` Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430 * add text-to-360pano-image pipeline * add text-to-360pano-image pipeline, mod cv requirements * rm redundant files and cv requirements; add standard input and output definations * fix diffusers==0.18.0 and run test * fix diffusers==0.18.0 in multi-modal and run test again * add model_revision='v1.0.0' * fix yapf * add trycatch for enabling xformers * fix key error * add install xformers in test/setup * skip highres.fix in ci * feat: Fix conflict, auto commit by WebIDE
30 lines
548 B
Plaintext
30 lines
548 B
Plaintext
accelerate
|
|
cloudpickle
|
|
decord>=0.6.0
|
|
diffusers==0.18.0
|
|
fairseq
|
|
ftfy>=6.0.3
|
|
librosa==0.9.2
|
|
opencv-python
|
|
pycocoevalcap>=1.2
|
|
pycocotools>=2.0.4
|
|
pydot
|
|
# compatible with taming-transformers-rom1504
|
|
pytorch_lightning<=1.7.7
|
|
rapidfuzz
|
|
# rough-score was just recently updated from 0.0.4 to 0.0.7
|
|
# which introduced compatability issues that are being investigated
|
|
rouge_score<=0.0.4
|
|
sacrebleu
|
|
safetensors
|
|
# scikit-video
|
|
soundfile
|
|
taming-transformers-rom1504
|
|
timm
|
|
tokenizers
|
|
torchvision
|
|
transformers>=4.27.1
|
|
# triton==2.0.0.dev20221120
|
|
unicodedata2
|
|
zhconv
|