最近ChatGPT非常火,以对话的形式回答你的大部分问题,大家用它写脚本、稿件,修改代码等等。用的我心惊胆战,一不小心那天就会失业。自己也偷偷问了些自己难搞的问题。
是一种机器学习模型,它经过训练可以逐步对随机高斯噪声进行去噪以获得感兴趣的样本,例如生成图像。 扩散模型有一个主要的缺点就是去噪过程的时间和内存消耗都非常昂贵。这会使进程变慢,并消耗大量内存。主要原因是它们在像素空间中运行,特别是在生成高分辨率图像时。
其实人工智能AI画图现在也很厉害,这几天看到stable diffusion AI画图非常厉害,水平相当高。先看几张图。
这东西这么厉害,自己应该拿来试试,研究几天,我的电脑可以安装。安装过程比较复杂,安装加调试用了一天多时间。
Stable Diffusion是基于python开发的,有人开发了用户界面,就是Stable Diffusion webUI版。安装过程按这个软件包跑代码就可以。官方给的教程只有下面这些:
Install Python 3.10.6, checking “Add Python to PATH”Install git.Download the stable-diffusion-webui repository, for example by runninggit clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git.Run webui-user.bat from Windows Explorer as normal, non-administrator, user.
关于安装,最好机器有显卡,而且是N卡,AMD的显卡也可以,比较麻烦。安装需要python3.10.6,安装过程经常出错,可能是网络环境因素。安装过程中会安装六七个调用的python库,那几个库可以单独安装,检测通过就可以跳过。
涉及的包有这些,这些都可以看说明自己安装,再最后打开安装。
Stable Diffusion – https://github.com/CompVis/stable-diffusion, https://github.com/CompVis/taming-transformersk-diffusion – https://github.com/crowsonkb/k-diffusion.gitGFPGAN – https://github.com/TencentARC/GFPGAN.gitCodeFormer – https://github.com/sczhou/CodeFormerESRGAN – https://github.com/xinntao/ESRGANSwinIR – https://github.com/JingyunLiang/SwinIRSwin2SR – https://github.com/mv-lab/swin2srLDSR – https://github.com/Hafiidz/latent-diffusionMiDaS – https://github.com/isl-org/MiDaSIdeas for optimizations – https://github.com/basujindal/stable-diffusionCross Attention layer optimization – Doggettx – https://github.com/Doggettx/stable-diffusion, original idea for prompt editing.Cross Attention layer optimization – InvokeAI, lstein – https://github.com/invoke-ai/InvokeAI (originally http://github.com/lstein/stable-diffusion)Sub-quadratic Cross Attention layer optimization – Alex Birch (https://github.com/Birch-san/diffusers/pull/1), Amin Rezaei (https://github.com/AminRezaei0x443/memory-efficient-attention)Textual Inversion – Rinon Gal – https://github.com/rinongal/textual_inversion (were not using his code, but we are using his ideas).Idea for SD upscale – https://github.com/jquesnelle/txt2imghdNoise generation for outpainting mk2 – https://github.com/parlance-zz/g-diffuser-botCLIP interrogator idea and borrowing some code – https://github.com/pharmapsychotic/clip-interrogatorIdea for Composable Diffusion – https://github.com/energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorchxformers – https://github.com/facebookresearch/xformersDeepDanbooru – interrogator for anime diffusers https://github.com/KichangKim/DeepDanbooruSampling in float32 precision from a float16 UNet – marunine for the idea, Birch-san for the example Diffusers implementation (https://github.com/Birch-san/diffusers-play/tree/92feee6)Instruct pix2pix – Tim Brooks (star), Aleksander Holynski (star), Alexei A. Efros (no star) – https://github.com/timothybrooks/instruct-pix2pixSecurity advice – RyotaKInitial Gradio script – posted on 4chan by an Anonymous user. Thank you Anonymous user.
安装好,有一个网页页面可以开始画图。输入prompt,就可以绘图。
先生成几张公众号封面。