Is there any actual standalone AI software?

Ad4mWayn3@lemmy.world · 1 year ago

Is there any actual standalone AI software?

ichbinjasokreativ@lemmy.world · 1 year ago

Stable Diffusion and Ollama for local image and text generation. Both are super easy to set up on Linux and support GPU acceleration out of the box.

MacN'Cheezus@lemmy.today · 1 year ago

For LLMs, the already mentioned LM Studio does a good job as far as beginner friendliness goes.

For text-to-image, I like Fooocus, a custom Stable Diffusion setup with automatic prompt enhancement that can comfortably compete with Midjourney.

Here’s a setup guide for first-time users. There’s also an online version if you want to try it out.

Grimy@lemmy.world · 1 year ago

You need a GPU for any kind of performance.

For text I suggest:

- Ollama (backend): command-line interface that makes it very easy to download models with a single command. It supports most models, and you can chat with the model right in the terminal, so it works standalone.
- OpenWebUI: easy to install with Docker and designed to work with Ollama. It comes with web search and PDF upload, and a bunch of community tools and modules are available.
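Besides the terminal chat, Ollama also exposes a local HTTP API, so you can script it. Here is a minimal Python sketch, assuming Ollama is running on its default address (`localhost:11434`) and that the model has already been downloaded (for example with `ollama pull gemma2`). The helper names `build_request` and `ask` are made up for illustration.

```python
import json
import urllib.request

# Default address of a locally running Ollama server (assumption: not changed by the user).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model: str, prompt: str) -> urllib.request.Request:
    """Build a POST request for Ollama's /api/generate endpoint."""
    payload = {
        "model": model,    # e.g. "gemma2"; must already be pulled locally
        "prompt": prompt,
        "stream": False,   # ask for one JSON object instead of a token stream
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(model: str, prompt: str, timeout: float = 120.0) -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(model, prompt), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]
```

Calling `ask("gemma2", "Explain VRAM in one sentence.")` should return the model's answer as a string, provided the server is up. It only uses the standard library, so there is nothing extra to install.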

For images I suggest either:

- Automatic1111: traditional UI built on Gradio. Lots of extras you can download through the UI to do different things.
- ComfyUI: node-based UI, a bit more complicated but more powerful than Automatic1111.

For models, you can go to Civitai, download whatever you need, and drop the files into the respective model folders for both Automatic1111 and ComfyUI.

For text, there’s also LM Studio, which is very user friendly. It is closed source, though, and much slower than Ollama in my experience.

I have a 4060 in my laptop (8 GB VRAM) and I’m getting about one image every 2 seconds with Stable Diffusion 1.5 models, and text speed is on par with ChatGPT with the smaller 8B-9B models. For text I suggest gemma2, which is probably the best small model out right now.

JackGreenEarth@lemm.ee · 1 year ago

I use Krita with the AI Diffusion plugin for image generation, which is working great, and Jan for text generation, using the Llama 3 8B Q4 model. I have an NVIDIA GTX 1660 Ti with 6 GB of VRAM and both are reasonably fast.
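As a rough sanity check on why an 8B model at 4-bit quantization fits on a 6 GB card, here is a back-of-the-envelope estimate in Python. It is only a sketch: the function names are made up, the 1.5 GB overhead for context cache and runtime buffers is an assumed placeholder, and real Q4 formats store slightly more than 4 bits per weight on average.

```python
def estimate_weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 10^9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits_in_vram(params_billion: float, bits_per_weight: float,
                 vram_gb: float, overhead_gb: float = 1.5) -> bool:
    """Rough check: weights plus an assumed fixed overhead must fit in VRAM.

    overhead_gb is a guess covering the context cache and runtime buffers;
    the real number depends on context length and the runtime used.
    """
    return estimate_weights_gb(params_billion, bits_per_weight) + overhead_gb <= vram_gb


# 8B parameters at 4 bits: about 4 GB of weights, so it fits on a 6 GB card.
print(estimate_weights_gb(8, 4), fits_in_vram(8, 4, vram_gb=6))
# The same model at 16 bits needs about 16 GB for the weights alone.
print(estimate_weights_gb(8, 16), fits_in_vram(8, 16, vram_gb=6))
```

By this estimate the quantized model leaves roughly 2 GB of headroom on a 6 GB card, while the unquantized 16-bit version would not fit at all.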

Starbuck@lemmy.world · 1 year ago

If you are into development, the setup I use is Ollama running codegemma:7b along with the Continue.dev plugin for VS Code.

random72guy@lemmy.world · edit-2 1 year ago

For LLMs, I’ve had really good results running Llama 3 in the Open WebUI Docker container on an Nvidia Titan X (12 GB VRAM).

For image generation tho, I agree more VRAM is better, but the algorithms still struggle with large image dimensions, so you wind up needing to start small and iteratively upscale, which afaik works ok on weaker GPUs but can still run into problems. (I’ve been using the Automatic1111 mode of the Stable Diffusion Web UI Docker project.)
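To make the "start small and iteratively upscale" idea concrete, here is a small Python sketch that just plans the sequence of resolutions. It does not call any Stable Diffusion API. The function name, the 1.5x factor and the 512 to 2048 sizes are assumptions picked for illustration; sizes are kept at multiples of 8 because Stable Diffusion UIs generally expect that.

```python
def _grow(size: int, target: int, factor: float, multiple: int) -> int:
    """Scale one dimension up, snapped down to `multiple`, capped at `target`."""
    if size >= target:
        return size
    scaled = int(size * factor) // multiple * multiple
    # Always advance by at least one `multiple` so the loop below terminates.
    return min(target, max(size + multiple, scaled))


def upscale_schedule(start, target, factor=1.5, multiple=8):
    """Return the list of (width, height) passes from `start` up to `target`.

    The final pass uses `target` as given, even if it is not a multiple of 8.
    """
    if factor <= 1:
        raise ValueError("factor must be greater than 1")
    w, h = start
    tw, th = target
    steps = [(w, h)]
    while w < tw or h < th:
        w = _grow(w, tw, factor, multiple)
        h = _grow(h, th, factor, multiple)
        steps.append((w, h))
    return steps


for width, height in upscale_schedule((512, 512), (2048, 2048)):
    print(f"{width}x{height}")
```

Each pass would be one generate-or-upscale run in the UI, so a smaller factor means more passes but less VRAM pressure per step.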

I’m on thumbs so I don’t have the links to the git repos atm, but you basically clone them and run the docker compose files. The readmes are pretty good!

Evotech@lemmy.world · 1 year ago

ComfyUI is the best for image AI