fal (fal)

FLUX Prompt Generator Updates

- gokaygokay/FLUX-Prompt-Generator

- There are now hundreds of new selections across diverse categories, each offering a lot of choices:

Architecture, Art, Artist, Brands, Character, Cinematic, Fashion, Feelings, Geography, Human, Interaction, Keywords, Objects, People, Photography, Plots, Poses, Scene, Science, Stuff, Time, Typography, Vehicle, Video Game

- In addition to Hugging Face, I've integrated new LLM providers: Groq, OpenAI, and Claude.

- Upgraded Vision Language Models (VLMs): We now feature Qwen2-VL, JoyCaption and Florence-2-large.

- New specialized system prompts for various styles and themes, including Happy, Simple, Poster, Only Objects, No Figure, Landscape, Fantasy.

2 replies

·

isidentical

posted an update 4 months ago

Post

3167

Introducing fal/AuraFace-v1: Commercially available & open source identity encoder model for next generation one shot personalization. Read more about it here: https://huggingface.co/blog/isidentical/auraface

isidentical

posted an update 5 months ago

Post

619

Added FLUX.1 pro/dev/schnell and AuraFlow v0.2 to fal/imgsys !!! Go play with it and get us some votez

isidentical

posted an update 5 months ago

Post

1797

fal/AuraFlow-v0.3 is now here with support for different aspect resolutions (w/h up to 1536px!) and much nicer aesthetics! Make sure to install the latest diffusers to get support for it.

gokaygokay

posted an update 5 months ago

Post

10035

I've built a space for creating prompts for FLUX

gokaygokay/FLUX-Prompt-Generator

You can create long prompts from images or simple words. Enhance your short prompts with prompt enhancer. You can configure various settings such as artform, photo type, character details, scene details, style, and artist to create tailored prompts.

And you can combine all of them with custom prompts using llms (Mixtral, Mistral, Llama 3, and Mistral-Nemo).

The UI is a bit complex, but it includes almost everything you need. Choosing random option is the most fun!

And i've created some other spaces for using FLUX models with captioners and enhancers.

- gokaygokay/FLUX.1-dev-with-Captioner

4 replies

·

gokaygokay

posted an update 6 months ago

Post

4768

InSPyReNet Background Removal

I've built a space for fast background removal.

- gokaygokay/Inspyrenet-Rembg

- https://github.com/plemeri/InSPyReNet

2 replies

·

gokaygokay

posted an update 6 months ago

Post

4651

I've made a creative version of Tile Upscaler

- gokaygokay/TileUpscalerV2

- https://github.com/gokayfem/Tile-Upscaler

- New tiling strategy
- Now it's closer to Clarity Upscaler
- It has more parameters to play and it has more room to fail because of that
- You should try different resolutions, strength and controlnet strength

Original Tile Upscaler
- gokaygokay/Tile-Upscaler

isidentical

posted an update 6 months ago

Post

4024

Announcing the second open model in our Aura series of media models at @fal : fal/AuraFlow

Try it using diffusers or ComfyUI from publicly available weights, and read more about it in our blog https://blog.fal.ai/auraflow.

3 replies

·

gokaygokay

posted an update 6 months ago

Post

6192

Kolors with VLM support

I've built a space for using Kolors image generation model with captioner models and prompt enhancers.

- Space with VLM and Prompt Enhancer
gokaygokay/KolorsPlusPlus

- Original Space for model
gokaygokay/Kolors

- Captioner VLMs
- gokaygokay/sd3-long-captioner-v2

- microsoft/Florence-2-base

- Prompt Enhancers
- gokaygokay/Lamini-Prompt-Enchance-Long

- gokaygokay/Lamini-Prompt-Enchance

gokaygokay

posted an update 6 months ago

Post

5031

Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis

Kolors is a large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team.

Hugging Face Spaces
- gokaygokay/Kolors

Model Page
- Kwai-Kolors/Kolors

gokaygokay

posted an update 6 months ago

Post

4057

I've created a space for chatting with Gemma 2 using llama.cpp

- 🎛️ Choose between 27B IT and 9b IT models
- 🚀 Fast inference using llama.cpp

- gokaygokay/Gemma-2-llamacpp

1 reply

·

gokaygokay

posted an update 6 months ago

Post

3005

I've created a Stable Diffusion 3 (SD3) image generation space for convenience. Now you can:

1. Generate SD3 prompts from images
2. Enhance your text prompts (turn 1-2 words into full SD3 prompts)

https://huggingface.co/spaces/gokaygokay/SD3-with-VLM-and-Prompt-Enhancer

These features are based on my custom models:

- VLM captioner for prompt generation:
- gokaygokay/sd3-long-captioner

- Prompt Enhancers for SD3 Models:
- gokaygokay/Lamini-Prompt-Enchance-Long
- gokaygokay/Lamini-Prompt-Enchance

You can now simplify your SD3 workflow with these tools!

isidentical

posted an update 7 months ago

Post

1511

It is time for some Aura.

First in our series of fully open sourced / commercially available models by @fal-ai : AuraSR - a 600M parameter upscaler based on GigaGAN.

Blog: https://blog.fal.ai/introducing-aurasr-an-open-reproduction-of-the-gigagan-upscaler-2/

HF: https://huggingface.co/fal-ai/AuraSR

Code: https://github.com/fal-ai/aura-sr

Playground: https://fal.ai/models/fal-ai/aura-sr/playground

What other models would you like to see open-sourced and commercially available? :)

gokaygokay

posted an update 7 months ago

Post

5950

I've fine-tuned three types of PaliGemma image captioner models for generating prompts for Text2Image models. They generate captions similar to prompts we give to the image generation models. I used google/docci and google/imageinwords datasets for fine-tuning.

This one gives you longer captions.

gokaygokay/SD3-Long-Captioner

This one gives you middle size captions.

https://huggingface.co/spaces/gokaygokay/SD3-Long-Captioner-V2

And this one gives you shorter captions.

https://huggingface.co/spaces/gokaygokay/SDXL-Captioner

10 replies

·

isidentical

posted an update 7 months ago

Post

1246

One shot evaluations is hard. That is honestly what I learnt throughout the last couple of weeks trying to make imgsys.org data more and more relevant. There is just so much diversity in these models that saying one is better than other one even at a particular domain is impossible.

If you have any suggestions on how we can make the testing easier for one shot, single question image model testing; please give your suggestions under this thread so we can provide a more meaningful data point to the community!

Warlord-K

posted an update 8 months ago

Post

1402

What are some areas that Image generation models are currently lacking in?

4 replies

·

isidentical

posted an update 9 months ago

Post

2106

Happy to announce https://imgsys.org -- a sister project to Chatbot Arena by lmsys -- for comparing different text guided image generation models models. Try it natively on HuggingFace: https://huggingface.co/spaces/fal-ai/imgsys

1 reply

·

fal

AI & ML interests

Recent Activity

fal's activity

[bot] Conversion to Parquet

Continuous latents

fal/cosmos-openvid-1m

AI & ML interests

Recent Activity

Team members 12

fal's activity

[bot] Conversion to Parquet

Continuous latents