Anime Vision DiT Model Overview
I’m excited to introduce my latest checkpoint, Anime Vision DiT. Trained for 180,000 steps, it is designed to generate high-quality anime-style images with close attention to detail, vibrant colors, and expressive characters.
Model Details:
Type: Anime-themed model with vibrant details
Trigger Words: None required
Chinese language support: No
NSFW: No
Output: High-detail, high-resolution anime images with a focus on artistic expression and vivid scenes.
Configuration Used for Training:
GPU: A6000
Dataset: 5,000 anime images
Batch Size: 2
Optimizer: AdamW
Scheduler: Cosine
Learning Rate: 1e-5
Epochs: 100 (target)
Captioning: GPT-4
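To relate the step count to the epoch target, here is a rough sketch of the arithmetic. It assumes one optimizer step per batch (no gradient accumulation, single GPU) and a plain cosine decay with no warmup and a floor of zero. None of those details are stated above, so the helper names and the schedule shape are illustrative only.

```python
import math

# Values taken from the overview and the training configuration above.
DATASET_SIZE = 5_000     # images
BATCH_SIZE = 2
TARGET_EPOCHS = 100
TRAINED_STEPS = 180_000
BASE_LR = 1e-5


def steps_per_epoch(dataset_size: int, batch_size: int) -> int:
    """Optimizer steps per epoch, assuming one step per batch (assumption)."""
    return math.ceil(dataset_size / batch_size)


def cosine_lr(step: int, total_steps: int, base_lr: float, min_lr: float = 0.0) -> float:
    """Plain cosine decay from base_lr to min_lr, without warmup (assumption)."""
    progress = min(step, total_steps) / total_steps
    return min_lr + 0.5 * (base_lr - min_lr) * (1 + math.cos(math.pi * progress))


per_epoch = steps_per_epoch(DATASET_SIZE, BATCH_SIZE)  # 2500 steps per epoch
target_steps = per_epoch * TARGET_EPOCHS               # 250000 steps for 100 epochs
epochs_done = TRAINED_STEPS / per_epoch                # 72.0 epochs at 180,000 steps

print(f"steps per epoch: {per_epoch}")
print(f"steps for {TARGET_EPOCHS} epochs: {target_steps}")
print(f"epochs covered by {TRAINED_STEPS} steps: {epochs_done}")
print(f"cosine LR at step {TRAINED_STEPS}: {cosine_lr(TRAINED_STEPS, target_steps, BASE_LR):.2e}")
```

Under these assumptions, 180,000 steps correspond to 72 of the 100 target epochs. If the training run used gradient accumulation or more than one GPU, the numbers would differ.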
Quick Guide and Parameters:
VAE: SDXL
Sampler: dpmpp_2m
Scheduler: sgm_uniform (Recommended for best results)
Sampling Steps: 25+
CFG Scale: 7
Important: Please avoid using NSFW or mature content in your prompts, as it may lead to unreliable results. Additionally, shorter prompts tend to work better with both SD3 and DiT models.
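The sampler and scheduler names above match the identifiers used by ComfyUI's KSampler node, so the recommended settings can be sketched as a set of KSampler inputs. This assumes a ComfyUI workflow. The helper name `ksampler_inputs` is hypothetical, and the `model`, `positive`, `negative`, and `latent_image` links are left out because they depend on the rest of your graph.

```python
def ksampler_inputs(seed: int, steps: int = 25, cfg: float = 7.0) -> dict:
    """Build the scalar inputs for a ComfyUI KSampler node (hypothetical helper).

    Defaults follow the quick guide: dpmpp_2m sampler, sgm_uniform scheduler,
    25 or more steps, CFG scale 7.
    """
    if steps < 25:
        raise ValueError("the quick guide recommends 25 or more sampling steps")
    return {
        "seed": seed,
        "steps": steps,
        "cfg": cfg,
        "sampler_name": "dpmpp_2m",
        "scheduler": "sgm_uniform",
        "denoise": 1.0,  # full denoise for text-to-image (assumption)
    }


settings = ksampler_inputs(seed=42, steps=30)
print(settings)
```

The step check is only a guard for the recommendation in this guide, not a limit enforced by ComfyUI itself.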
Note:
This is not a merged or modified model; it is the original Realistic Vision fine-tuned model. Some users have been spreading incorrect information about this in the model's comment section. If you have questions or want to know more, join my Discord server or leave a comment. Thank you for your time.