Found 2 Skills
Fine-tune LLMs with Unsloth using GRPO or SFT. Supports FP8, vision models, mobile deployment, Docker, packing, GGUF export. Use when: train with GRPO, fine-tune, reward functions, SFT training, FP8 training, vision fine-tuning, phone deployment, docker training, packing, export to GGUF.
Expert guidance on GRPO/RL fine-tuning with TRL, for reasoning and task-specific model training.