OpenAI Fine-Tuning API for Fine-Tuning & Custom Models

Converts raw text datasets into structured LoRA-compatible training formats with tokenization guidance

Examples

Basic JSONL conversion

{
  "raw_text": "Hello world\nThis is a test",
  "format_type": "JSONL"
}

Expected output

["Hello world\nThis is a test"]

CSV with tokens

{
  "raw_text": "System: Keep it concise\nUser: Explain AI\nAssistant: Artificial intelligence...",
  "format_type": "CSV",
  "add_special_tokens": true
}

Expected output

text\n"<s>[INST] Keep it concise\nExplain AI[/INST] Artificial intelligence..."

How it works

The tool takes raw text intended for LoRA training and, when add_special_tokens is true, wraps it in model-specific special tokens (for example <s>, [INST], and [/INST], as in the CSV example above). It then serializes the result in the format named by format_type (JSONL or CSV) so that training frameworks can load it directly.
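The tool's implementation is not shown on this page, so the following is only a minimal Python sketch of the behavior the two examples suggest. The function names format_training_data and wrap_with_special_tokens are hypothetical, and the [INST] template and the "System:/User:/Assistant:" role labels are assumptions inferred from the CSV example rather than a documented specification.

```python
import csv
import io
import json


def wrap_with_special_tokens(raw_text: str) -> str:
    """Fold "System:/User:/Assistant:" lines into one [INST]-style string.

    Assumption: the template is inferred from the CSV example on this page,
    not taken from the tool's source.
    """
    prompt_parts, answer_parts = [], []
    for line in raw_text.split("\n"):
        role, sep, content = line.partition(":")
        role, content = role.strip(), content.strip()
        if sep and role == "Assistant":
            answer_parts.append(content)
        elif sep and role in ("System", "User"):
            prompt_parts.append(content)
        else:
            prompt_parts.append(line)  # unlabelled lines count as prompt text
    prompt = "\n".join(prompt_parts)
    answer = "\n".join(answer_parts)
    return f"<s>[INST] {prompt}[/INST] {answer}"


def format_training_data(raw_text: str, format_type: str = "JSONL",
                         add_special_tokens: bool = False) -> str:
    """Serialize raw text in the requested format (hypothetical helper)."""
    text = wrap_with_special_tokens(raw_text) if add_special_tokens else raw_text
    if format_type == "JSONL":
        # Mirrors the first example: a JSON array holding the text as one string.
        return json.dumps([text])
    if format_type == "CSV":
        buffer = io.StringIO()
        writer = csv.writer(buffer, lineterminator="\n")
        writer.writerow(["text"])  # single-column header, as in the second example
        writer.writerow([text])    # fields containing newlines are quoted by csv
        return buffer.getvalue()
    raise ValueError(f"unsupported format_type: {format_type!r}")


if __name__ == "__main__":
    print(format_training_data("Hello world\nThis is a test", "JSONL"))
```

In this sketch the CSV branch writes real line breaks inside the quoted field, whereas the expected output above displays them as \n; reading the result back with csv.reader recovers the original multi-line text either way.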
