doc-to-lora/webui/SELF_GEN_VIEWER.md
2026-02-27 03:47:04 +00:00

31 lines
905 B
Markdown

# Self-Gen Data Viewer
Thanks Claude.
Running the viewer
```bash
uv run self_gen_viewer.py
```
Then open your browser and go to: **http://localhost:5001**
## Usage
1. **Select a Model Folder**: Choose from the dropdown list (e.g., `google/gemma-2-2b-it_temp_0.0_closed_qa_prob_1.0`)
2. **Select a Parquet File**: Once a folder is selected, available parquet files will appear
3. **Set Number of Samples**: Adjust the sample count (default: 100, max: 1000)
4. **Click "Load Data"**: View the visualized data with context and Q&A pairs
## Data Structure
The viewer expects data in the following structure:
```
data/raw_datasets/self_gen/
├── google/
│ └── gemma-2-2b-it_temp_0.0_closed_qa_prob_1.0/
│ └── fw_qa_v2/
│ └── *.parquet
└── mistralai/
└── Mistral-7B-Instruct-v0.2_temp_0.0_closed_qa_prob_1.0/
└── *.parquet
```