mirror of
https://github.com/SakanaAI/doc-to-lora.git
synced 2026-04-26 08:36:23 +02:00
Doc-to-LoRA release
This commit is contained in:
commit
1abe8ae16d
92 changed files with 22131 additions and 0 deletions
31
webui/SELF_GEN_VIEWER.md
Normal file
31
webui/SELF_GEN_VIEWER.md
Normal file
|
|
@ -0,0 +1,31 @@
|
|||
# Self-Gen Data Viewer
|
||||
|
||||
Thanks Claude.
|
||||
|
||||
Running the viewer
|
||||
```bash
|
||||
uv run self_gen_viewer.py
|
||||
```
|
||||
|
||||
Then open your browser and go to: **http://localhost:5001**
|
||||
|
||||
## Usage
|
||||
|
||||
1. **Select a Model Folder**: Choose from the dropdown list (e.g., `google/gemma-2-2b-it_temp_0.0_closed_qa_prob_1.0`)
|
||||
2. **Select a Parquet File**: Once a folder is selected, available parquet files will appear
|
||||
3. **Set Number of Samples**: Adjust the sample count (default: 100, max: 1000)
|
||||
4. **Click "Load Data"**: View the visualized data with context and Q&A pairs
|
||||
|
||||
## Data Structure
|
||||
|
||||
The viewer expects data in the following structure:
|
||||
```
|
||||
data/raw_datasets/self_gen/
|
||||
├── google/
|
||||
│ └── gemma-2-2b-it_temp_0.0_closed_qa_prob_1.0/
|
||||
│ └── fw_qa_v2/
|
||||
│ └── *.parquet
|
||||
└── mistralai/
|
||||
└── Mistral-7B-Instruct-v0.2_temp_0.0_closed_qa_prob_1.0/
|
||||
└── *.parquet
|
||||
```
|
||||
Loading…
Add table
Add a link
Reference in a new issue