This commit is contained in:
2026-02-05 10:15:09 +03:00
parent 2427fce842
commit 67241a5ed0
33 changed files with 13147 additions and 154 deletions

View File

@@ -0,0 +1,42 @@
# Consolidated CSV Data Documentation
## Overview
This directory contains a consolidated CSV file (`consolidated_data_simple.csv`) that combines data from multiple individual CSV files in the `sample_data` directory. Each original CSV file is identified by a sheet number and filename prefix in the consolidated file.
## File Structure
### `consolidated_data_simple.csv`
- **Columns**: `[Sheet_Number, Original_File, Original_Column_1, Original_Column_2, ...]`
- **Sheet Numbers**:
1. `8-Table 1.csv`
2. `1-Table 1.csv`
3. `10-Table 1.csv`
4. `Лист3-Table 1.csv`
5. `7-Table 1.csv`
6. `Реестр заявлений на перевод 252-Table 1.csv`
7. `ТАЙМПАД-Table 1.csv`
8. `6 -Table 1.csv`
9. `11-Table 1.csv`
10. `4 -Table 1.csv`
11. `3 -Table 1.csv`
12. `2 -Table 1.csv`
13. `5 -Table 1.csv`
14. `9-Table 1.csv`
15. `АНГЛ-Table 1.csv`
## Format Details
- Column 1: `Sheet_Number` - The numeric identifier for the original CSV file
- Column 2: `Original_File` - The filename of the original CSV file
- Columns 3+: The original data columns from each CSV file
## Purpose
This consolidated file is designed for AI/ML analysis where each original CSV sheet can be identified by its sheet number, allowing algorithms to treat each original dataset separately while analyzing the combined data.
## Total Records
- Total rows in consolidated file: 3283
- Number of original CSV files consolidated: 15
## Notes
- All files were encoded in UTF-8 to preserve Cyrillic characters
- Some original files may have been skipped if they did not contain student data (e.g., notification texts)
- The consolidation preserves the original row and column structure from each source file