42 lines
1.7 KiB
Markdown
42 lines
1.7 KiB
Markdown
# Consolidated CSV Data Documentation
|
||
|
||
## Overview
|
||
This directory contains a consolidated CSV file (`consolidated_data_simple.csv`) that combines data from multiple individual CSV files in the `sample_data` directory. Each original CSV file is identified by a sheet number and filename prefix in the consolidated file.
|
||
|
||
## File Structure
|
||
|
||
### `consolidated_data_simple.csv`
|
||
- **Columns**: `[Sheet_Number, Original_File, Original_Column_1, Original_Column_2, ...]`
|
||
- **Sheet Numbers**:
|
||
1. `8-Table 1.csv`
|
||
2. `1-Table 1.csv`
|
||
3. `10-Table 1.csv`
|
||
4. `Лист3-Table 1.csv`
|
||
5. `7-Table 1.csv`
|
||
6. `Реестр заявлений на перевод 252-Table 1.csv`
|
||
7. `ТАЙМПАД-Table 1.csv`
|
||
8. `6 -Table 1.csv`
|
||
9. `11-Table 1.csv`
|
||
10. `4 -Table 1.csv`
|
||
11. `3 -Table 1.csv`
|
||
12. `2 -Table 1.csv`
|
||
13. `5 -Table 1.csv`
|
||
14. `9-Table 1.csv`
|
||
15. `АНГЛ-Table 1.csv`
|
||
|
||
## Format Details
|
||
- Column 1: `Sheet_Number` - The numeric identifier for the original CSV file
|
||
- Column 2: `Original_File` - The filename of the original CSV file
|
||
- Columns 3+: The original data columns from each CSV file
|
||
|
||
## Purpose
|
||
This consolidated file is designed for AI/ML analysis where each original CSV sheet can be identified by its sheet number, allowing algorithms to treat each original dataset separately while analyzing the combined data.
|
||
|
||
## Total Records
|
||
- Total rows in consolidated file: 3283
|
||
- Number of original CSV files consolidated: 15
|
||
|
||
## Notes
|
||
- All files were encoded in UTF-8 to preserve Cyrillic characters
|
||
- Some original files may have been skipped if they did not contain student data (e.g., notification texts)
|
||
- The consolidation preserves the original row and column structure from each source file |