# Consolidated CSV Data Documentation ## Overview This directory contains a consolidated CSV file (`consolidated_data_simple.csv`) that combines data from multiple individual CSV files in the `sample_data` directory. Each original CSV file is identified by a sheet number and filename prefix in the consolidated file. ## File Structure ### `consolidated_data_simple.csv` - **Columns**: `[Sheet_Number, Original_File, Original_Column_1, Original_Column_2, ...]` - **Sheet Numbers**: 1. `8-Table 1.csv` 2. `1-Table 1.csv` 3. `10-Table 1.csv` 4. `Лист3-Table 1.csv` 5. `7-Table 1.csv` 6. `Реестр заявлений на перевод 252-Table 1.csv` 7. `ТАЙМПАД-Table 1.csv` 8. `6 -Table 1.csv` 9. `11-Table 1.csv` 10. `4 -Table 1.csv` 11. `3 -Table 1.csv` 12. `2 -Table 1.csv` 13. `5 -Table 1.csv` 14. `9-Table 1.csv` 15. `АНГЛ-Table 1.csv` ## Format Details - Column 1: `Sheet_Number` - The numeric identifier for the original CSV file - Column 2: `Original_File` - The filename of the original CSV file - Columns 3+: The original data columns from each CSV file ## Purpose This consolidated file is designed for AI/ML analysis where each original CSV sheet can be identified by its sheet number, allowing algorithms to treat each original dataset separately while analyzing the combined data. ## Total Records - Total rows in consolidated file: 3283 - Number of original CSV files consolidated: 15 ## Notes - All files were encoded in UTF-8 to preserve Cyrillic characters - Some original files may have been skipped if they did not contain student data (e.g., notification texts) - The consolidation preserves the original row and column structure from each source file