Files
ai7-m3/scheduler_bots/README_consolidated_csv.md
2026-02-05 10:15:09 +03:00

42 lines
1.7 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Consolidated CSV Data Documentation
## Overview
This directory contains a consolidated CSV file (`consolidated_data_simple.csv`) that combines data from multiple individual CSV files in the `sample_data` directory. Each original CSV file is identified by a sheet number and filename prefix in the consolidated file.
## File Structure
### `consolidated_data_simple.csv`
- **Columns**: `[Sheet_Number, Original_File, Original_Column_1, Original_Column_2, ...]`
- **Sheet Numbers**:
1. `8-Table 1.csv`
2. `1-Table 1.csv`
3. `10-Table 1.csv`
4. `Лист3-Table 1.csv`
5. `7-Table 1.csv`
6. `Реестр заявлений на перевод 252-Table 1.csv`
7. `ТАЙМПАД-Table 1.csv`
8. `6 -Table 1.csv`
9. `11-Table 1.csv`
10. `4 -Table 1.csv`
11. `3 -Table 1.csv`
12. `2 -Table 1.csv`
13. `5 -Table 1.csv`
14. `9-Table 1.csv`
15. `АНГЛ-Table 1.csv`
## Format Details
- Column 1: `Sheet_Number` - The numeric identifier for the original CSV file
- Column 2: `Original_File` - The filename of the original CSV file
- Columns 3+: The original data columns from each CSV file
## Purpose
This consolidated file is designed for AI/ML analysis where each original CSV sheet can be identified by its sheet number, allowing algorithms to treat each original dataset separately while analyzing the combined data.
## Total Records
- Total rows in consolidated file: 3283
- Number of original CSV files consolidated: 15
## Notes
- All files were encoded in UTF-8 to preserve Cyrillic characters
- Some original files may have been skipped if they did not contain student data (e.g., notification texts)
- The consolidation preserves the original row and column structure from each source file