Added
This commit is contained in:
42
scheduler_bots/README_consolidated_csv.md
Normal file
42
scheduler_bots/README_consolidated_csv.md
Normal file
@@ -0,0 +1,42 @@
|
||||
# Consolidated CSV Data Documentation
|
||||
|
||||
## Overview
|
||||
This directory contains a consolidated CSV file (`consolidated_data_simple.csv`) that combines data from multiple individual CSV files in the `sample_data` directory. Each original CSV file is identified by a sheet number and filename prefix in the consolidated file.
|
||||
|
||||
## File Structure
|
||||
|
||||
### `consolidated_data_simple.csv`
|
||||
- **Columns**: `[Sheet_Number, Original_File, Original_Column_1, Original_Column_2, ...]`
|
||||
- **Sheet Numbers**:
|
||||
1. `8-Table 1.csv`
|
||||
2. `1-Table 1.csv`
|
||||
3. `10-Table 1.csv`
|
||||
4. `Лист3-Table 1.csv`
|
||||
5. `7-Table 1.csv`
|
||||
6. `Реестр заявлений на перевод 252-Table 1.csv`
|
||||
7. `ТАЙМПАД-Table 1.csv`
|
||||
8. `6 -Table 1.csv`
|
||||
9. `11-Table 1.csv`
|
||||
10. `4 -Table 1.csv`
|
||||
11. `3 -Table 1.csv`
|
||||
12. `2 -Table 1.csv`
|
||||
13. `5 -Table 1.csv`
|
||||
14. `9-Table 1.csv`
|
||||
15. `АНГЛ-Table 1.csv`
|
||||
|
||||
## Format Details
|
||||
- Column 1: `Sheet_Number` - The numeric identifier for the original CSV file
|
||||
- Column 2: `Original_File` - The filename of the original CSV file
|
||||
- Columns 3+: The original data columns from each CSV file
|
||||
|
||||
## Purpose
|
||||
This consolidated file is designed for AI/ML analysis where each original CSV sheet can be identified by its sheet number, allowing algorithms to treat each original dataset separately while analyzing the combined data.
|
||||
|
||||
## Total Records
|
||||
- Total rows in consolidated file: 3283
|
||||
- Number of original CSV files consolidated: 15
|
||||
|
||||
## Notes
|
||||
- All files were encoded in UTF-8 to preserve Cyrillic characters
|
||||
- Some original files may have been skipped if they did not contain student data (e.g., notification texts)
|
||||
- The consolidation preserves the original row and column structure from each source file
|
||||
Reference in New Issue
Block a user