I'm looking for advice on extracting data from various Excel timetable files using Python. I've experimented with regular expressions, but the timetable formats vary so much that no single pattern works across files. An "AI oversight" approach works but tends to be slow. Does anyone have suggestions or resources that could help streamline this process?
2 Answers
Have you considered using AI to read each row? The OpenRouter API could be a good option to explore.
I haven't encountered any tricky Excel files before. Can you share an example of a row or field that isn’t working for you? You don’t have to provide real data, just something that illustrates the pattern.

The formats of the timetables vary too much. One solution works well for one file but fails on others.
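
If most of the variation is in where the header row sits and how many title or blank rows come before it, one option is to find the header heuristically instead of hard-coding row numbers or writing a regex per file. Below is a minimal sketch of that idea, not a complete solution. It assumes each sheet has already been read into a list of row tuples (for example with openpyxl's `ws.iter_rows(values_only=True)`), and that headers contain day names or clock times. `DAY_WORDS`, `TIME_RE`, `header_score`, `find_header_row` and `to_records` are names made up for this example.

```python
import re

# Assumed vocabulary: English day-name prefixes. Extend for your own files.
DAY_WORDS = {"mon", "tue", "wed", "thu", "fri", "sat", "sun"}
# Assumed time format: cells starting with something like 9:00 or 09.30.
TIME_RE = re.compile(r"^\s*\d{1,2}[:.]\d{2}")


def header_score(row):
    """Count cells in a row that look like a day name or a clock time."""
    score = 0
    for cell in row:
        if cell is None:
            continue
        text = str(cell).strip().lower()
        if text[:3] in DAY_WORDS or TIME_RE.match(text):
            score += 1
    return score


def find_header_row(rows, max_scan=20):
    """Return the index of the most header-like row in the first max_scan rows."""
    best_index, best_score = None, 0
    for i, row in enumerate(rows[:max_scan]):
        score = header_score(row)
        if score > best_score:
            best_index, best_score = i, score
    return best_index


def to_records(rows):
    """Flatten a timetable grid into (row_label, column_label, value) tuples."""
    h = find_header_row(rows)
    if h is None:
        return []
    header = rows[h]
    records = []
    for row in rows[h + 1:]:
        if not row or row[0] is None:
            continue  # skip blank or unlabeled rows
        for col_label, value in zip(header[1:], row[1:]):
            if col_label is not None and value is not None:
                records.append((str(row[0]), str(col_label), str(value)))
    return records


# Made-up sample grid: a title row, a blank row, then the actual table.
sample = [
    ("School timetable", None, None),
    (None, None, None),
    ("Time", "Monday", "Tuesday"),
    ("09:00", "Math", "Art"),
    ("10:00", None, "Physics"),
]
records = to_records(sample)
```

Because every file ends up as the same flat list of tuples, the downstream code doesn't need to know which layout it came from. Anything the heuristic can't place (no header found, or suspiciously few records) could be routed to the slower AI pass, so that only the odd files pay that cost.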