- Renamed run_tests_with_better_output.py to run_tests_by_system.py (more descriptive name)
- Removed legacy test_phase1.py (it collected no tests)
- Updated test sections to reflect current test structure
- Test runner now groups tests by system/component and reports timing
- All 197 tests passing
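
For reviewers, the grouping-and-timing behavior can be sketched roughly as below. This is an illustrative sketch only, not the contents of run_tests_by_system.py: the function names `group_by_system` and `run_groups`, the `tests/<system>/test_*.py` layout, and the `run_group` callback are all assumptions made for the example.

```python
import time
from collections import defaultdict
from pathlib import PurePosixPath


def group_by_system(test_files):
    """Group test file paths by their parent directory name (hypothetical layout)."""
    groups = defaultdict(list)
    for path in test_files:
        # Files with no parent directory fall into a catch-all "root" group.
        system = PurePosixPath(path).parent.name or "root"
        groups[system].append(path)
    return dict(groups)


def run_groups(groups, run_group):
    """Run each group via the supplied callback and record (passed, seconds)."""
    results = {}
    for system, files in sorted(groups.items()):
        start = time.perf_counter()
        passed = run_group(files)  # callback decides how the files are executed
        results[system] = (passed, time.perf_counter() - start)
    return results


if __name__ == "__main__":
    files = ["tests/auth/test_login.py", "tests/db/test_query.py"]
    # Stand-in callback; a real runner would execute the files instead.
    for system, (passed, seconds) in run_groups(group_by_system(files), lambda f: True).items():
        print(f"{system}: {'PASS' if passed else 'FAIL'} ({seconds:.2f}s)")
```

In a real runner, `run_group` might wrap something like `subprocess.run([sys.executable, "-m", "pytest", *files]).returncode == 0`; that wiring is likewise an assumption here.
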

Co-authored-by: openhands <openhands@all-hands.dev>