[GSOC 25] Enhanced Interactive Pipeline Development Environment for JupyterLab
GSoC 2025 Progress Tracker
Student: [Canyu CHEN] (@Chenzo1001)
Mentors: [XQ Hu] (@liferoad)
Organization: [Apache Beam]
Proposal Link: Here
📌 Project Overview
BeamVision transforms Apache Beam pipeline development in JupyterLab by replacing fragmented visualizations with an integrated workspace. The solution implements:
- Dynamic side panel for consolidated pipeline inspection
- Hybrid rendering (kernel HTML + D3.js) per technical specs
- Cluster management for Flink/Dataproc integration
- Multi-tab comparison interface for workflow analysis
Key Objectives:
- [x] Upgrade the Sidepanel extension to JupyterLab 4.x (PR #34495)
- [ ] Fix bugs in Interactive Beam and the Sidepanel
- [ ] Implement collapsible visualization panel
- [ ] Develop kernel-to-frontend data protocol
- [ ] Achieve 100% backward compatibility
- [ ] Deliver Flink cluster control interface
- [ ] Optimize rendering for 1GB+ datasets
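To make the kernel-to-frontend data protocol objective more concrete, here is a minimal sketch of what a versioned JSON message envelope could look like on the kernel side. The project has not defined the protocol yet, so everything here is an assumption for illustration: the names `PipelineGraphMessage`, `encode_message`, `decode_message`, and `PROTOCOL_VERSION`, and the `nodes`/`edges` payload shape, are hypothetical and not part of Apache Beam or the Sidepanel extension.

```python
import json
from dataclasses import dataclass, asdict
from typing import Any, Dict

# Hypothetical version tag; the real protocol (if any) is not yet specified.
PROTOCOL_VERSION = 1


@dataclass
class PipelineGraphMessage:
    """Hypothetical envelope a kernel could send to the side panel."""
    msg_type: str            # e.g. "pipeline_graph" (illustrative name)
    payload: Dict[str, Any]  # JSON-serializable body
    version: int = PROTOCOL_VERSION


def encode_message(msg: PipelineGraphMessage) -> str:
    """Serialize the envelope to a JSON string with stable key order."""
    return json.dumps(asdict(msg), sort_keys=True)


def decode_message(raw: str) -> PipelineGraphMessage:
    """Parse a JSON string and reject envelopes with an unknown version."""
    data = json.loads(raw)
    if data.get("version") != PROTOCOL_VERSION:
        raise ValueError(f"unsupported protocol version: {data.get('version')!r}")
    return PipelineGraphMessage(
        msg_type=data["msg_type"],
        payload=data["payload"],
        version=data["version"],
    )


if __name__ == "__main__":
    # Illustrative payload: a two-step pipeline described as nodes and edges.
    message = PipelineGraphMessage(
        msg_type="pipeline_graph",
        payload={"nodes": ["Read", "Count"], "edges": [["Read", "Count"]]},
    )
    wire = encode_message(message)
    print(wire)
    print(decode_message(wire) == message)
```

Carrying an explicit version field in every message is one possible way to support the backward-compatibility objective: an older frontend could detect and reject a message it does not understand instead of rendering it incorrectly.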
🗓 Timeline
Community Bonding Period (May 1 - May 31)
- [x] Establish dev environment (Beam 2.65, JupyterLab 4.3)
- [x] Upgrade the JupyterLab version of the Sidepanel.
- [x] Attend Beam College and its hackathon (hackathon demo).
Coding Phase 1 (June 1 - June 30)
Milestone 1: []
- [ ] Fix the crash when running `pip install apache-beam[interactive]` on Google Colab.
- [ ] Release the upgraded NPM package apache-beam-jupyterlab-sidepanel v4.0
Coding Phase 2 (July 1 - July 31)
Milestone 2: []
Final Phase (August 1 - August 31)
Milestone 3: []
🎯 Final Deliverables
- [ ] Upgraded JupyterLab Sidepanel extension
- [ ] Interactive demo notebook
- [ ] New Sidepanel UI/UX
- [ ] Release documents, etc.
- [ ] Fixed bugs.