[GSOC 25] Enhanced Interactive Pipeline Development Environment for JupyterLab

Open Chenzo1001 opened this issue 6 months ago • 0 comments

GSoC 2025 Progress Tracker

Student: Canyu CHEN (@Chenzo1001)
Mentors: XQ Hu (@liferoad)
Organization: Apache Beam
Proposal Link: Here

📌 Project Overview

BeamVision transforms Apache Beam pipeline development in JupyterLab by replacing fragmented visualizations with an integrated workspace. The solution implements:

  • Dynamic side panel for consolidated pipeline inspection
  • Hybrid rendering (kernel HTML + D3.js) per technical specs
  • Cluster management for Flink/Dataproc integration
  • Multi-tab comparison interface for workflow analysis

Key Objectives:

  • [x] Upgrade the Sidepanel to JupyterLab 4.x (PR #34495)
  • [ ] Fix bugs in Interactive Beam and the Sidepanel
  • [ ] Implement collapsible visualization panel
  • [ ] Develop kernel-to-frontend data protocol
  • [ ] Achieve 100% backward compatibility
  • [ ] Deliver Flink cluster control interface
  • [ ] Optimize rendering for 1GB+ datasets
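
As a rough illustration of the kernel-to-frontend data protocol objective above, here is a minimal sketch of a versioned JSON message that a kernel could send to a side panel for rendering. Everything in it is an assumption for discussion: `PipelineGraphMessage`, `encode_message`, `decode_message`, and `SUPPORTED_VERSION` are hypothetical names, not part of Apache Beam or JupyterLab, and the actual protocol has not been designed yet.

```python
import json
from dataclasses import asdict, dataclass

# Hypothetical protocol version; bumping it lets a frontend reject
# payloads it does not understand instead of misrendering them.
SUPPORTED_VERSION = 1


@dataclass
class PipelineGraphMessage:
    """Hypothetical payload describing a pipeline graph (not a Beam type)."""
    version: int
    nodes: list  # transform labels, e.g. ["Read", "Count"]
    edges: list  # [source, target] label pairs


def encode_message(msg: PipelineGraphMessage) -> str:
    """Serialize the message to JSON on the kernel side."""
    return json.dumps(asdict(msg), sort_keys=True)


def decode_message(raw: str) -> PipelineGraphMessage:
    """Parse a JSON payload and refuse versions newer than supported."""
    data = json.loads(raw)
    if data["version"] > SUPPORTED_VERSION:
        raise ValueError(f"unsupported protocol version: {data['version']}")
    return PipelineGraphMessage(**data)


msg = PipelineGraphMessage(
    version=1,
    nodes=["Read", "Count"],
    edges=[["Read", "Count"]],
)
raw = encode_message(msg)
```

Carrying an explicit version field in every message is one possible way to approach the backward-compatibility objective, since older and newer frontends can decide how to handle a payload before touching its contents.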

🗓 Timeline

Community Bonding Period (May 1 - May 31)

  • [x] Establish dev environment (Beam 2.65, JupyterLab 4.3)
  • [x] Upgrade the JupyterLab version of the Sidepanel
  • [x] Attend Beam College and its hackathon (hackathon demo)

Coding Phase 1 (June 1 - June 30)

Milestone 1: []

Coding Phase 2 (July 1 - July 31)

Milestone 2: []

Final Phase (August 1 - August 31)

Milestone 3: []

🎯 Final Deliverables

  • [ ] Upgraded JupyterLab Sidepanel extension
  • [ ] Interactive demo notebook
  • [ ] New Sidepanel UI/UX
  • [ ] Release documentation
  • [ ] Bug fixes

💡 Additional Notes

Chenzo1001, Jun 03 '25 05:06