[Failing Test]: Utilization of 'Source' and 'Sink'
What happened?
https://github.com/vaishnavipandey-vp/Apache-Beam-YAML/blob/c467c59f5f13a00514e90be1cfd505982577524b/Apache-Beam-YAML/Pipelines/transformUsingSourceSink.yaml#L1C1-L24C17
Not able to utilize 'source' and 'sink' transforms, facing challenges.
Issue Failure
Failure: Test is continually failing
Issue Priority
Priority: 1 (unhealthy code / failing or flaky postcommit so we cannot be sure the product is healthy)
Issue Components
- [ ] Component: Python SDK
- [ ] Component: Java SDK
- [ ] Component: Go SDK
- [ ] Component: Typescript SDK
- [ ] Component: IO connector
- [X] Component: Beam YAML
- [ ] Component: Beam examples
- [ ] Component: Beam playground
- [ ] Component: Beam katas
- [ ] Component: Website
- [ ] Component: Infrastructure
- [ ] Component: Spark Runner
- [ ] Component: Flink Runner
- [ ] Component: Samza Runner
- [ ] Component: Twister2 Runner
- [ ] Component: Hazelcast Jet Runner
- [ ] Component: Google Cloud Dataflow Runner
Your source and sink parameters should contain a single transform not a list of them (even if this is a list of size 1). We could look into providing a better error here.
.take-issue
Hi @robertwb
I would like to work on this issue, my proposal is the following:
- Improve the error message.
- Modify the
_closest_linefunction so it returns both the line number and the last key of the path. - Modify the exception message to this:
exn.message = f"Error found on key '{key}' around line {line}. Cause : {exn.message}".
The resulting errors will look like this:
jsonschema.exceptions.ValidationError: Error found on key 'source' around line 2. Cause : [{'type': 'ReadFromCsv', 'name': 'ReadMyData', 'config': {'path': 'D:\\\\Programs\\\\Apache-Beam-YAML\\\\Datasets\\\\sample.csv', '__line__': 6, '__uuid__': 'a7ecf8a4-aa80-4e74-8b42-784ee4d9c88e'}, '__line__': 3, '__uuid__': 'd785ab49-9064-448c-8a94-7330b49e0bf1'}] is not of type 'object'
...
- Update the website section Source and sink transforms of the Beam YAML API page in order to explicitly state that both
sourceandsinkare intented to be objects in YAML.
This issue has been marked as stale due to 150 days of inactivity. It will be closed in 30 days if no further activity occurs. If you think that’s incorrect or this issue still needs to be addressed, please simply write any comment. If closed, you can reopen the issue at any time. Thank you for your contributions.
This issue has been closed due to lack of activity. If you think that is incorrect, you can reopen the issue at any time.