net.jgp.books.spark.ch09
net.jgp.books.spark.ch09 copied to clipboard
Datasource example for Python
I am following the datasource example and noticed that no code for python.
Please let me know how to access Scala short name datasource from the python code!
https://github.com/jgperrin/net.jgp.books.spark.ch09/blob/master/src/main/python/lab400_photo_datasource/photoMetadataIngestionApp.py => this file is empty
Hey @purna344, thanks for getting in touch... The Python and Scala code is supported by @rambabu-posa, so I'll let Ram chime in on this one. A quick note though:
- You can probably not write your data source in Python (but I am not 100% sure on that, as my Python is pretty rustic).
- You should be able to use the short name in the python code, but you will have dependencies on a Jar containing your Java/Scala code.
@rambabu-posa - can you add to that and help @purna344 ?
Thanks @jgperrin Hi @purna344, As I felt implementing custom datasource in PySpark is bit complex, I left that file as empty. I will try to work on it again and update you.
@rambabu-posa can you provide an update or close the issue?
@jgperrin its bit tough to implement.