Taneli Leppä
Taneli Leppä
I can confirm that switching to @casept repository fixes string corruption issues for me when processing tags in concurrent goroutines.
I did some investigation on this. I believe the reason is because Elasticsearch has [deprecated mapping types](https://www.elastic.co/guide/en/elasticsearch/reference/current/removal-of-types.html) in ES 7.0+, so this causes a warning like this emitted for every...
@Cherepushko I'm trying to build a test version of `com.google.cloud.teleport.v2.elasticsearch.templates.PubSubToElasticsearch`, but I'm not having too much luck with the `mvn compile exec:java` stuff on the front page. Would you be...
@GustafKisi Could you give the main branch here a try: https://github.com/rosmo/DataflowTemplates ? I've attempted to fix a few issues that cropped up during my testing. You should get it built...
@GustafKisi I suggest Cloud Shell, as it seems to have most stuff installed. Otherwise you'll have to have the Java JDK and Maven installed.
Yeah, it'll take 20-30 minutes or something.
@GustafKisi Great! Did you add the `javascriptTextTransformGcsPath` to the input spec? (I think just passing it via command line would have worked too) I do want to improve the bulk...
Yes, absolutely, but do note it's not officially Google version (but the changes are pretty transparent).
Hey @GustafKisi, I'm not an expert on Dataflow cost optimization, but certainly the streaming engine can help. You can also look into limiting the amount of worker nodes (max workers),...
For the `diskSizeGb`, I think that arguments needs to go into the `--parameters` part. The ingestion errors seem like coming from ES side of things. There are retry settings (`maxRetryAttempts`...