edx-analytics-pipeline
edx-analytics-pipeline copied to clipboard
I have installed edx analytics pipeline and it is running successfully. edxops/analytics_pipeline:latest edxops/analytics_pipeline_spark_worker:latest edxops/analytics_pipeline_hadoop_nodemanager:latest edxops/analytics_pipeline_hadoop_resourcemanager:latest edxops/analytics_pipeline_hadoop_datanode:latest edxops/analytics_pipeline_spark_master:latest edxops/analytics_pipeline_hadoop_namenode:latest But when I open insights and login to it, it shows following...
Hi, Is this wiki outdated or can we still follow these steps to install the analytics pipeline? Thanks, Somansh
This is the second part of the work to add a pipeline to load GA360 data into Snowflake. This part of the code change DOES depend on a Luigi upgrade,...
This command-line tool generates curated synthetic enrollment event files from log files created separately by enrollment validation runs. These event files are in a format where they can be directly...
This PR includes following tasks: TotalEventsDailyTask UserActivityTask CourseActivityPartitionTask InternalReportingUserActivityPartitionTask Note: **Not to be merged with master branch yet**
This is a redux of Brian's work, and then Alex's work, on working towards making AnswerDistribution incremental, by partially achieving incremental-ness by generating Hive partitions for each day's data.
DE-69. This pulls information from SailThru about email blasts. Two tables are created: statistics about each email blast -- how many were opened and clicked and such; and information about...
Succeeds on acceptance tests. Not tested yet on release-candidate runs, but available for comment.
This PR is part of the dockerization of the analyticstack, most of the documentation can be found here: https://github.com/edx/configuration/pull/3582 It implements the following changes: 1. Add a docker.cfg config file...
For the details, check the issue https://github.com/openedx/public-engineering/issues/233