Spark: The Definitive Guide's Code Repository

Results 32 Spark-The-Definitive-Guide issues

https://github.com/databricks/Spark-The-Definitive-Guide/blob/38e881406cd424991a624dddb7e68718747b626b/code/Structured_APIs-Chapter_6_Working_with_Different_Types_of_Data.py#L313 This line appears to be the Scala version, even though the file is Python. The same applies to #L319.

Issue in the last code example, which finds the top destination countries: count should be cast to int before taking the sum, and the existing code also raises a syntax error.
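A minimal sketch of why the cast matters, in plain Python rather than Spark: values read from a CSV without schema inference arrive as strings, so they need converting to integers before they can be summed. The column names and sample rows below are assumptions made for illustration, not data taken from the issue. In PySpark the corresponding step would be casting the column (for example `col("count").cast("int")`) before aggregating.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows; column names are assumed, not taken from the issue.
SAMPLE_CSV = """DEST_COUNTRY_NAME,ORIGIN_COUNTRY_NAME,count
United States,Romania,15
United States,Croatia,1
Egypt,United States,15
United States,Ireland,344
"""


def top_destinations(csv_text, n=5):
    """Sum the count column per destination and return the n largest totals."""
    totals = defaultdict(int)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # csv yields every field as a string, so cast to int before adding.
        totals[row["DEST_COUNTRY_NAME"]] += int(row["count"])
    # Sort destinations by total, largest first.
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)[:n]


result = top_destinations(SAMPLE_CSV)
print(result)
```

Without the `int(...)` call the addition would fail with a `TypeError`, because an integer total cannot be added to a string.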

It would be great to have Java implementations as well for all the code in the book.

I'm studying Spark's advanced RDD API and got a little confused by one example. `// in Scala import org.apache.spark.Partitioner class DomainPartitioner extends Partitioner { def numPartitions = 3 def...

Value format is not a member of org.apache.spark.sql.DataFrame

Hi, I'm really stuck on this section of the Spark book. staticDataFrame = spark.read.format("csv")\ .option("header", "true")\ .option("inferSchema", "true")\ **.load("/mnt/defg/retail-data/by-day/*.csv")** 1) I'm not able to understand the load("/mnt/...") part. I have downloaded...
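The argument to `load` in the snippet above is a path containing a `*.csv` wildcard, so it matches every CSV file in that directory. When running outside the environment where `/mnt/defg` exists, one way to check a local copy is to expand the same kind of pattern with Python's standard `glob` module before handing the path to Spark. This is only a sketch: the helper name `matching_files` and the file names in the demo are hypothetical, and a temporary directory stands in for wherever the data was downloaded.

```python
import glob
import os
import tempfile


def matching_files(data_dir, pattern="*.csv"):
    """Return the sorted paths in data_dir that match the wildcard pattern."""
    return sorted(glob.glob(os.path.join(data_dir, pattern)))


# Demo with a temporary directory and made-up file names.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("2010-12-01.csv", "2010-12-02.csv", "notes.txt"):
        open(os.path.join(tmp, name), "w").close()
    found = [os.path.basename(p) for p in matching_files(tmp)]

print(found)
```

If the list comes back empty for the real download directory, the path passed to `load` is the likely problem rather than the rest of the read options.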

Hi, I started learning **Apache Spark** by reading this book. I'm now at ``chapter 3 - Streaming part``. For the code snippets, I chose ``python3``. My problem is that nothing is...

Hello, I tried to follow the transfer learning example in PySpark on pages 534-535. However, when I try to do p_model = p.fit(train_df) I get a 'Number of source Raster...

Is there any specific reason? Can it be updated to run on Python 3, since that's the standard now?