Spark-The-Definitive-Guide issues

ch 6 map -> create_map

3

https://github.com/databricks/Spark-The-Definitive-Guide/blob/38e881406cd424991a624dddb7e68718747b626b/code/Structured_APIs-Chapter_6_Working_with_Different_Types_of_Data.py#L313 It seems SCALA version, same as #L319

dpuddu86

Code issue chapter2

Issue in last code to get maximum destination countries. Count should be typecast to int before taking sum ,and syntax error is also coming with existing code

Aviral92-create

Spark The Definitive Guide dataz

2

교재 정보, 자료

mariner610

Implement codes from the book in Java as well

2

It will be great if we have Java implementations as well for all the codes in the book

neeleshkumar-mannur

Chapter 13 - Advanced RDD example of Custom partitioner may need correction

1

I'm studying spark advanced RDD API and got a little bit confused by one example. `// in Scala import org.apache.spark.Partitioner class DomainPartitioner extends Partitioner { def numPartitions = 3 def...

izayarniy

Update Streaming-Chapter_21_Structured_Streaming_Basics.scala

Value format is not a member of org.apache.spark.sql.DataFrame

abouklila

Clarification on loading data folder having multiple CSV file from local hard drive

1

Hi, I'm really stuck with this section of Spark book. staticDataFrame = spark.read.format("csv")\ .option("header", "true")\ .option("inferSchema", "true")\ **.load("/mnt/defg/retail-data/by-day/*.csv")** 1) I'm not able to understand the "load("/mnt/...") section. I have downloaded...

vishnu-muraly

chap3 - streaming write results in console

3

Hi, I start learning **Apache Spark** by reading that book. I'm now at ``chapter 3 - Streaming part``. For snippet code, I choose ``python3`` My problem is that nothing is...

maelfosso

Deep Learning - Transfer learning

Hello, I tried to follow the Transfer learning example in pyspark on page 534-535. However, when I try to do p_model = p.fit(train_df) I get a 'Number of source Raster...

lao8n

The code seems to be running in python 2

1

Is there any specific reason? Can it be updated to run on python 3 since thats the standard now?

hajimurtaza

Spark-The-Definitive-Guide
Spark-The-Definitive-Guide copied to clipboard

Metadata

ch 6 map -> create_map

Code issue chapter2

Spark The Definitive Guide dataz

Implement codes from the book in Java as well

Chapter 13 - Advanced RDD example of Custom partitioner may need correction

Update Streaming-Chapter_21_Structured_Streaming_Basics.scala

Clarification on loading data folder having multiple CSV file from local hard drive

chap3 - streaming write results in console

Deep Learning - Transfer learning

The code seems to be running in python 2

← Metadata

Owner

Metadata

Spark-The-Definitive-Guide Spark-The-Definitive-Guide copied to clipboard

Metadata

← Metadata

Owner

Metadata

Spark-The-Definitive-Guide
Spark-The-Definitive-Guide copied to clipboard