Spark - Key-Value RDD

1 - About

Spark supports Key-Value pairs RDD in Python trough a list of tuple.

A count of an RDD with tuple will return the number of tuples. A tuple can be seen as a row.

3 - Construction

Spark - RDD Data Type (Creation|Construction|Initialization)

rdd = sc.parallelize([(1, 2), (3, 4)]) 
RDD: [(1, 2), (3, 4)]

4 - Transformation

5 - Action

6 - Documentation / Reference

db/spark/key_value.txt ยท Last modified: 2017/09/06 20:15 by gerardnico