During my Metis bootcamp, I did an intro presentation to my class about spark.
The presentation explored the content and length of a sample of the commit messages on github.
Here is the link to the ipython notebook covering concepts such as SparkCore,SparkSql.
Full Repo Link