According to Odersky, the name Scala is short for “scalable language”, meaning the language can grow to fit whatever use you have in mind. This is true to some extent; for example, though Scala’s subtyping system is unsound, you can “expand” the language to make a better one. Of course it would be preferable for the…
Apache Spark comes with built-in functionality to pull data from S3, just as it would from HDFS, using the SparkContext’s
textFile method. textFile accepts glob syntax, which lets you pull hierarchical data, as in
textFile("s3n://bucket/2015/*/*"). Though this seems great at first, there is an underlying issue with…
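To make the glob read concrete, here is a minimal sketch in Scala. It uses SparkContext.textFile, which is the method name in Spark's API, and runs against a small temporary local directory tree instead of S3 so that it needs no credentials. The object name GlobReadSketch, the year/month/day layout, and the file contents are all assumptions made for illustration. The same call shape applies to an s3n:// path such as the one above, provided the Hadoop S3 connector and credentials are configured.

```scala
import java.nio.file.{Files, Path}
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical example object; not taken from the original post.
object GlobReadSketch {
  def main(args: Array[String]): Unit = {
    // Build a tiny 2015/<month>/<day>.txt tree in a temp directory (made-up data).
    val root: Path = Files.createTempDirectory("glob-sketch")
    for (month <- Seq("01", "02"); day <- Seq("01", "02")) {
      val dir = Files.createDirectories(root.resolve(s"2015/$month"))
      Files.write(dir.resolve(s"$day.txt"), s"record for 2015-$month-$day\n".getBytes("UTF-8"))
    }

    // Local master so the sketch runs without a cluster.
    val conf = new SparkConf().setAppName("glob-read-sketch").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // One wildcard per directory level: every file under every month of 2015.
      // With S3 configured, the path would instead look like "s3n://bucket/2015/*/*".
      val lines = sc.textFile(s"${root.toAbsolutePath}/2015/*/*")
      println(s"lines read: ${lines.count()}")
    } finally {
      sc.stop()
    }
  }
}
```

Each wildcard matches one level of the hierarchy, so the pattern has to mirror the depth of the directory layout being read.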
Maybe this is the issue?