Tag - Spark

Apache Flink: The Next Distributed Data Processing Revolution?

Will Apache Flink displace Apache Spark as the new champion of Big Data Processing? We compare Spark and Apache Flink performance for batch processing and stream processing. By Kevin Jacobs, Data Blogger. Disclaimer: The results are valid on...

Apache Spark : Python vs. Scala

When it comes to using the Apache Spark framework, the... By Preet Gandhi, NYU Center for... Apache Spark is one of the most popular frameworks for big data analysis. Spark is written in Scala as it can be quite fast because it's statically typed a...

Data Science and the Imposter Syndrome

You are not the only one who wonders how much longer they can get away with pretending to be a... I am not a real... Even Ewoks feel like imposters sometimes. (Photo courtesy of Diane Rohrer.) What a real data scientist looks like: “Data science” i...

Did Spark Really Kill Hadoop?

A comprehensive survey conducted by iDatalabs shows the future trends of these two Data Science technologies. By Julia Cook, iDatalabs. Apache Hadoop, built by Yahoo for engineers and data scientists, is showing its age. Once praise...