DuyHai DOAN is an Apache Cassandra Evangelist at DataStax and a committer for Apache Zeppelin. He splits his time between technical presentations and meetups on Cassandra, coding on open source projects such as Achilles and Apache Zeppelin to support the community, and helping companies that use Cassandra make their projects successful. Previously, he worked as a freelance Java/Cassandra consultant.
Big Data is quite trendy, and now you're on a new project with Spark/Cassandra/Hadoop to build the DataLake of the 21st century. To get off to a good start in this new world and all its concepts, you'll need some foundational knowledge:
1) basic notions in distributed systems: time, failures, latency, replication
2) the CAP theorem, with a brief look at consistency and availability
3) master/slave, multi-master and masterless architectures, with their advantages and drawbacks