

MongoDB in Python Tutorial for Beginners (using PyMongo)

Overview

- MongoDB is a popular database for unstructured data that data scientists should be aware of.
- We will discuss how you can work with a MongoDB database using Python (and the PyMongo library).
- We will cover all the basic operations in MongoDB using Python.

We are generating data at an unprecedented pace right now. The scale and size of this data is mind-boggling! Just check out these numbers:

- Facebook generates four petabytes of data in just one day.
- Google generates twenty petabytes of data every day.
- Furthermore, the Large Hadron Collider (the world's most powerful particle accelerator, 27 kilometers long) generates one petabyte of data per second.

Most importantly, this data is unstructured.

Can you imagine using SQL to work with this volume of data? It's setting yourself up for a nightmare! SQL is a wonderful language to learn as a data scientist, and it works well when we're dealing with structured data. But if your organization works with unstructured data, SQL databases cannot fulfill the requirements.

Structured databases have two major disadvantages:

- Scalability: It is very difficult to scale as the database grows larger.
- Elasticity: Structured databases need data in a predefined format. If the data does not follow the predefined format, relational databases do not store it.

So how do we solve this issue? If not SQL, then what?
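To make the elasticity limitation concrete, here is a minimal sketch of a relational table refusing records that do not match its predefined format. It uses Python's built-in sqlite3 module purely as a stand-in for a relational database. The table name `users`, the helper `try_insert`, and the sample records are all illustrative assumptions, not part of any real dataset.

```python
import sqlite3

# In-memory SQLite database, used here only as a stand-in for a relational system.
conn = sqlite3.connect(":memory:")

# The "predefined format": every row must have exactly these two columns.
conn.execute("CREATE TABLE users (name TEXT NOT NULL, age INTEGER NOT NULL)")


def try_insert(record):
    """Try to store a dict as a row; return True if the table accepted it.

    Hypothetical helper for this sketch. Column names come from our own
    hard-coded sample records, so interpolating them into the SQL is safe here.
    """
    columns = ", ".join(record)
    placeholders = ", ".join("?" for _ in record)
    try:
        conn.execute(
            f"INSERT INTO users ({columns}) VALUES ({placeholders})",
            tuple(record.values()),
        )
        return True
    except sqlite3.Error:
        # Raised for an unknown column or a violated NOT NULL constraint.
        return False


# A record that matches the schema is stored.
fits_schema = try_insert({"name": "Asha", "age": 31})

# A record with an extra field is rejected: the table has no such column.
extra_field = try_insert({"name": "Ravi", "age": 28, "hobbies": "chess"})

# A record with a missing field is rejected: age may not be NULL.
missing_field = try_insert({"name": "Mei"})

stored_rows = conn.execute("SELECT name, age FROM users").fetchall()
print(fits_schema, extra_field, missing_field, stored_rows)
```

Only the first record ends up in the table. The other two would need either a schema change or some data cleaning before a relational database would accept them.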
