Database Management Systems (DBMS)

What are Database Management Systems?

A Database Management System (DBMS) is software that interacts with end-users, applications, and the database itself to capture and analyze data. It provides a centralized interface that allows users to create, read, update, and delete data in a database. It handles the instructions given by users or applications and translates them into complex operations on the physical data storage.

 

Why are Database Management Systems used?

They are used to manage large amounts of data efficiently and securely. A DBMS enforces a structured format for data entry and storage. It provides built-in mechanisms for data security, data backup, and data recovery, ensuring that information is not lost during hardware failures and is only accessible to authorized users.

 

What is the difference between a Database and a Database Management System?

A database is the actual organized collection of stored data. The Database Management System is the software application installed on a server or computer used to manage, access, and manipulate that data collection. You cannot directly interact with a database without the DBMS software to process your commands.

 

What are the standard types of Database Management Systems?

The most common type is the Relational Database Management System (RDBMS), which organizes data into rigid tables with rows and columns. Another common type is the NoSQL DBMS, which is designed to handle unstructured data and stores information in flexible document formats or key-value pairs rather than tables.

 

How is a Database Management System used in Data Science?

In Data Science, a DBMS is used as the primary source to store, query, and extract the raw data required for analytical projects. Data scientists use query languages, such as SQL, directly within the DBMS to filter, aggregate, and export specific datasets. For example, a data scientist will execute a query in a DBMS to extract exactly three years of historical customer transaction records, which they will then export to train a predictive machine learning model.