Data Visualization II - Plotly

A data dashboard is an information management tool that visually tracks, analyzes and displays key performance indicators, metrics and key data points. Behind the scenes, a dashboard connects to your files, attachments, services and API’s, but on the surface displays all this data in the form of tables, line charts, bar charts and gauges.

Plotly provides a web-service for hosting graphs and we can view the dashboard offline as well.

Database for Financial Analysis II - MongoDB

MongoDB is a document-oriented database.

Instead of storing your data in tables made out of individual rows, like a relational database does, it stores your data in collections made out of individual documents. In MongoDB, a document is a big JSON blob with no particular format or schema.

You can have all your data in one single collection.

Data Analysis IV - Merging and Reshaping Data

Sometimes the way that data is stored in files or databases is not the way you need it for a data processing application.
Pandas along with the Python standard library provide us with a high-level, flexible, and high-performance set of core manipulations
and algorithms to enable us to wrangle data into the right form without much trouble.

Data Analysis III - Validating Data

A favorite old saying of mine with respect to data quality is "Garbage In and Garbage Out". Your analysis is only ever as good as your dataset.
Pandas is a popular Python library used for data science and analysis. Used in conjunction with other data science toolsets like SciPy, NumPy, and Matplotlib, a modeler can create end-to-end analytic workflows to solve business problems.
My objective here is to go through some examples for how to perform basic data validation.