Dataframes with Pandas on JupyterLab

Introduction Pandas standing for panel data is managing an outstanding object called DataFrame and the first time I encountered the famous « dataframe object » concept was when I was asked to put in place an Apache Spark SQL environment in my current customer’s day. As a SQL and java developper, I…

lire la suite

Scheduling with Apache Airflow

Airflow at a glance How does it work ? Quick demo Launching Airflow Even if the official repository provide a docker compose file, I fine tuned the file to simplify the getting started: Once ready, launch these commands Open your favorite browser and go to the Airflow login page: http://airflow.localtest.me…

lire la suite