Comprehensive Guide To Facebook’s Prophet With Python Code

Prophet, a Facebook Research’s project, has marked its place among the tools used by ML and Data Science enthusiasts for time-series forecasting.

Share

Published on April 27, 2021

by Nikita Shiledarbaxi

Prophet, a Facebook Research’s project, has marked its place among the tools used by ML and Data Science enthusiasts for time-series forecasting. Open-sourced on February 23, 2017 (blog), it uses an additive model to forecast time-series data. This article aims at providing an overview of the extensively used tool along with its Pythonic demonstration.

Highlighting features of Prophet

It performs time-series forecasting “at scale” which means memory usage and computations complexity are not big-deal concerns for the Prophet while making a forecast.
It can fit time-series data having non-linearity in trends as well as holiday effects.
It works quite well with data having daily, weekly, monthly and/or yearly seasonality and in cases where we have several seasons of recorded historical data for making future forecasts.
It has R and Python APIs for time-series forecasting.
It can be downloaded as a CRAN or PyPI package.
It is highly susceptible to missing data, outliers and erratic changes in time-series data.
It makes use of the Stan platform for making forecasts quickly and with easily interpretable parameters.

NOTE: ‘Trend’ in time-series refers to an overall change in the data with time. While the term ‘seasonality’ means the way the data changes over a specific period e.g. week, month, year etc.

Working of Prophet

Image source: Facebook blog

Prophet employs an additive regression model having four constituents at its core:

A curve for detecting changes in trends of the variable for which forecast is to be made by picking variation-points from the time-series data.
A yearly seasonal component (uses Fourier series)
A weekly seasonal component
A customizable list representing holiday effects in the data

Practical implementation

Here’s a demonstration of using Python API for forecasting avocados’ prices using Prophet. The dataset used is available on Kaggle. The code implementation has been done using Google Colab and fbprophet 0.7.1 library. Step-wise implementation of the code is as follows:

Install the fbprophet Python library.

!pip install fbprophet

Import required libraries

 import numpy as np
 import pandas as pd
 import matplotlib.pyplot as plt
 import seaborn as sns
 from fbprophet import Prophet

Load the avocado dataset.

df = pd.read_csv('avocado.csv')

Display the initial records of the dataset.

df.head()

Output:

Get information about columns, number of entries, data types etc. of the dataset.

df.info()

Output:

Sort the DataFrame in ascending order of recorded date and create a new DataFrame having sorted records.

df1 = df.sort_values("Date")

Display some initial records of the sorted data.

df1.head()

Output:

Plot the recorded prices and observe the trend.

First, get the minimum and maximum dates in the historical data.

df1[‘Date’].min()

Output: 2015-01-04

df2[‘Date’].max()

Output: 2018-03-25

These outputs show that we have records from January 2015 to March 2018.Plot the prices of that period.

 plt.figure(figsize=(25,10))
 plt.plot(df1['Date'],df1['AveragePrice'])

Output:

We can also observe region-wise distribution of the data.

 plt.figure(figsize=(25,12))
 sns.countplot(x='region',data=df1)
 plt.xticks(rotation=45)

Output:

 (array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15, 16,
         17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33,
         34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50,
         51, 52, 53]), <a list of 54 Text major ticklabel objects>)

The plot shows that the data is balanced i.e. equally distributed region-wise.

Know the year-wise count of records in the data.

sns.countplot(x='year',data=df1)

Output:

Prophet expects a DataFrame as input in which there are two columns specifically named as ‘ds’ and ‘y’. ‘ds’ is the datestamp column while ‘y’ is the numeric variable for which forecast is to be made.So we need to keep only the ‘Date’ and ‘AveragePrice’ columns of df1 DataFrame and rename them as ‘ds’ and ‘y’ respectively.

Extract the two required columns

 df1 = df1[['Date','AveragePrice']]
 df1

Output:

Rename the columns

 df1.columns = ['ds','y']
 #Display initial columns to check if the columns have got renamed
 df1.head()

Output:

Forecast the future prices using Prophet.

Create a Prophet instance

m = Prophet()

Fit the historical data

m.fit(df1)

Create a DataFrame with future dates for forecast.

 future = m.make_future_dataframe(periods=365)
 #periods=365 specifies that forecast will be made for next 1 year

df1 has dates till 25/3/2018 so ‘future’ will be till 25/3/2019. Predict the prices for this new data having future dates as well

forecast = m.predict(future)

Get information on the ‘forecast’ DataFrame created by Prophet.

forecast.info()

Display a few initial records of ‘forecast’.

forecast.head()

Condensed output:

11) Plot the data with recorded as well as forecasted prices.

 figure = m.plot(forecast,xlabel='Date',ylabel='Price')

Output:

Our original data had monthly records till February 2019. The blue-shaded portion of the above plot shows the prices predicted for the next one year’s span, i.e. till February 2019.

Actual recorded prices have been marked with black dots in the above plot, while the The blue non-linear line shows the average predicted prices.

Plot the components of the forecast.

figure = m.plot_components(forecast)

Output:

The above forecast is made for all regions in general. We can make forecast for a specific region as follows:

Extract data of the required region from the original data.

df2 = df[df['region']=='West']

Display initial records.

df.head()

Output:

Sort the regional data in ascending order of dates.

df2 = df2.sort_values('Date')

Plot the recorded prices for that specific region.

 plt.figure(figsize=(15,10))
 plt.plot(df2['Date'],df2['AveragePrice'])

Output:

Extract the ‘Date’ and ‘AveragePrice’ column and rename them as ‘ds’ and ‘y’ respectively.

 df2 = df2[['Date','AveragePrice']]
 df2.columns = ['ds','y']

Create Prophet instance and fit the data

 m = Prophet()
 m.fit(df2)

Forecast prices for the next one year for that specific region.

 future = m.make_future_dataframe(periods=365)
 forecast = m.predict(future)

Plot the recorded and forecasted prices for the region.

figure = m.plot(forecast,xlabel='Date',ylabel='Price')

Output:

(Black dots: actual price values, Blue curve: predicted prices)

figure = m.plot_components(forecast)

Output:

Check Google colab notebook for the whole code here.

References

Access all our open Survey & Awards Nomination forms in one place

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

10 Deepfake AI Tools to Help You Create Content within Minutes

Gopika Raj

Deepfake is a double edged sword that can ignite creativity for social media engagement and can also cause immense harm

Commvault’s Arlie Teams Up with Microsoft to Elevate Cyber Resilience Globally

Shyam Nandan Upadhyay

Ready or Not, AI Agents Are Coming

Sukriti Gupta

Top Editorial Picks

SBI to Leverage HCL Unica to Digitally Transform Customer Engagement

Pritam Bordoloi

African Tech Companies Prefer Zoho Enterprise over Google Workspace

Vandana Nair

Reid Hoffman Creates a DeepFake of Himself, Reid AI

Gopika Raj

GitHub Copilot Rival, Augment Secures $252 Mn at $1 Bn Valuation to Boost AI for Developers

K L Krithika

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Also in News

Become a Certified Generative AI Engineer

Check our Industry Research Reports

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.

AIM Videos

Zerodha CTO Dr. Kailash Nadh Decodes AI Culture in Tech

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Developer’s Corner

In Case You Missed It

Which is the Most Frustrating Programming Language?

Mohit Pandey 18/03/2024

AI4Bharat Rolls Out IndicLLMSuite for Building LLMs in Indian Languages

Shritama Saha 15/03/2024

Google Introduces Synth^2 to Enhance the Training of Visual Language Models

K L Krithika 14/03/2024

Infosys Funds Llama 2 Project with 22 Indian Languages

Infosys Founder Funds Meta’s Llama 2 Project with 22 Indian Languages

Mohit Pandey 13/03/2024

Comprehensive Guide To Facebook’s Prophet With Python Code

Highlighting features of Prophet

Working of Prophet

Practical implementation

References

Nikita Shiledarbaxi

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to stay informed

Top Editorial Picks

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Also in News

AI Courses & Careers

Become a Certified Generative AI Engineer

Industry Insights

Check our Industry Research Reports

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.

AIM Videos

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

GenAI Corner

Data Dialogues

Future Talks

Developer’s Corner

In Case You Missed It

Webstories

Also in Trends

World's Biggest Media & Analyst firm specializing in AI

Advertise with us

AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.

Branded Content

AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.

Corporate Upskilling

ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent

Hackathons

With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.

Talent Assessment

Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring

Research & Advisory

AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.

Conferences & Events

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives.

Subscribe to Our Newsletter

Download the easiest way to
stay informed

Industry
Insights

GenAI
Corner

Data
Dialogues

Future
Talks