Fast Data Apps with Streamlit - Case: How Much Will the Pandemic Cost My

How Much Will the Pandemic Cost My Company?

Any business organization must constantly respond to changes in their operating environment. Measure the change, predict the implications for you, and adapt – simple as that. In practice, no matter how data-driven you are, measuring the right things in a new situation can be challenging, and often new data sources and approaches are needed. Collecting data from internal and external sources, building decision-support dashboards and machine learning models with customized user interfaces can be a few-month job for a sizable data team! If the change is sudden, you need to react quickly, – but when the impact is uncertain, launching a large-scale development project might be an expensive overreaction. In this post we applied a fast data app prototyping tool, Streamlit, on exploring revenue impact scenarios in the “new normal”.

The Cost of COVID-19

For COVID-19, there are a number of dashboards available for monitoring every aspect of the pandemic. About the most famous one developed at Johns Hopkins, it was particularly impressive how quickly the creators were able to find and aggregate all the data sources when the virus began to attract public interest (they were professionals before it was cool). A global dashboard, however, is hardly actionable for most organizations. How do we gain relevant insight from this abundance of data for, say, a major European retailer?

The publicly available, high-quality data collected for the above mentioned dashboard seems like a good starting point for this purpose as well. We could also utilize regional mobility trends, e.g. by Google (available for the time being) to better understand current consumer behavior. It would be nice to present the interesting parts of the data together with some internal company data, so maybe we should build data warehouse pipelines and modify existing sales dashboards, or make new ones, . and pPerhaps we would also like to have our data scientists exploring and building machine learning models for supporting difficult decisions and automating simple ones with this up-to-date data. On second thought, maybe just look elsewhere, wait out the pandemic, and hope the company is still standing after the dust has settled? With Streamlit, you can deliver all this before the need to commit to any significant investment – no return calculations necessary.

Enter Streamlit – Tool for the Python Data Specialist

Released a few months ago, steamlit.io is an open-source app framework for building an ad-hoc fast data app “in hours, not weeks”. Streamlit essentially turns a Python script into a responsive web app that you might think was custom-built with a modern front-end framework, all with very little added effort. To create an interactive front-end widget for controlling a variable, just wrap it in a Streamlit call. Caching the result of a data pipeline or an expensive computation only takes a Streamlit decorator. Advanced caching and hot reload make Streamlit scripting as fast-paced and addictive as front-end development, even when working with fairly complex data apps. When you need an interactive interface instead of an API, a notebook does not quite cut it, but a custom full-stack web app would be too much, Streamlit might be just what you need. Attractive use cases include interactive data exploreationsrs, quick machine learning app prototyping, and what-if tooling to reduce iteration between business stakeholders and data developers.

To model the sales impact of COVID-19, we can hop on the citizen epidemiologist bandwagon, and implement a simple model to simulate the susceptible, exposed, infected, and recovered (SEIR) fractions of the population in the region of interest. With data on actual confirmed cases available, we can approximate the current date in terms of simulation days, while adjusting the duration and magnitude of social distancing (reduced disease transmission), as appropriate given the emergency measures of the local government at the moment. With regional mobility trends, we can estimate the reduction in consumer retail activity, and adjust the timeframe of our anomalous event (with case-specific start/end definitions) accordingly. Finally, we can fit a time series model to our sales data prior to the pandemic and estimate the revenue impact for the expected duration of the event e.g. separately for retail stores and ecommerce. Here we used Prophet to readily model e.g. seasonalities and national holidays, but in simple cases reasonable estimates could be achieved by only looking at linear trends and constant intervals. With a single pure-Python script, we end up with the estimated sales impact for user-selectable region of interest, adjustable social distancing effects, and lovely browser-UI responsivity due to cached data pipelines and computation – all in a day’s work without even bothering our busy front-end professionals.

Final thoughts on a fast data app

The above impact calculator barely scratches the surface of Streamlit, which works seamlessly with deep learning models as well. To entertain your bored family at home, for instance, add about 30 lines of Streamlit code to a neural style transfer script to magically turn it into a web app on your idle gaming PC personal GPU-workstation that a kid with a phone can use to make family photos look like they were painted by renaissance masters. There are various tutorials and demos available to get started with Streamlit, one of our favorites being the neat GAN face generator that now also runs on “The Kraken” at our office to hopefully cheer up the occasional quarantine-escapee. If you prefer Docker, you can try our image instead with docker run –gpus all -p 8501:8501 ahtonen/demo_face_gan:latest.

These office mates don’t exist.

Written by Juho Kerttula. Juho is a Senior Data Scientist at DAIN Studios, based in Helsinki.

Cookie	Duration	Description
__cfduid	1 month	The cookie is used by cdn services like CloudFlare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
ARRAffinity	session	ARRAffinity cookie is set by Azure app service, and allows the service to choose the right instance established by a user to deliver subsequent requests made by that user.
ARRAffinitySameSite	session	This cookie is set by Windows Azure cloud, and is used for load balancing to make sure the visitor page requests are routed to the same server in any browsing session.
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-advertisement	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-non-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non-necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
elementor	never	This cookie is used by the website's WordPress theme. It allows the website owner to implement or change the website's content in real-time.
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	1 year	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	1 year	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
li_gc	5 months 27 days	Used to store consent of guests regarding the use of cookies for non-essential purposes.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
tableau_locale	session	We embed Tableau charts and interactivity on some of our pages. These cookies expire at the end of your session.
tableau_public_negotiated_locale	session	We embed Tableau charts and interactivity on some of our pages. These cookies expire at the end of your session.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.

Cookie	Duration	Description
_dc_gtm_UA-111640802-1	1 minute	This cookie is used by Google Tag Manager to support Google Analytics on our Sites. It helps us monitor the use and performance of our Sites.
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_ga_JWW0KP3X8Q	2 years	This cookie is installed by Google Analytics 4.
_gat_UA-111640802-1	1 minute	This is a pattern type cookie set by Google Analytics, where the pattern element on the name contains the unique identity number of the account or website it relates to. It appears to be a variation of the _gat cookie which is used to limit the amount of data recorded by Google on high traffic volume websites.
_gcl_au	3 months	Provided by Google Tag Manager to experiment advertisement efficiency of websites using their services.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
ai_session	30 minutes	This is a unique anonymous session identifier cookie set by Microsoft Application Insights software to gather statistical usage and telemetry data for apps built on the Azure cloud platform.
ai_user	1 year	A unique user identifier cookie, set by Microsoft Application Insights software, that enables counting of the number of users accessing the application over time.
AnalyticsSyncHistory	1 month	Used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries
prism_252943399	1 month	This cookie is used by Active Campaign for site tracking purposes.
visitorId	1 year	By default, the visitor ID is supplied to Coveo UA using the visitor (string) query parameter and kept in the local storage of the user browser. A third-party cookie can also be used to store the visitor ID if the current user browser accepts these kinds of cookies.
WFESessionId	session	These cookies are used by Microsoft Azure Application Insights, which collects site telemetry information, allowing us to analyze how some of our Sites are performing and to perform optimization.
YSC	session	This cookies is set by Youtube and is used to track the views of embedded videos.

Cookie	Duration	Description
IDE	1 year 24 days	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
LinkedIn
muc_ads	2 years	Collects data on user behaviour and interaction in order to optimize the website and make advertisement on the website more relevant.
personalization_id	2 years	Twitter sets this cookie to integrate and share features for social media and also store information about how the user uses the website, for tracking and targeting.

Cookie	Duration	Description
CONSENT	16 years 7 months 20 days 16 hours 15 minutes	No description
GetLocalTimeZone	session	No description
hid	session	No description available.

Fast Data Apps with Streamlit – Case: How Much Will the Pandemic Cost My Company?

How Much Will the Pandemic Cost My Company?

The Cost of COVID-19

Enter Streamlit – Tool for the Python Data Specialist

Final thoughts on a fast data app

References & more

Details

Computer Vision: Create an API in 60 minutes

Data Governance Roles and Responsibilities

Guiding C-Level Executives Through Business Ethics in the Data and AI Age

DAIN Studios

Studio HELSINKI

Studio BERLIN

Studio MUNICH