RStudio endeavors to create free and open-source software for data science, scientific research, and technical communication in a sustainable way, because it benefits everyone when the essential tools to produce and consume knowledge are available to all, regardless of economic means.
We believe businesses should fulfill a purpose beneficial to the public and be run for the benefit of all stakeholders including employees, customers, and the community at large.
As a Delaware Public Benefit Corporation (PBC) and a Certified B Corporation®, RStudio’s open-source mission and commitment to a beneficial public purpose are codified in our charter, requiring our corporate decisions to balance the interests of community, customers, employees, and shareholders.
B Corps™ meet the highest verified standards of social and environmental performance, transparency, and accountability. RStudio measures its public benefit by utilizing the non-profit B Lab®’s “Impact Assessment”, a rigorous assessment of a company’s impact on its workers, customers, community, and environment. In 2019, RStudio met the B Corporation certification requirements set by the B Lab. The Certification process uses credible, comprehensive, transparent, and independent standards of social and environmental performance. Details of this assessment are available at bcorporation.net/directory/rstudio. In accordance with B Lab practices, our next certification will be done in December 2022.
As a PBC, RStudio publishes an annual report that describes the public benefit we have created, along with how we seek to provide public benefits in the future. This is the second of these reports. For the reader’s convenience it includes information from prior report(s) that has not changed, along with material updates. The first report from 2019 may be found here.
To fulfill its beneficial purposes, RStudio intends to remain an independent company over the long term. With the support of our customers, employees, and the community, we remain excited to contribute useful solutions to the important problems of knowledge they face.
CEO, RStudio, PBC
RStudio’s mission is to create free and open-source software for data science, scientific research, and technical communication. We do this to enhance the production and consumption of knowledge by everyone, regardless of economic means, and to facilitate collaboration and reproducible research, both of which are critical to the integrity and efficacy of work in science, education, government, and industry.
RStudio also produces a modular platform of commercial software products that enable teams to adopt R, Python, and other open-source data science software at scale; along with online services to make it easier to learn and use them over the web.
Together, RStudio’s open-source software and commercial software form a virtuous cycle: The adoption of open-source data science software at scale in organizations creates demand for RStudio’s commercial software; and the revenue from commercial software, in turn, enables deeper investment in open-source software, which benefits everyone.
In 2020, RStudio spent over 50% of its engineering resources on open-source software, and led contributions to over 320 open-source projects, targeting a broad range of areas including the RStudio IDE; infrastructure libraries for R; numerous packages and tools to streamline data manipulation, exploration and visualization, modeling, and machine learning; and integration with external data sources. RStudio also sponsors or contributes to more than a dozen open-source projects led by others, including NumFOCUS and the cross-language Apache Arrow project led by Ursa Computing.
Additional company and product highlights from 2020 can be found in RStudio’s January 2021 blog post, “2020 at RStudio: A Year in Review”.
RStudio’s approach is not typical. Traditionally, scientific and technical computing companies created exclusively proprietary software. While it can provide a robust foundation for investing in product development, proprietary software can also create excessive dependency that is not good for data science practitioners and the community. In contrast, RStudio provides core productivity tools, packages, protocols, and file formats as open-source software so that customers aren’t overly dependent on a single software vendor. Additionally, while our commercial products enhance the development and use of our open-source software, they are not fundamentally required for those without the need or the ability to pay for them.
Today, millions of people download and use RStudio open-source products in their daily lives. Additionally, more than 1,350 organizations that have the need and ability to pay for our commercial products help us to sustain this work. It is an inspiration to consider that we are helping many participate in global economies that increasingly reward data literacy, and that our tools help produce insights essential to making the modern world a better place.
Some of the significant open-source projects led or substantially supported by RStudio include the following popular software for data science:
The tidyverse is an opinionated collection of R packages designed for data science. All packages share an underlying design philosophy, grammar, and data structures.
The tidyverse consists of 27 R packages including ggplot2, dplyr, tidyr, and readr.
There are approximately 6.5 full-time equivalent (FTE) RStudio employees developing tidyverse and related open-source products as of December 2020.
Tidymodels is a cohesive collection of packages that perform tasks relevant to statistical modeling and machine learning. Tidymodels packages share a common syntax and design philosophy, and are designed to work seamlessly with Tidyverse packages.
There are currently 35 tidymodels packages, an increase of 8 from 2019. Popular tidymodels packages include parsnip, rsample, recipes, tune, and yardstick.
There are 3.5 full-time equivalent (FTE) RStudio employees developing tidymodels and related open-source products as of December 2020.
Shiny is a popular R package and web application framework that makes it easy to tell data stories in interactive point-and-click web applications. Shiny applications can be shared with others via an open-source Shiny Server, the commercial hosted shinyapps.io service, or with RStudio Connect. Shiny and related packages include shiny, shinytest, shinyloadtest, shinydashboard, leaflet, and crosstalk.
There are 7 full-time equivalent (FTE) employees developing the open-source Shiny and Shiny Server products as of December 2020.
R Markdown is an authoring format for computational documents, which are fully reproducible reports whose analysis can be re-executed on new data with the click of a button. R Markdown documents can be shared as Notebooks, slideshows, web pages, email attachments, print documents, and more.
There are 3 full-time equivalent (FTE) RStudio employees developing R Markdown and related open-source products as of December 2020.
RStudio increases the efficiency of R users by making open-source R packages that connect data scientists to spreadsheets, databases, distributed storage frameworks for big data, machine learning platforms, and the programming environments of other languages, like Python.
There are approximately 2 full-time equivalent (FTE) RStudio-funded developers creating connectivity-related open-source packages as of December 2020.
R-lib is a large collection of R packages that make it easier to build, find, and use effective tools for data analysis.
There are 2 full-time equivalent (FTE) RStudio employees developing r-lib and related open-source packages as of December 2020.
RStudio is a multi-language IDE designed for Data Science with R and Python. It augments the standard code console with an editor that can display Notebooks, launch apps, highlight code syntax, spot code errors, and directly execute code. Built into the IDE are also tools for debugging, plotting, browsing files, and managing project histories and workspaces. Together these tools make data scientists and developers much more efficient.
There are 8 full-time equivalent (FTE) employees developing the RStudio IDE desktop and server products as of December 2020.
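For readers who want the staffing figures above in one place, the following is a minimal sketch that tallies the FTE counts quoted for each project area. It assumes the areas do not overlap and treats the "approximately" figures as exact; the dictionary and variable names are illustrative only and do not come from the report.

```python
# FTE counts per open-source project area, as quoted above (December 2020).
# Assumption: the areas are disjoint, and "approximately" values are exact.
fte_by_area = {
    "tidyverse": 6.5,
    "tidymodels": 3.5,
    "Shiny and Shiny Server": 7.0,
    "R Markdown": 3.0,
    "Connectivity packages": 2.0,
    "r-lib": 2.0,
    "RStudio IDE": 8.0,
}

# Sum across all listed areas.
total_fte = sum(fte_by_area.values())

# List areas from largest to smallest with their share of the listed total.
for area, fte in sorted(fte_by_area.items(), key=lambda item: -item[1]):
    share = 100 * fte / total_fte
    print(f"{area:<24} {fte:>4.1f} FTE  ({share:.0f}% of listed total)")

print(f"Listed total: {total_fte:.1f} FTE")
```

Under those assumptions the listed areas add up to about 32 FTE. This covers only the projects itemized here, so it should not be read as RStudio's total open-source headcount.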
In addition to the open-source software that we make freely available, and our support for NumFOCUS and Ursa Computing, RStudio recognizes the importance of contributing financially to other important open-source initiatives. To date, RStudio has given over $900,000 to projects led by others. Current open-source related commitments include contributing to the R Consortium, and to authors and maintainers of fourteen smaller open-source projects, whose software we use.
The B Lab Impact Assessment (see https://bimpactassessment.net/) is measured on a 200-point scale, with a minimum score of 80 required for a company to be eligible for B Lab certification. RStudio completed its first Impact Assessment in the fall of 2019, and received an overall score of 86.1. To put this score in context, the average score of “ordinary” (non-certified B Corp) businesses of our size is 53.4, while the median score for companies on the B Lab’s list of “Best for the World” honorees is 131. [Source: bcorporation.net/directory/rstudio.]
Details of RStudio’s 2019 Impact Assessment scores can be seen in our 2019 PBC Report.
RStudio seeks to improve our internal governance, increase our workforce diversity and employee development efforts, expand our stewardship of the environment, deepen our engagement in our communities, and serve customers, so that our public benefit will continue to improve each year.
In our initial assessment we received high marks for incorporating as a benefit corporation, for the health, wellness, safety, and financial security of our employees, and for educating and serving customers. We identified formal goal setting, career development, diversity, equity & inclusion, civic engagement & giving, and air & climate as areas for improvement.
In 2020 we made notable progress in the following areas:
A company’s positive governance impact is measured by the extent to which the company is accountable to stakeholders, and the extent to which its decision-making is transparent to all constituents. As noted last year, RStudio scored 16.1 points out of a possible 21.9+ points in the Governance Impact Area, including 10 points awarded for the specific legal structures we have put in place as a Benefit Corporation that preserve our mission and consider our stakeholders regardless of company ownership.
RStudio continues to share financial and other company performance information transparently with its shareholders and employees. The company strengthened its formal planning process and added experienced customer education and go-to-market leaders in 2020 to support more formal goal setting and business planning, organizational design, and growth. We continue to have a relatively broad pool of shareholders, including many current and former employees. RStudio shareholders with a beneficial interest greater than 5% include J.J. Allaire, CEO, and Tareef Kawaf, President.
To improve our governance impact in 2021, RStudio is developing metrics to help us more definitively track the success of our mission.
A company’s positive impact on workers is measured by the extent to which it maintains a compensation and benefit structure beneficial to its employees, supports ongoing career development, and fosters a positive work environment. As noted last year, RStudio scored 30.5 out of a possible 43.2 points in the Workers impact area of the B Lab assessment, attributable in large part to our generous benefit offerings, including 12 weeks of paid leave for all new parents, a 401k matching program, and an annual profit-sharing plan open to all regular employees. RStudio’s flexible work practices, particularly our remote model and unlimited PTO policies, were also significant factors in our impact in this area and served both employees and the company well during the Covid-19 pandemic.
Despite missing out on valuable in-person company gatherings in 2020 because of the pandemic, the company was able to implement processes and tools for 360° feedback and employee surveys, and it also introduced a mid-year wellness survey. These steps will strengthen future assessments of RStudio’s impact in this area.
To foster a positive work environment in 2021, RStudio will conduct regular employee surveys to gauge engagement and satisfaction and develop improved career development guidelines.
Community impact is measured by the extent to which a company creates jobs within local communities; fosters inclusion and diversity within the organization; demonstrates civic engagement through philanthropy and advocacy; and favors suppliers that share B Corp values. As noted last year, RStudio scored 11.9 out of a possible 20+ points in the Community impact area. Our inclusive hiring practices, equitable pay ratios (e.g., between the highest- and lowest-paid workers), charitable giving history, and strong job-growth rates are some of the factors behind this positive impact.
Some elements of the community impact measures, especially those that analyze RStudio’s economic impact on “local” geographies, may be difficult for us to achieve given our remote workforce model. On the other hand, we can significantly strengthen our impact on the community by furthering the diversity within our team – for example, by increasing the percentage of women employees and managers, broadening the age distribution of our workers, and continuing to actively source talent from underrepresented or minority social, racial, and ethnic groups.
In 2020 RStudio strengthened its Community impact in a number of ways:
For 2021 RStudio will implement more accessible, voluntary tracking of employee demographics as part of our continuing effort to reflect the diversity of our Community.
A company’s positive environmental impact is measured by the extent to which its products, services, suppliers, and decisions promote positive environmental outcomes. As noted last year, RStudio scored 3.4 out of a possible 8.9+ points across all Environment impact area questions.
As a software company, we do not conduct any physical manufacturing, and our marketing, sales, and support models are almost entirely digital – eliminating many of the most common sources of environmental hazards found in business operations. Beyond this environmentally-neutral base, RStudio’s positive environmental impact is largely based on our remote-first work culture, which drastically reduces the footprint of our physical workspace, as well as the pollution generated by daily commuting.
To further improve our environmental impact, we began measuring greenhouse gas (GHG) emissions from company travel and events and purchased $10,000 of carbon offsets in 2020. As of December 30, 2020, RStudio has offset 900 tonnes of CO2 emissions, an amount equivalent to the total emissions produced by company air travel since our inception. These offset investments have been directed to conservation projects in New England and the Amazon, and to reforestation in South America and Africa. In 2021 we will continue offset purchases and work towards our goal of becoming Climate Neutral Certified.
Customer impact is measured by the degree to which a company’s products and/or services deliver social, educational, or environmental value to customers, as well as the extent to which company practices serve customer interests in areas such as quality control, data privacy, and customer satisfaction. As noted last year, RStudio earned 24.1 out of a possible 25+ points in the customer impact area, with 20.6 of these points awarded for the strong orientation toward education, knowledge-sharing, and skill-building in our products and community contributions.
While our scores in the customer impact area are already strong, we provided additional methods for all of our stakeholders to assess customer feedback and customer satisfaction by participating in the Notebook-Based Predictive Analytics and Machine Learning Q3 2020 Forrester Wave™, where we were rated a “Strong Performer”. The ratings were based on Forrester analyst scoring, part of which included conversations with RStudio customers. We also engaged with the enterprise software-focused review site TrustRadius to capture authentic customer reviews. While it is sometimes difficult to distinguish reviews of our free open-source software, which is less suited to enterprise-scale use, from reviews of our enterprise-focused professional products, our average rating is 8.8 out of 10, with 75 reviews at the end of 2020.
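The five impact-area scores quoted in this report can be checked against the overall score with a short tally. The sketch below is an illustration only: it uses the figures stated above, and the names (`area_scores`, `REPORTED_OVERALL`, `CERTIFICATION_MINIMUM`) are hypothetical labels rather than anything defined by B Lab. Decimal arithmetic is used so the one-decimal scores add up exactly.

```python
from decimal import Decimal

# Impact-area scores as quoted in this report (2019 B Impact Assessment).
area_scores = {
    "Governance": Decimal("16.1"),
    "Workers": Decimal("30.5"),
    "Community": Decimal("11.9"),
    "Environment": Decimal("3.4"),
    "Customers": Decimal("24.1"),
}

REPORTED_OVERALL = Decimal("86.1")     # overall score quoted in the report
CERTIFICATION_MINIMUM = Decimal("80")  # minimum score for certification eligibility

# Add up the five area scores and compare with the reported figures.
total = sum(area_scores.values(), Decimal("0"))
gap = REPORTED_OVERALL - total
margin = REPORTED_OVERALL - CERTIFICATION_MINIMUM

print(f"Sum of area scores: {total}")
print(f"Difference from reported overall score: {gap}")
print(f"Margin above certification minimum: {margin}")
```

The quoted area scores sum to 86.0, which is 0.1 below the reported overall score of 86.1. The report does not explain the difference; rounding of the individual area scores is one plausible cause. The overall score sits 6.1 points above the 80-point minimum.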
RStudio endeavors to create public benefits through the hard work of our employees and partners and in collaboration with the open-source data science community we serve. As a public benefit corporation we will continue to pursue and report improvements in internal governance, workforce diversity and employee development efforts, our stewardship of the environment, engagement in our communities, and, of course, substantial contributions to open-source software for science.
Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.
RStudio and Shiny are registered trademarks of RStudio, PBC.