Skip to content
ESG solutions, ESG Overview

Procuring Environmental and Social Governance Data from Various Public Sources

Posted on : October 8th 2021

Author : Sudhakaran Jampala

Environmental and Social Governance Data Acquisition from Assorted Sources and Formats

Reliable environmental and social governance (ESG) scores require a lot of data that need to be parsed, analyzed, and translated into benchmark scores. However, analyzing, even collecting, ESG data is a complex process. Stakeholders primarily collect ESG data from various public sources, such as the World Bank, the United Nations, the International Monetary Fund, other civic agencies like nongovernmental organizations, company websites, news clippings, and social media.

Generally, only a limited number of ESG indicators are developed from privately sourced data points. [1] For example, rating providers use proprietary methodologies that use company-reported ESG data, along with data from websites, other public sources (like news articles), and, in some cases, primary research contacts within companies. [2]

Typically, even large firms do not report ESG parameters in the same lucid way as they do financial metrics. Thus, every opportunity to gather ESG data points must be grabbed, so those points can be put through further vetting and analysis. Every sliver of information from any reliable public source is precious because information taken from company sources can be compared against any evidence available from non-company public sources, such as news articles. Doing so also helps with triage and triangulation processes.

The Blending Process

Investors can leverage sources such as reports by the U.S. Occupational Safety and Health Administration (OSHA) to gather more evidence about actual company practices. [3] For instance, OSHA reports can provide evidence about companies’ workforce pay practices. This type of evidence can add value and deepen insights from data points culled from other public sources, such as World Bank reports or articles published by the United Nations.

Because ESG criteria are generally not integrated under a structured data-oriented manner, capturing whatever data are available should be systematic. [4] Where ESG analytical frameworks lack transparency, ESG integration can be facilitated by systematic data capture, which provides the additional benefit of repeatedly obtaining data in the future.

Overcoming the Challenges

Investors find that most ESG data providers procure data from public sources. The key challenge for the investment community is to systematically process these data, structure them, and keep track of updates. Investors who do it well gain competitive advantage. Data providers’ opaque methodologies can hide how exactly data points are treated or what assumptions are involved. Thus, having only top-line scores available, without underlying data or transparent calculations, for companies is a vital issue to address.

Exhibit 1: Challenges in Dealing with ESG Data


In such a scenario, a competitive advantage is possible by blending data available from company websites, which contain a myriad of information, with rarely tapped data sources such as OSHA. In essence, adding these data to existing ESG ratings can improve the underlying models and their mapping to financial materiality.

The main ESG challenge of today is the lack of universally accepted principles regarding how companies disclose ESG metrics. Thus, tracking and assessing how companies are performing on their ESG goals are difficult. Systematically capturing data from various sources and blending them with financially material factors can help better join the dots.


Large, still untapped public data sources can improve ESG research and overall data acquisition strategies. Firms that tap into these sources will find tremendous value in them, especially in uncovering financial or material implications more effectively.

Because ESG reporting, unlike financial reporting, is not regulated, companies must optimize their data supply chain across public and private sources to monitor any change. Moreover, third-party ESG score and ratings providers largely provide top-line scores, often leaving customers to accept at face value any methodology deployed.

As a pushback, many companies are capturing data themselves or through vendors, thereby blending the treated data sets into their ESG analytics. Thus, procuring and monitoring ESG data will depend more on automation, particularly artificial intelligence and machine learning, for scale and accuracy.


[1]. Environmental Finance, “ESG Data Guide 2021,” n.d.,

[2]. Ibid

[3]. Raj Gnanaraja and Gary Shorter, “Introduction to Financial Services: Environmental, Social, and Governance (ESG) Issues,” IF11716, Congressional Research Service, January 4, 2021,

[4]. Sebastien Boitreaud, Ekaterina M. Gratcheva, Byran Gurhy, Cindy Paladines, and Andrius Skarnulis, “Riding the Wave: Navigating the ESG Landscape for Sovereign Debt Managers,” Equitable Growth, Finance & Institutions Insight, World Bank Group, 2020,

Similar Blogs

The process of data extraction involves identifying and recovering alternative and semi-structured data from various data sources such as files, XMLs, JSON, etc.

Capital markets are an excellent example of a perfect competition. The nature of the market is such the participants have to be competitive and result focussed. For instance, brokerages and investment banks have to deliver passive gains for their clients and, at the same time, earn a margin for themselves.

Today’s ESG analytics require processing data, patterns, and hidden connections to provide insights that investors, asset managers, and companies need. For example, Straive deploys advanced machine learning algorithms to analyze reams of documents to collect evidence across executive statements for signs of vagueness or obfuscation.

Talking about using data to gain insights is easy. But actually doing it will uncover a newer set of challenges, especially when it comes to unstructured data.

Integrating ESG data into commodities trading operations requires structured, easy-to-consume data. By their nature, ESG data resist such integration, and highly scalable data solutions across the data life cycle are needed to allow stakeholders to deploy end-to-end data solutions for a successful data-to-intelligence journey.

We want tohear from you

Leave a message

Our solutioning team is eager to know about your challenge and how we can help.