Skip to main content

Research databases

This page provides access to databases that have been made available, following publication in a journal or in the ECGI Working Paper Series, by ECGI Research Members to other members of the ECGI community, for the public good. Commercial datasets are not accepted. 

data

To add a database to this repository, please send it to admin@ecgi.org

Available databases:  

Global Corporate Patent Dataset (GCPD v2) (Bena, Ferreira, Matos, Pires, and Barkemeyer, 2026)

The Global Corporate Patent Dataset (v2) links, using sophisticated matching techniques, about 5.3 million patents — awarded by the U.S. Patent and Trademark Office (USPTO) between January 1976 and September 2023 — to 25 thousand publicly listed companies across 99 countries (as identified by the location of parent companies’ headquarters).

Originally released in 2019, the dataset has been significantly updated and expanded in 2026.

Please consult Section 2.4 and Appendix D of this working paper for detailed information about the construction of the dataset: Barkemeyer, David; Bena, Jan; and Li, Xiyue (2026), “Technology Life Cycles: Investment, Growth, and Firm Value”.

If you agree to the terms below, you may download the granular dataset directly from here

patent

Read the Working paper ↓

Terms of Use

The dataset available for academic and other non-commercial purposes only. Please note:

1. Citation requirement: If you use the data, please cite both:

  1. Bena, Ferreira, Matos, and Pires (2017), “Are Foreign Investors Locusts? The Long-Term Effects of Foreign Institutional Ownership,” Journal of Financial Economics 126(1), 122–146.
  2. Barkemeyer, Bena, and Li (2026), “Technology Life Cycles: Investment, Growth, and Firm Value” SSRN Working Paper.

2. The dataset is licensed under a Creative Commons Attribution–NonCommercial 4.0 International License

3. Instead of sharing the data file directly, please refer others to: https://patents.darden.virginia.edu/get-data to ensure they access the latest version.

4. This remains a beta release. There is no guarantee, implied or otherwise, regarding accuracy. If you identify errors, please contact us at: GCPD@darden.virginia.edu

Dataset Contributor

Pedro Matos

James A. & Stacy Cooper Bicentennial Professor of Business Administration (Finance)
Darden School of Business, University of Virginia
Research Member

Startup board composition data (Ewens and Malenko, 2025)

The dataset reports the dynamic composition of 7800 venture capital-backed startup boards from first VC financing to exit over the 2002-2017 period, classifying directors into three types - executives, investors, and independent. It corresponds to the sample used in Ewens and Malenko (2025) "Board Dynamics over the Startup Lifecycle" (Journal of Finance, forthcoming). 

board room

Read the Working paper ↓

Data description

Data description and sample creation | Building this database used a combination of Form D filings and VentureSource. Users of the data should read section 2 ("Data") and the internet appendix from the paper.

Sample code | Please see the code folder for Python and Stata code to recreate the first figures from the original paper.

Data structure | The file has the following fields":

  • year: the year of the startup board data
  • numOut: the number of independent directors (or outsiders)
  • ...

Dataset Contributor

Nadya Malenko

Professor of Finance and Wargo Family Faculty Fellow
Boston College
Research Member

Incorporations of Foreign Firms in the UK, 1997-2006

This dataset provides information on the number of newly incorporated firms in the UK from 1997 to 2006, categorized by the country of origin of their directors. Details on the construction of the database can be found in Where Do Firms Incorporate? Deregulation and the Cost of Entry by Marco Becht (ECARES and ECGI), Colin Mayer (Said Business School, University of Oxford, CEPR, and ECGI), and Hannes Wagner (London Business School and University of Munich). 

uk

Read the Working paper ↓

Data description

This dataset presents information on the number of firms newly incorporated in the UK between 1997 and 2006, classified according to the country of residence of their directors. The origin of firms is determined based on two distinct criteria: 

  • Column H ("Majority of Directors from Country X") – Firms are classified by the country in which the majority of their directors reside. 
  • Column I ("All Directors from Country X") – Firms are classified by the country in which all directors reside.

The database can be downloaded in either of two formats:

- Click here for an Excel spreadsheet

- Click here for Stata format

Dataset Contributor

Marco Becht

Professor of Finance and the Goldschmidt Professor of Corporate Governance
Solvay Brussels School of Economics and Management, Université libre de Bruxelles
Board Member, Fellow, Research Member

Common Ownership

This dataset provides access to the code and analytical methodology used by Analysis Group to evaluate the findings of the 2018 AST Paper ("Anticompetitive Effects of Common Ownership" by Azar, Schmalz, and Tecu). The AST Paper suggested that institutional investors holding shares in multiple firms within a concentrated industry—referred to as common ownership—lead to decreased competition and higher consumer costs.

network

Read the Working papers ↓

Data description

This dataset provides access to the code and analytical methodology used by Analysis Group to evaluate the findings of the AST Paper, which examined the relationship between institutional investors' common ownership of firms in concentrated industries and its alleged impact on competition and consumer costs. 

The full package needed to replicate the sensitivity analysis challenging the empirical evidence underpinning these proposals can be downloaded at: https://www.blackrock.com/common-ownership.

Dataset Contributor

Subscribe