Each row in the table represents a company. We use the term "company" broadly: it also covers other economic actors such as universities, research institutes and associations. Our data is sourced from national commercial registers and other public sources. See country coverage for the number of companies per country.

Our central identifier for a company is its web address (domain). If several commercial register entries share a domain (e.g. companies that belong to the same group), we identify the group's headquarters and keep only that observation.
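The deduplication described above can be pictured with a minimal sketch. How the headquarters is actually identified is not documented here, so the `is_headquarters` flag, the `name` field and the fallback to the first entry are purely hypothetical stand-ins for that step.

```python
from collections import defaultdict


def keep_headquarters(entries):
    """Collapse register entries that share a domain into one row per domain."""
    by_domain = defaultdict(list)
    for entry in entries:
        # Normalise so "Example.com" and "example.com" end up in the same group.
        by_domain[entry["domain"].strip().lower()].append(entry)

    result = []
    for group in by_domain.values():
        # Hypothetical rule: prefer the entry flagged as headquarters and
        # fall back to the first entry of the group if none is flagged.
        hq = next((e for e in group if e.get("is_headquarters")), group[0])
        result.append(hq)
    return result


# Invented register entries, for illustration only.
entries = [
    {"domain": "example.com", "name": "Example Services GmbH", "is_headquarters": False},
    {"domain": "Example.com", "name": "Example Holding AG", "is_headquarters": True},
    {"domain": "other.org", "name": "Other e.V.", "is_headquarters": False},
]
unique = keep_headquarters(entries)
```

After this step `unique` holds one entry per domain, which is what allows the domain to serve as a unique identifier.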

Not all of the columns listed below are shown in the dashboard, but all of them are included when you download our data.
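As a rough illustration of working with a download, the sketch below reads an export and keeps only a few columns. It assumes the download is a CSV file whose header row uses the column names from the table; the actual file format may differ, and the sample values are invented.

```python
import csv
import io

# Stand-in for a downloaded file (assumed CSV layout, invented values).
sample_export = io.StringIO(
    "domain,country_code,ai_intensity_level,new_register_entry\n"
    "example.com,DE,high,True\n"
    "other.org,AT,very low,False\n"
)

# Columns to keep; any column name from the table could be listed here.
wanted = ["domain", "ai_intensity_level"]

rows = [
    {column: row[column] for column in wanted}
    for row in csv.DictReader(sample_export)
]
```

For a real file, `sample_export` would be replaced by something like `open(path, newline="", encoding="utf-8")`, where `path` points to the downloaded file.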

Column name Description
domain The main URL of the respective company. It is used as a unique identifier in our database.
domain_redirect True if the domain automatically redirects to another webpage.
country Country (NUTS-0) where the firm is located. Official name (e.g. "Deutschland").
country_english English name of the country (e.g. "Germany").
country_code ISO code of the country.
state State (NUTS-1) within the country.
state_code NUTS code representing the state.
region Region (NUTS-2) within the state.
region_code NUTS code representing the region.
district District or locality (NUTS-3) within the region.
district_code Code representing the district.
title Title provided in the HTML head of the company website.
keywords Keywords provided in the HTML head of the company website.
description Description provided in the HTML head of the company website.
main_contact_mail Main contact e-mail address.
all_mails All e-mail addresses found on the respective website.
main_contact_number Main contact phone number.
all_phones All phone numbers found on the respective website.
all_linkedin_profiles All links to LinkedIn user profiles found on the respective website.
all_linkedin_companies All links to LinkedIn company profiles found on the respective website.
all_twitter_links All links to X/Twitter accounts found on the respective website.
all_facebook_links All links to Facebook profiles found on the respective website.
additive_manufacturing_intensity Numerical engagement level of the firm in additive manufacturing.
additive_manufacturing_intensity_level Engagement level of the firm in additive manufacturing (very low, low, medium, high, very high).
additive_manufacturing_keywords Keywords potentially related to additive manufacturing on the respective website.
additive_manufacturing_keywords_hits Number of individual hits per keyword from additive_manufacturing_keywords column on the respective website.
additive_manufacturing_total_hits Total number of additive_manufacturing_keywords hits on the respective website.
ai_intensity Numerical engagement level of the firm in artificial intelligence.
ai_intensity_level Engagement level of the firm in artificial intelligence (very low, low, medium, high, very high).
ai_keywords Keywords potentially related to artificial intelligence on the respective website.
ai_keywords_hits Number of individual hits per keyword from ai_keywords column on the respective website.
ai_total_hits Total number of ai_keywords hits on the respective website.
blockchain_intensity Numerical engagement level of the firm in blockchain technology.
blockchain_intensity_level Engagement level of the firm in blockchain technology (very low, low, medium, high, very high).
blockchain_keywords Keywords potentially related to blockchain technology on the respective website.
blockchain_keywords_hits Number of individual hits per keyword from blockchain_keywords column on the respective website.
blockchain_total_hits Total number of blockchain_keywords hits on the respective website.
digital_health_intensity Numerical engagement level of the firm in digital health technologies.
digital_health_intensity_level Engagement level of the firm in digital health technologies (very low, low, medium, high, very high).
digital_health_keywords Keywords potentially related to digital health technologies on the respective website.
digital_health_keywords_hits Number of individual hits per keyword from digital_health_keywords column on the respective website.
digital_health_total_hits Total number of digital_health_keywords hits on the respective website.
domain_provider_true True if the firm is actually just a domain provider.
news_probability Low, medium, high or very high probability that the website is a news site (e.g. newspaper, blog, journal, news ticker).
energy_intensity Numerical engagement level of the firm in energy technologies.
energy_intensity_level Engagement level of the firm in energy technologies (very low, low, medium, high, very high).
energy_keywords Keywords potentially related to energy technologies on the respective website.
energy_keywords_hits Number of individual hits per keyword from energy_keywords column on the respective website.
energy_total_hits Total number of energy_keywords hits on the respective website.
mobility_intensity Numerical engagement level of the firm in mobility technologies.
mobility_intensity_level Engagement level of the firm in mobility technologies (very low, low, medium, high, very high).
mobility_keywords Keywords potentially related to mobility technologies on the respective website.
mobility_keywords_hits Number of individual hits per keyword from mobility_keywords column on the respective website.
mobility_total_hits Total number of mobility_keywords hits on the respective website.
sustainability_intensity Numerical engagement level of the firm in ecological sustainability.
sustainability_intensity_level Engagement level of the firm in ecological sustainability (very low, low, medium, high, very high).
sustainability_keywords Keywords potentially related to ecological sustainability on the respective website.
sustainability_keywords_hits Number of individual hits per keyword from sustainability_keywords column on the respective website.
sustainability_total_hits Total number of sustainability_keywords hits on the respective website.
innoprob Numerical probability of the company to be an innovator (Innoprob score). Only available in German-speaking countries.
innoprob_innovator_probability Probability of the company to be an innovator (Innoprob score) (very low, low, medium, high, very high). Only available in German-speaking countries.
social_innoprob Numerical probability of the company to be a social innovator (Social Innoprob score). Only available in German-speaking countries.
social_innoprob_innovator_probability Probability of the company to be a social innovator (Social Innoprob score) (very low, low, medium, high, very high). Only available in German-speaking countries.
retailer_probability Probability of being a retailer.
links Incoming and outgoing hyperlinks to and from the respective website.
links_count Number of incoming and outgoing hyperlinks to and from the respective website.
outgoing_links Outgoing hyperlinks from the respective website.
outgoing_links_count Number of outgoing hyperlinks from the respective website.
incoming_links List of incoming hyperlinks to the respective website.
incoming_links_count Count of incoming hyperlinks to the respective website.
techstack List of website technologies used on the website.
new_register_entry True if the commercial register entry for this observation is less than 5 years old.
geolocation Point coordinates of the firm location in well-known text (WKT) format (EPSG:4326).
lon Longitude of the firm location (EPSG:4326).
lat Latitude of the firm location (EPSG:4326).
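The geolocation column can be converted back into separate longitude and latitude values with a small parser. The sketch below assumes the value looks like `POINT (lon lat)`, i.e. longitude first, as is common for WKT points; this axis order is an assumption and should be checked against the lon and lat columns of the same row. The function name `parse_point` and the sample coordinates are illustrative.

```python
import re

# Matches e.g. "POINT (8.4660 49.4875)"; x (longitude) is assumed to come first.
_POINT_RE = re.compile(
    r"^\s*POINT\s*\(\s*([-+]?\d+(?:\.\d+)?)\s+([-+]?\d+(?:\.\d+)?)\s*\)\s*$",
    re.IGNORECASE,
)


def parse_point(wkt):
    """Return (lon, lat) from a WKT point string, or raise ValueError."""
    match = _POINT_RE.match(wkt)
    if match is None:
        raise ValueError(f"not a WKT point: {wkt!r}")
    lon, lat = float(match.group(1)), float(match.group(2))
    # Valid ranges for geographic coordinates in EPSG:4326.
    if not (-180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0):
        raise ValueError(f"coordinates out of range: {wkt!r}")
    return lon, lat


lon, lat = parse_point("POINT (8.4660 49.4875)")
```

The range check is a cheap guard against swapped or corrupted values, although it cannot detect a swap when both numbers fall within the latitude range.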