Indicators overview
Each line in the table represents a company. Even when we talk about companies, we also mean other economic players such as universities, research institutes and associations. Our data is sourced from national commercial registers and other public sources. See country coverage for the number of companies per country.
Our central identifier for a company is the web address (domain). If several entries in a commercial register share a domain (e.g. in the case of companies from the same group of companies), we identify the company's headquarters and only keep this observation.
Note that not all of these columns are shown in the dashboard. However, you can extract all of the information when you download our data.
| Colum name | Description |
|---|---|
domain | The main URL of the respective company. It is used as a unique identifier in our database. |
domain_redirect | True if the domain automatically redirects to another webpage. |
country | Country (NUTS-0) where the firm is located. Official name (e.g. "Deutschland"). |
country_english | English name of country (e.g . "Germany") |
country_code | ISO code of the country. |
state | State (NUTS-1) within the country. |
state_code | NUTS code representing the state. |
region | Region (NUTS-2) within the state. |
region_code | NUTS code representing the region. |
district | District or locality (NUTS-3) within the region. |
district_code | Code representing the district. |
title | Title provided in the HTML head of the company website. |
keywords | Keywords provided in the HTML head of the company website. |
description | Description provided in the HTML head of the company website. |
main_contact_mail | Main contact e-mail address. |
all_mails | All e-mail addresses found on the respective website. |
main_contact_number | Main contact phone number. |
all_phones | All phone numbers found on the respective website. |
all_linkedin_profiles | All links to LinkedIn user profiles found on the respective website. |
all_linkedin_companies | All links to LinkedIn company profiles found on the respective website. |
all_twitter_links | All links to X/Twitter accounts found on the respective website. |
all_facebook_links | All links to Facebook profiles found on the respective website. |
additive_manufacturing_intensity | Numerical engagement level of the firm in additive manufacturing. |
additive_manufacturing_intensity_level | Engagement level of the firm in additive manufacturing (very low, low, medium, high, very high). |
additive_manufacturing_keywords | Keywords potentially related to additive manufacturing on the respective website. |
additive_manufacturing_keywords_hits | Number of individual hits per keyword from additive_manufacturing_keywords column on the respective website. |
additive_manufacturing_total_hits | Total number of additive_manufacturing_keywords hits on the respective website. |
ai_intensity | Numerical engagement level of the firm in artificial intelligence. |
ai_intensity_level | Engagement level of the firm in artificial intelligence (very low, low, medium, high, very high). |
ai_keywords | Keywords potentially related to artificial intelligence on the respective website. |
ai_keywords_hits | Number of individual hits per keyword from ai_keywords column on the respective website. |
ai_total_hits | Total number of ai_keywords hits on the respective website. |
blockchain_intensity | Numerical engagement level of the firm in blockchain technology. |
blockchain_intensity_level | Engagement level of the firm in blockchain technology (very low, low, medium, high, very high). |
blockchain_keywords | Keywords potentially related to blockchain technology on the respective website. |
blockchain_keywords_hits | Number of individual hits per keyword from blockchain_keywords column on the respective website. |
blockchain_total_hits | Total number of blockchain_keywords hits on the respective website. |
digital_health_intensity | Numerical engagement level of the firm in digital health technologies |
digital_health_intensity_level | Engagement level of the firm in digital health technologies (very low, low, medium, high, very high). |
digital_health_keywords | Keywords potentially related to digital health technologies on the respective website. |
digital_health_keywords_hits | Number of individual hits per keyword from digital_health_keywords column on the respective website. |
digital_health_total_hits | Total number of digital_health_keywords hits on the respective website. |
domain_provider_true | True if the firm is actually just a domain provider. |
news_probability | Low, medium, high or very high probability that the website is a news site (e.g. newspaper, blog, journal, news ticket). |
energy_intensity | Numerical engagement level of the firm in energy technologies. |
energy_intensity_level | Engagement level of the firm in energy technologies (very low, low, medium, high, very high). |
energy_keywords | Keywords potentially related to energy technologies on the respective website. |
energy_keywords_hits | Number of individual hits per keyword from energy_keywords column on the respective website. |
energy_total_hits | Total number of energy_keywords hits on the respective website. |
mobility_intensity | Numerical engagement level of the firm in mobility technologies. |
mobility_intensity_level | Engagement level of the firm in mobility technologies (very low, low, medium, high, very high). |
mobility_keywords | Keywords potentially related to mobility technologies on the respective website. |
mobility_keywords_hits | Number of individual hits per keyword from mobility_keywords column on the respective website. |
mobility_total_hits | Total number of mobility_keywords hits on the respective website. |
sustainability_intensity | Numerical engagement level of the firm in ecological sustainability. |
sustainability_intensity_level | Engagement level of the firm in ecological sustainability (very low, low, medium, high, very high). |
sustainability_keywords | Keywords potentially related to ecological sustainability on the respective website. |
sustainability_keywords_hits | Number of individual hits per keyword from sustainability_keywords column on the respective website. |
sustainability_total_hits | Total number of sustainability_keywords hits on the respective website. |
innoprob | Numerical probability of the company to be an innovator (Innoprob score). Only available in German-speaking countries. |
innoprob_innovator_probability | Probability of the company to be an innovator (Innoprob score) (very low, low, medium, high, very high). Only available in German-speaking countries. |
social_innoprob | Numerical probability of the company to be an innovator (Social Innoprob score). Only available in German-speaking countries. |
social_innoprob_innovator_probability | Probability of the company to be a social innovator (Social Innoprob score) (very low, low, medium, high, very high). Only available in German-speaking countries. |
retailer_probability | Probability of being a retailer. |
links | Incoming and outgoing hyperlinks to and from the respective website. |
links_count | Number of incoming and outgoing hyperlinks to and from the respective website. |
outgoing_links | Outgoing hyperlinks from the respective website. |
outgoing_links_count | Number of outgoing hyperlinks from the respective website. |
incoming_links | List of incoming hyperlinks to the respective website. |
incoming_links_count | Count of incoming hyperlinks to the respective website. |
techstack | List of website technologies used on the website. |
new_register_entry | The value in this column is True if the commercial register entry for this observation is less than 5 years old. |
geolocation | Point coordinates of firm location in well-known text (EPSG: 4326) format. |
lon | Longitude of firm location (EPSG: 4326) |
lat | Latitude of firm location (EPSG: 4326) |