Variable Overview

Company Identification

name

String

Real-value

Company or organization name

type

String

Company type

Classification of the organization type

domain

String

Real-value

Primary web domain of the company

domain_alias

String

Real-value

Alternative or secondary domain names

domain_provider_true

Boolean

0, 1

Indicates if the domain provider information is verified

domain_redirect

Boolean

0, 1

Indicates if the domain redirects to another URL

b2x

String

B2B, B2C, B2G

Business model classification (Business-to-Business, Business-to-Consumer, Business-to-Government)

employee_class

String

0-10 employees, 11-50 employees, 51-250 employees, 250+ employees, unknown

Company size classification based on number of employees

new_register_entry

Boolean

true, false

Indicates if this is a newly registered entry in the database

Geographic Information

continent

String

Asia, Africa, North America, South America, Antarctica, Europe, Australia

Continental location

country

String

Real-value

Country name

country_code

String

Real-value

ISO country code

state

String

Real-value

State or province name

state_code

String

Real-value

State or province code

region

String

Real-value

Regional administrative division

region_code

String

Real-value

Regional code identifier

district

String

Real-value

District administrative division

district_code

String

Real-value

District code identifier

municipality

String

Real-value

Municipal administrative division

municipality_code

String

Real-value

Municipal code identifier

address

String

Real-value

Physical address of the organization

Technology Intensity Measures

Artificial Intelligence

Variable
Data Type
Values
Description

ai_intensity

Double

Real-value

Numerical measure of AI technology adoption/focus

ai_intensity_level

String

very low, low, medium, high, very high

Categorical classification of AI intensity

ai_keywords

String

Real-value

Keywords related to AI technologies used by the company

Additive Manufacturing (3D Printing)

Variable
Data Type
Values
Description

additive_manufacturing_intensity

Double

Real-value

Numerical measure of additive manufacturing adoption

additive_manufacturing_intensity_level

String

very low, low, medium, high, very high

Categorical classification of additive manufacturing intensity

additive_manufacturing_keywords

String

Real-value

Keywords related to additive manufacturing technologies

Blockchain Technology

Variable
Data Type
Values
Description

blockchain_intensity

Double

Real-value

Numerical measure of blockchain technology adoption

blockchain_intensity_level

String

very low, low, medium, high, very high

Categorical classification of blockchain intensity

blockchain_keywords

String

Real-value

Keywords related to blockchain technologies

Digital Health

Variable
Data Type
Values
Description

digital_health_intensity

Double

Real-value

Numerical measure of digital health technology focus

digital_health_intensity_level

String

very low, low, medium, high, very high

Categorical classification of digital health intensity

digital_health_keywords

String

Real-value

Keywords related to digital health technologies

Energy Technology

Variable
Data Type
Values
Description

energy_intensity

Double

Real-value

Numerical measure of energy technology focus

energy_intensity_level

String

very low, low, medium, high, very high

Categorical classification of energy technology intensity

energy_keywords

String

Real-value

Keywords related to energy technologies

Mobility Technology

Variable
Data Type
Values
Description

mobility_intensity

Double

Real-value

Numerical measure of mobility technology focus

mobility_intensity_level

String

very low, low, medium, high, very high

Categorical classification of mobility technology intensity

mobility_keywords

String

Real-value

Keywords related to mobility technologies

Sustainability

Variable
Data Type
Values
Description

sustainability_intensity

Double

Real-value

Numerical measure of sustainability focus

sustainability_intensity_level

String

very low, low, medium, high, very high

Categorical classification of sustainability intensity

sustainability_keywords

String

Real-value

Keywords related to sustainability practices


Sustainable Development Goals (SDG)

Variable
Data Type
Values
Description

sdg1_intensity

Double

Real-value

Intensity score for SDG 1: No Poverty

sdg1_intensity_level

String

very low, low, medium, high, very high

Categorical level for SDG 1 focus

sdg1_keywords

String

Real-value

Keywords related to poverty reduction efforts

sdg2_intensity

Double

Real-value

Intensity score for SDG 2: Zero Hunger

sdg2_intensity_level

String

very low, low, medium, high, very high

Categorical level for SDG 2 focus

sdg2_keywords

String

Real-value

Keywords related to hunger and food security

sdg3_intensity

Double

Real-value

Intensity score for SDG 3: Good Health and Well-being

sdg3_intensity_level

String

very low, low, medium, high, very high

Categorical level for SDG 3 focus

sdg3_keywords

String

Real-value

Keywords related to health and well-being initiatives


Scoring and Probability Measures

Variable
Data Type
Values
Description

cultural_score

Integer

Real-value

Numerical score measuring cultural aspects

cultural_score_category

String

very low, low, medium, high, very high

Categorical classification of cultural score

leisure_score

Integer

Real-value

Numerical score related to leisure industry involvement

leisure_score_category

String

very low, low, medium, high, very high

Categorical classification of leisure score

recreational_score

Double

Real-value

Numerical score for recreational activities/services

recreational_score_category

String

very low, low, medium, high, very high

Categorical classification of recreational score

transport_score_category

String

very low, low, medium, high, very high

Categorical score for transportation-related activities

innoprob_innovator_probability

String

very low, low, medium, high, very high

Probability assessment of being an innovator

social_innoprob_innovator_probability

String

very low, low, medium, high, very high

Probability of being a social innovator

news_probability

String

very low probability, low probability, medium probability, high probability, very high probability

Likelihood of appearing in news/media

retailer_probability

String

very low probability, low probability, medium probability, high probability, very high probability

Probability of being classified as a retailer

Company Structure and Team

Structure Information

Variable
Data Type
Values
Description

structure.name

String

Real-value

Name of organizational structure/division

structure.type

String

Real-value

Type of organizational structure

structure.description

String

Real-value

Description of the organizational structure

structure.location

String

Real-value

Location of the structure/division

structure.contact

String

Real-value

Contact information for the structure

Team Information

Variable
Data Type
Values
Description

team.name

String

Real-value

Team member name

team.position

String

Real-value

Position/role of team member

team.contact

String

Real-value

Contact information for team member

team.cv

String

Real-value

Curriculum vitae or profile information

Products and Services

Variable
Data Type
Values
Description

products.name

String

Real-value

Name of product or service

products.type

String

Service, Product, Software, Subscription, Book, Event, Course, Program, Food, Game, Other

Classification of product/service type

products.main_features

String

Real-value

Key features or characteristics of the product

products.pricing

String

Free, and others (real-value)

Pricing information for the product/service

Partnerships

Variable
Data Type
Values
Description

partnerships.entity_name

String

Real-value

Name of partner organization

partnerships.relationship_type

String

Cooperation, Customer, Supplier, Affiliate, Investor, Regulator, Other, Sponsor, Partner, Distributor

Type of business relationship

Contact Information

Variable
Data Type
Values
Description

main_contact_mail

String

Real-value

Primary email address

main_contact_number

String

Real-value

Primary phone number

all_mails

String

Real-value

All associated email addresses

all_phones

String

Real-value

All associated phone numbers

Technical and Content Data

Variable
Data Type
Values
Description

techstack

String

Real-value

Technology stack used by the company

description

String

Real-value

General description of the company

summary

String

Real-value

Executive summary of the organization

summary_keywords

String

Real-value

Keywords extracted from the summary

keywords

String

Real-value

General keywords associated with the company

text

String

Real-value

Full text content related to the organization

title

String

Real-value

Title or headline associated with the entry

links

String

Real-value

Web links associated with the company

Data Quality Notes

  • Real-value: Indicates that the field can contain any text or numerical value within the data type constraints

  • Boolean fields: Use 0/1 or true/false depending on the specific variable

  • Intensity measures: Higher numerical values typically indicate greater intensity or involvement

  • Categorical levels: Five-point scale from "very low" to "very high" for most categorical variables

  • Probability measures: Express likelihood using standardized probability categories

Usage Guidelines

  1. Missing Values: Fields marked as "Real-value" may contain empty or null values

  2. Data Validation: Boolean fields should be validated for proper 0/1 or true/false values

  3. Geographic Hierarchy: Geographic variables follow administrative hierarchy from continent to municipality

  4. Technology Intensity: Both numerical and categorical versions are provided for flexibility in analysis

  5. Nested Fields: Variables with dots (e.g., team.name) represent nested or structured data fields

Last updated

Was this helpful?