Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
With this functionality you can download your generated files.
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
How to reach the webAI Platform and what to do after your first login.
You can access the webAI platform via the web address: https://dashboard.istari.ai
Your login details were sent to the email address (which is also your Username
) you provided when you subscribed. Use this login data for your first login.
After your first login you will be asked to set a new password. The password must meet the following criteria:
At least one capital letter
At least one lower case letter
At least one number
At least one special character
If you have troubles logging in or you forgot your password, please contact our support via: support@istari.ai
An overview of the platform and its functions.
The webAI Platform allows users to create personalized company lists, which can then be processed in use case-specific modules. To do this, the user first uses the Company List Creator to retrieve specific companies from the webAI company database and save them in a personal company list. This company list can then be further processed in one or more modules. Examples of further processing are the download of contact data for companies, the creation of market reports or the creation of maps based on the company lists.
After logging in, you should see the following layout (please note that we are constantly improving the webAI platform and your layout may differ slightly from the one shown here). We will now go through the individual elements of the platform: The side panel, the main panel and the credit indicator.
On the left-hand side, you can see the expanded side panel. This panel is available at all times and offers you quick access to the platform's most important functions. You are currently in the Company List Creator
, which you can see from the fact that this element is highlighted in gray.
You can hide the side panel at any time by clicking the X at the top right of the side panel. The side panel is then hidden, but can be shown again at any time by clicking on the arrow >
at the top left.
We will go into the side panel elements Company List Creator
, Modules
, My files
and Credits
in detail later. The side panel also allows you to activate "dark mode" by clicking on the 🌛moon symbol, or if you are in "dark mode", you can activate "light mode" by clicking on the ☀️ sun symbol.
Click on Help
to open a menu where you can access the documentation you are currently reading via Documentation
. Account Management
takes you to an external page where you can view and change your active subscriptions and download your invoices. Contact Us
allows you to send an email to our support team.
A click on the Log out
button logs you out from the system.
In the main panel you will see your active module (we will go into the individual modules later and present them in detail). After logging in to the platform, the Company List Creator is always displayed first as the active module.
At the top right of the main window you can see a button that shows you how many webAI Credits you have available. We will go into webAI Credits in more detail in the next section. Clicking on this button opens a menu which includes a button Buy webAI Credits
to buy new credits. Account Management
takes you to your account (just like the button in the side panel described above). Log out
logs you out of the system.
What modules are and what they can be used for.
The modules on the webAI platform offer specific functions that allow you to process the company lists you have created.
You can access the available modules, via the Modules
section in the side panel. Please note that not every module is available for all subscriptions. You can load the corresponding module by clicking Load Module
.
When you load a module, you will see the Load Company List
drop-down menu at the top. By default, the last search you carried out ("Last Search") is already loaded there. Select the list you want to work on from the drop-down menu and then click on Load Company List
to load it.
Below the menu you will see how many companies are in the currently loaded list.
In our webAI Platform documentation you can find everything about ISTARI.AI data and the use of the webAI Platform
On the following pages we will introduce you to the use of the webAI Platform, show you the functions, available data and explain everything you need to know.
Just click on the next section Platform login below and we will guide you step by step through the usage of the webAI Platform.
If you need a kick-start, then take a look at our sample use cases, which show you how you can use webAI using hands-on examples.
The starting point for most workflows on the webAI Platform. Allows you to compile custom company lists for further processing.
The Company List Creator can be used to compile individual company lists using a mixture of keyword-based searches, geography filters and webAI indicators. When you first open the creator, you see a minimalist view in which most of the available filters are hidden for a better overview.
In order to start searching for companies, you can, for example, enter a keyword in the search bar that matches the companies you are looking for. This could be, for example, a specific technology mentioned by the companies on their websites. Press enter on your keyboard or use the arrow to confirm your input.
Alternatively, you can also click on one of the popular keywords that are suggested to you.
Another option is to activate the keyword generator to automatically generate suitable keywords and keyword combinations based on your own entries. Please note that this function consumes the displayed credits each time it is activated.
As soon as a filter is active, this is displayed as shown below.
If any filters are active and at least one company has been found based on these filters, the results view appears. The number of companies found is displayed at the top. In the Dataframe
view, which is activated by default, you will also see a map on the left with the locations of the companies found. On the right, you can see the results in tabular form, whereby each company found corresponds to a row.
Please note that the results view only shows a small selection of the companies actually found (preview). For the complete data set, you must use the Company List Downloader module, which will be explained later.
The columns displayed within the table can be customized using the "Select Columns" drop-down menu. Please note that different columns (indicators) may be available to you depending on your active subscription.
As an alternative result view, the card view can opened by clicking on Companies
, in which a selection of the companies found is displayed. Click on the individual cards to open a detailed view of the company in question.
Click on Show Filters
to open the filter detail view.
On the left-hand side you can see the three available keyword filters:
Optional: At least 1 of the keywords listed here must be named.
Mandatory: All keywords listed here must be specified.
Blacklist: None of the keywords listed here may be mentioned.
In the example below, we are looking for all companies that mention "packaging machine" and at least one additional keyword from "producer" and "manufacturing". We also use a * wildcard as a blacklist keyword. This wildcard in "consult*" allows us to filter words such as "consulting" or "consultants" with just one keyword.
You can remove keywords individually by activating the x next to the keyword. Each change in the filters immediately updates the hits displayed.
Please note that the keyword search is language-specific. For example, you must enter the German word "Verpackungsmaschine" if you want to search for it, as "packaging machine" will probably not return the desired results for company websites in German.
Alternatively, you can also activate the slider for "webAI Multilingual Search", which automatically translates your keywords into over 20 relevant languages in the background and uses them for the search. Please note that the multilingual search is a function for which the displayed webAI credits must be used. The multilingual search also takes a little longer.
On the right-hand side of the detailed filter view, at the top, you find the filters with which companies can be filtered according to their location. With the All Locations
search bar, you can search for specific regions and use them as filters. After clicking on Set Locations
, the selected locations are applied as filters.
Under the All Locations
filter you will find the filters for the administrative units. Select the different administrative levels one after the other to use one or more regions as a filter. In the screenshot below, for example, "United States of America" has been selected as the Country
filter, which activates the US states in the State
drop-down menu, which in turn can be selected as further filters.
Under the geo filters you can find a number of other filters with which you can, for example, exclude all companies for which no contact information is available, which are retailers or pure news providers.
You can use the TechStack filter to find companies that have integrated certain website technologies on their websites. It is best to take a look at the exact description of the indicator. This filter works just like a keyword filter and you can simply enter the names of the relevant technologies and the results will be updated to show that the companies found must have integrated at least one of the specified technologies.
A click on Show Advanced Filters
reveals another set of filters that are only available to users of certain webAI plans. Here, companies can be searched for specifically using advanced webAI indicators. In the example below, only companies with a link to the istari.ai website, an AI Intensity Level of "high" or "very high" and a Sustainability Intensity of "low" or "very low" are displayed as hits.
You can save the filters you have set and the company list you have created (companies that match your filters) by entering a name that has not yet been used and clicking Save Company List
. Saved lists can be selected via the drop-down menu next to Load Company List
and loaded using the same button. Be careful, the currently set filters will be overwritten.
Click on Clear Filters
to reset all filters and search results.
Our webAI similarity search allows you to find companies that are similar to an input reference company.
The reference company you select will be the basis for the similarity search. WebAI will scan the database for other companies with similar website contents. The similarity between two company websites is determined by the overall nature of their texts respectively the content of these texts. Language may play a role, but in general webAI has no problem recognizing the French website of a mechanical engineering company as similar to the German website of another mechanical engineering company.
WebAI calculates a similarity score for each company in the database and the reference company you enter. The score will be close to 1.0 for very similar companies and close to 0.0 for very dissimilar companies. The minimum similarity score that must be reached in order to classify another company as “similar enough” can be set using the “Minimum Similarity Score” slider. Only companies that achieve at least the specified score will appear in the results list. A good starting point is 0.5.
Note:
WebAI only displays the 100 most similar companies in the results list if at least 100 companies achieve the set similarity score. If fewer companies achieve the minimum value, fewer companies will be displayed. If no companies are found at all, it is recommended to set the threshold value lower.
Enter the website address of the company you want to use as a reference for the similarity search. You can enter the website address in the usual formats, such as https://example.com or www.example.com or example.com. Then confirm your entry with the Enter key.
Important: Only websites that are already in our database can be used as a reference.
If the entered reference company is available in our database, you will see a green confirmation text box and your reference company has been set successfully. In order to switch to another reference company, click Change Similarity Search
.
In case the entered reference company is not available in our database, you will see a red warning text. Please select a different reference company.
In order to update your search results after entering a valid reference company, please click the Apply Changes
button in the main view.
Note: You can combine the similarity search with any of the other available filters like keywords or location filters to further narrow down and refine your search results.
With this module you can download the company lists you have created.
In this module, a selection of the companies you have loaded is displayed as a table. Similar to Company List Creator, not all hits are displayed here, but only a preview. This preview shows you what the table you can download with this module will look like in the end.
Above the table, as in all other modules, you can see the currently loaded company list and how many companies it contains.
If you want, you can customize the columns of your table by selecting the columns relevant to you from the Select Columns
drop-down menu. You can remove selected columns at any time by clicking on x
next to the column name. Please note that some columns are only available for certain webAI subscriptions.
You can also customize the format of your table using the Output type
drop-down menu.
If you are satisfied with your list, you can now download the complete list with all company observations. The costs for this are displayed below the table (1 webAI credit per company). Please ✅ confirm that you have understood the costs involved, which will then activate the Download
button.
After clicking on Download
, the download of your data is initialized. Successful processing is indicated by the display of a confirmation message below the button. You will then receive a confirmation email with the direct download link for your company list within a few minutes (the more companies you want to download, the longer the processing will take). This download link is valid for a few days. Your download will also appear in the My Files
section of the side panel.
If something goes wrong, the credits you have used will be refunded immediately. Please try again. If the problem persists, please contact our support team.
Our Keyword Generator allows you to create additional keywords related to your input keyword.
In our webAI, there is also an option to automatically generate keywords for your queries. The "webAI Keyword Generator" is designed to help you create relevant keywords based on an initial keyword you provide. The generated keywords will make your searches more effective and targeted. You can activate the Keyword Generator by just clicking the option above the search bar.
To generate keywords fitting to your search criteria, you should follow these steps:
Enter a keyword: You start by entering a keyword related to your area of interest.
Generate keywords: The app uses advanced AI to generate a list of relevant keywords based on your input.
Receive results: You will receive a list of new, highly relevant keywords to use for your searches.
Important: Each keyword generation costs 0.1 credits.
Let's say you are interested in companies that work with 3D printing. You can just type in "3D printing" into search bar and our webAI will generate other related keywords. Once you submit your keyword, the Keyword Generator will process your request and display a list of new keywords that are highly relevant to your original keyword. These keywords can help refine your searches and improve your research. They appear under "Keyword filters", where you can manually check them, delete words or add additional search terms.
Be specific: The more specific your initial keyword is, the more targeted the generated keywords will be.
Experiment with variations: Try different variations of your keyword to get a broader range of related terms.
Use generated keywords: Incorporate the generated keywords into your search queries to find more relevant information.
No keywords were generated: Ensure you have entered a valid keyword in the input field. If the problem persists, try refreshing the page and entering the keyword again.
Slow response: If the app takes longer than usual to generate keywords, it might be due to high server load. Please wait a few moments and try again.
Ask free-form questions about the companies in your list.
The Q&A module of webAI enables unprecedented interactivity with our data. Once you have selected your companies of interest, you can simply ask a question and receive an answer for each company.
You can find the Q&A module under the "Modules", which you can access via the left toolbar of the platform. Click on "Load Module" to access it.
Before posing your question, you need to load your pre-selected firm dataset. For this, select your previous searches from the dropdown bar and the click on "Load Company List".
You can then formulate your question for our data in the editable box under "Your Question". There are no restrictions on the content of your questions, but the answer will always be based on the information we have about the company in question. Here are a few sample questions you could use:
What products does the company offer? Please give me back the product names and categories as a bullet point list.
Does the company specify measures to reduce its own greenhouse gas emissions? Answer with YES or NO. If YES, then describe in keywords which measures are mentioned.
What is the address of the company?
Who is the managing director of the company?
Note: The answers are based on the website content of the companies analyzed. If a company does not provide any information on your question, webAI cannot answer it. For example, information on the last two questions mentioned above (address and managing director) would be more likely to be found on the websites of European companies than American companies, because there is an imprint obligation in Europe where the relevant information must be provided.
After a couple of seconds, you will receive answers for four sample firms in separate boxes. This way you can see whether webAI's answers match your expectations or whether you still need to adjust the question. If our AI is not sure how to answer for a company or does not find any information, it will answer explicitly that there is no information available for this company.
Important: Note that each question costs 1 webAI Credits per company. Please also note that the analysis of a company in your list will also cost webAI Credits if webAI cannot find any information about your question during the analysis.
If you are satisfied with the results and want to ask the question to all the firms in your pre-selected list, you can click on "Start". You will see how many credits it will cost to ask the question to each firm in your selected dataset. Before starting, you need to accept that you are aware of these costs.
WebAI Credits can be used for paid functions of the webAI Platform.
WebAI Credits never expire. However, you need an active subscription to be able to use webAI Credits.
With webAI Credits you can use certain paid functions in the webAI Platform. If you do not have enough credits for a certain function, you will not be able to perform this function and will have to buy additional webAI credits first (see below).
If a function is subject to a charge, this will always be clearly displayed. So you don't have to worry about accidentally using up credits. For larger amounts, you must actively confirm that you are ready to use webAI Credits.
For very small amounts, you will instead be shown a coin symbol (please note that the appearance of the symbol may vary depending on your device) in the corresponding button indicating that you will use (i.e. consume) the displayed amount of webAI Credits with each click.
You can buy additional webAI Credits at any time, which will then be added to your credit balance immediately. To do this, either click on webAI Credits
in the side panel or on Buy webAI Credits
in the webAI Credit Indicator menu. Both will take you to the webAI Credits overview, where you can see the available webAI Credit packages.
Depending on the package you choose, you will be rewarded a certain number of bonus credits. For example, if you go for the popular "10,000 webAI Credits" package, you will receive 1,000 bonus credits after checkout, giving you a total of 11,000 webAI credits.
You can access the similarity filter by clicking the Similarity Filters
button in the view. This will open up the options menu where you can select your reference company and adjust the the Minimum Similarity Score
.
Analyzing your list can take quite a while if it contains many companies. In the meantime, you can continue working on the webAI platform or go for a coffee. We will notify you by e-mail as soon as your results are available for download. You can then download it from .
When you sign up for a subscription, you will be allocated a certain amount of webAI Credits (you can find out the exact number in the section). Your currently available credits are displayed at all times in the at the top right of the main panel.
Each company is geocoded to the exact house number level and assigned to administrative units.
Availability: webAI Lead, webAI Insights
In order to be able to assign the companies to a geographical region, we use a hierarchical structure inspired by the NUTS system of the European Union. This is structure is comparable across countries. However, in some world regions, the more fine-grained categories do not exist.
See also our coverage (number of companies per country).
country
Country
Germany
state
Regions, federal states, provinces or territories.
Baden-Württemberg
region
Departments, districts, metropolitan statistical areas.
Rhein-Neckar-Kreis
district
Counties, districts, cities.
Sandhausen
Explore company networks.
Classify companies with your custom-defined categories.
The Classifier module of webAI enables precise categorisation of text data from selected companies based on custom-defined categories and their descriptions.
You can find the Classifier module under "Modules", which you can access via the left toolbar of the platform. Click on "Load Module" to open it.
Before posing your categories and descriptions, you need to load your pre-selected firm dataset. For this, select your previous searches from the dropdown bar and then click on "Load Company List".
You can then formulate the categories for your data in the editable box under "Category Name". Every category needs a description for further specification, which you can input in the box under "Category Description". After defining your categories, click on "Add Category to Category Collection" to add them to the list. You can add up to 10 categories.
💡 Tip: Use the editable box "Name your category collection." and click on "Save" to save the categories for further analysis.
There are no restrictions on the content of your categories and description, but the answer will always be based on the information we have about the company in question. Here are a few sample categories in the field of technology you could use:
Artifical Intelligence
Description: This company develops AI solutions for various industries.
Natural Language Processing
Description: NLP is a branch of AI focused on enabling computers to understand, interpret, and generate human language. This includes tasks like sentiment analysis, named entity recognition, machine translation, and dialogue systems.
IT Security
Description: The company reports that itself is ISO/IEC 27001 certified.
Note: The classifications are based on the website content of the companies analysed. If a company does not provide any information on your categories, webAI cannot answer it. For example, information on the example categories mentioned above would be more likely to be found on the websites of IT companies than on construction companies.
After a couple of seconds, you will receive the relevant categories with a short summary for four sample firms in separate boxes. This way you can see whether webAI's classification matches your expectations or whether you still need to adjust the categories. If our AI is not sure how to classify a company or does not find any information, it will state explicitly that there is no information available for this company.
Important: Note that each classification costs 1 webAI Credits per company. Please also note that the analysis of a company in your list will also cost webAI Credits if webAI cannot find any information about your categories during the analysis.
If you are satisfied with the results and want to classify all the firms in your pre-selected list, you can click on "Start". You will see again which categories you are classifying for and how many credits it will cost to classify all companies in your selected dataset. Before starting, you need to accept that you are aware of these costs.
Analysing your list can take a while if it contains many companies. In the meantime, you can continue working on the webAI platform or go for a coffee. We will notify you by email as soon as your results are available for download. You can then download them from My Files.
Find companies that are engaged in artificial intelligence.
Find companies that are engaged in blockchain.
Find companies that are engaged in additive manufacturing.
A brief overview of the available company information.
Our central identifier for a company is the web address (domain). If several entries in a commercial register share a domain (e.g. in the case of companies from the same group of companies), we identify the company's headquarters and only keep this observation.
GDPR compliant phone numbers and e-mail addresses
ISTARI.AI uses GDPR-compliant approaches to collect, process and make available the publicly available contact information (i.e. self-published information) of companies. The following information is available:
Find companies that are engaged in digital health.
Find companies that are engaged in the energy sector.
Each line in the table represents a company. Even when we talk about companies, we also mean other economic players such as universities, research institutes and associations. Our data is sourced from national commercial registers and other public sources. See for the number of companies per country.
Note that not all of these columns are shown in the dashboard. However, you can extract all of the information when you our data.
domain
The main URL of the respective company. It is used as a unique identifier in our database.
domain_redirect
True if the domain automatically redirects to another webpage.
country
Country (NUTS-0) where the firm is located. Official name (e.g. "Deutschland").
country_english
English name of country (e.g . "Germany")
country_code
ISO code of the country.
state
State (NUTS-1) within the country.
state_code
NUTS code representing the state.
region
Region (NUTS-2) within the state.
region_code
NUTS code representing the region.
district
District or locality (NUTS-3) within the region.
district_code
Code representing the district.
title
Title provided in the HTML head of the company website.
keywords
Keywords provided in the HTML head of the company website.
description
Description provided in the HTML head of the company website.
main_contact_mail
Main contact e-mail address.
all_mails
All e-mail addresses found on the respective website.
main_contact_number
Main contact phone number.
all_phones
All phone numbers found on the respective website.
all_linkedin_profiles
All links to LinkedIn user profiles found on the respective website.
all_linkedin_companies
All links to LinkedIn company profiles found on the respective website.
all_twitter_links
All links to X/Twitter accounts found on the respective website.
all_facebook_links
All links to Facebook profiles found on the respective website.
additive_manufacturing_intensity
Numerical engagement level of the firm in additive manufacturing.
additive_manufacturing_intensity_level
Engagement level of the firm in additive manufacturing (very low, low, medium, high, very high).
additive_manufacturing_keywords
Keywords potentially related to additive manufacturing on the respective website.
additive_manufacturing_keywords_hits
Number of individual hits per keyword from additive_manufacturing_keywords
column on the respective website.
additive_manufacturing_total_hits
Total number of additive_manufacturing_keywords
hits on the respective website.
ai_intensity
Numerical engagement level of the firm in artificial intelligence.
ai_intensity_level
Engagement level of the firm in artificial intelligence (very low, low, medium, high, very high).
ai_keywords
Keywords potentially related to artificial intelligence on the respective website.
ai_keywords_hits
Number of individual hits per keyword from ai_keywords
column on the respective website.
ai_total_hits
Total number of ai_keywords
hits on the respective website.
blockchain_intensity
Numerical engagement level of the firm in blockchain technology.
blockchain_intensity_level
Engagement level of the firm in blockchain technology (very low, low, medium, high, very high).
blockchain_keywords
Keywords potentially related to blockchain technology on the respective website.
blockchain_keywords_hits
Number of individual hits per keyword from blockchain_keywords
column on the respective website.
blockchain_total_hits
Total number of blockchain_keywords
hits on the respective website.
digital_health_intensity
Numerical engagement level of the firm in digital health technologies
digital_health_intensity_level
Engagement level of the firm in digital health technologies (very low, low, medium, high, very high).
digital_health_keywords
Keywords potentially related to digital health technologies on the respective website.
digital_health_keywords_hits
Number of individual hits per keyword from digital_health_keywords
column on the respective website.
digital_health_total_hits
Total number of digital_health_keywords
hits on the respective website.
domain_provider_true
True if the firm is actually just a domain provider.
news_probability
Low
, medium
, high
or very high
probability that the website is a news site (e.g. newspaper, blog, journal, news ticket).
energy_intensity
Numerical engagement level of the firm in energy technologies.
energy_intensity_level
Engagement level of the firm in energy technologies (very low, low, medium, high, very high).
energy_keywords
Keywords potentially related to energy technologies on the respective website.
energy_keywords_hits
Number of individual hits per keyword from energy_keywords
column on the respective website.
energy_total_hits
Total number of energy_keywords
hits on the respective website.
mobility_intensity
Numerical engagement level of the firm in mobility technologies.
mobility_intensity_level
Engagement level of the firm in mobility technologies (very low, low, medium, high, very high).
mobility_keywords
Keywords potentially related to mobility technologies on the respective website.
mobility_keywords_hits
Number of individual hits per keyword from mobility_keywords
column on the respective website.
mobility_total_hits
Total number of mobility_keywords
hits on the respective website.
sustainability_intensity
Numerical engagement level of the firm in ecological sustainability.
sustainability_intensity_level
Engagement level of the firm in ecological sustainability (very low, low, medium, high, very high).
sustainability_keywords
Keywords potentially related to ecological sustainability on the respective website.
sustainability_keywords_hits
Number of individual hits per keyword from sustainability_keywords
column on the respective website.
sustainability_total_hits
Total number of sustainability_keywords
hits on the respective website.
innoprob
Numerical probability of the company to be an innovator (Innoprob score). Only available in German-speaking countries.
innoprob_innovator_probability
Probability of the company to be an innovator (Innoprob score) (very low, low, medium, high, very high). Only available in German-speaking countries.
social_innoprob
Numerical probability of the company to be an innovator (Social Innoprob score). Only available in German-speaking countries.
social_innoprob_innovator_probability
Probability of the company to be a social innovator (Social Innoprob score) (very low, low, medium, high, very high). Only available in German-speaking countries.
retailer_probability
Probability of being a retailer.
leisure_score
Number of leisure activity facilities at the company location.
recreational_score
Number of recreational activity facilities at the company location.
transport_score
Public transport capacity at the company location.
cultural_score
Number of cultural activity facilities at the company location.
precipitation
Amount of precipitation at firm location.
precipitation_category
Categorised amount of precipitation at firm location (very low, low, medium, high, very high).
temperature
Average temperature [°C] at firm location.
temperature_category
Categorised average temperature [°C] at firm location (very low, low, medium, high, very high).
SO2
Average concentration of sulfur dioxide at firm location.
SO2_category
Categorised average concentration of sulfur dioxide at firm location (very low, low, medium, high, very high).
NO2
Average concentration of nitrogen dioxide at firm location.
NO2_category
Categorised average concentration of nitrogen dioxide at firm location (very low, low, medium, high, very high).
aerosol
Average concentration of aerosols at firm location.
aerosol_category
Categorised average concentration of aerosols at firm location (very low, low, medium, high, very high).
CH4
Average concentration of methane at firm location.
CH4_category
Categorised average concentration of methane at firm location (very low, low, medium, high, very high).
CO
Average concentration of carbon monoxide at firm location.
CO_category
Categorised average concentration of carbon monoxide at firm location (very low, low, medium, high, very high).
HCHO
Average concentration of formaldehyde at firm location.
HCHO_category
Categorised average concentration of formaldehyde at firm location (very low, low, medium, high, very high).
O3
Average concentration of ozone at firm location.
O3_category
Categorised average concentration of ozone at firm location (very low, low, medium, high, very high).
air_quality_indicator
Sum of normalised air pollutant concentrations at firm location.
air_quality_indicator_category
Categorised air quality indicator (very low, low, medium, high, very high).
links
Incoming and outgoing hyperlinks to and from the respective website.
links_count
Number of incoming and outgoing hyperlinks to and from the respective website.
outgoing_links
Outgoing hyperlinks from the respective website.
outgoing_links_count
Number of outgoing hyperlinks from the respective website.
incoming_links
List of incoming hyperlinks to the respective website.
incoming_links_count
Count of incoming hyperlinks to the respective website.
techstack
List of website technologies used on the website.
new_register_entry
The value in this column is True
if the commercial register entry for this observation is less than 5 years old.
geolocation
Point coordinates of firm location in well-known text (EPSG: 4326) format.
lon
Longitude of firm location (EPSG: 4326)
lat
Latitude of firm location (EPSG: 4326)
main_contact_mail
Main contact e-mail address.
all_mails
All e-mail addresses found on the respective website.
main_contact_number
Main contact phone number.
all_phones
All phone numbers found on the respective website.
Find social media presences of companies
Many companies have presences on various social media platforms (e.g. LinkedIn, Twitter/X). We collect the links on the respective company websites that lead to these social media accounts. This makes it easier for you to contact companies or even individuals.
We collect the following information:
all_linkedin_profiles: all LinkedIn user profiles that exist in the company text. These links should point to employees of the company or partners.
all_linkedin_companies: In LinkedIn, we can separate between a company profile and a “user” profile. This indicator collects the company profile links. This should point to the company itself or related companies, e.g. partners.
all_twitter_links: This indicator collects the links to Twitter/X profiles.
all_facebook_links: This column includes all links to Facebook profiles.
Find companies that are engaged in the mobility sector.
Company information derived from satellite data.
Remote sensing data has many areas of application. At ISTARI.AI, we derive novel indicators at company level from satellite data in order to assess the environmental quality of a company location. To do this, we draw on freely available data from the Copernicus programme of the European Space Agency (ESA).
Our focus to date has been on assessing air quality. As part of the Sentinel-5P satellite, the TROPOMI (Tropospheric Monitoring Instrument) sensor was developed, which can estimate the concentration of air pollutants in a defined air column with an unprecedented spatial resolution. Since 2017, this satellite has been measuring the concentration of sulphur dioxide, methane, carbon monoxide and other pollutants worldwide on a daily basis. We use this data, aggregated over a one-year time series, and allocate it to our company locations. We also created a joint indicator that combines various air pollutant measurements into one categorisation of air quality at a firm location.
Additionally, we provide climate date for each firm location. For this, we have added the average temperature and precipitation to our database.
We have already published the study "Greenwashing in the US metal industry? A novel approach combining SO2 concentrations from satellite data, a plant-level firm database and web text mining" that uses this data. Check it out .
Explore location factors of companies.
All companies on the webAI platform are precisely located at house number level (geocoded). Based on this precise location information, the companies are assigned micro-geographical location factors.
Find companies with standards and certifications.
Find socially innovative companies.
Find innovative companies.
This indicator is available for companies based in Germany and Austria only.
The InnoProb scores were also validated by comparing them with patent data from the European Patent Office, with projections based on survey data from the Mannheim and Berlin Innovation Panels, and with regional innovation indicators from the German Federal Statistical Office. The corresponding study was published in the international journal PLOS ONE.
Find companies that are engaged in sustainability.
Find companies that are engaged in the Sustainable Development Goal 3 - "Good Health and Well-Being"
Find companies that are engaged in the Sustainable Development Goal 2 - "Zero Hunger"
Find companies that are engaged in the Sustainable Development Goal 1 - "No Poverty"
An example use case where we search for hotels based in France that use specific web technologies.
Searching France-based hotel websites with contact forms for a mailing campaign
Imagine you are a startup that has developed a new AI tool for communication specifically between hotels and customers. The product is ready and you are about to win customers. You and your team have chosen France as your first target country. You need a list of hotels in France that have a website and already provide contact forms on the website (oldschool...exactly the customers who need your AI tool).
Let's take a look at how you can compile and download a list of such leads in a few minutes so that you can then import them into the CRM system of your choice.
We start simple and enter the search term "hotel" in the search bar and press enter.
After a few seconds, we have the result: almost 978,570 companies mention this word on their website. The map shows the distribution of these firms, while the table on the right includes a preview.
After clicking Show filters
, you can easily filter our firm database based on various criteria. Using the Country
drop-down menu, we can search for "France" and select it. The preview below the search filters indicates that this filter will reduce your search to 29,839 hits.
As we are looking for hotels specifically in France, it can be assumed that many of the websites will be exclusively in French. It therefore makes sense to add further search terms in the national language. Chain the individual search terms by just adding an optional keyword and pressing enter, so that a hit is generated if at least one of the terms occurs. We therefore found a good 15,000 additional hits with the French synonyms.
Since you are planning an email campaign, you will need an email address for each company. You can easily exclude those companies without an email by clicking Exclude companies without Email
. You can do the same for companies without a phone number.
You can also use the search bar to exclude companies that mention certain keywords on their websites. Suppose your product is not suitable for campgrounds: You could simply filter these using a blacklisted keyword. Just type "camping" into Add blacklisted keyword
and press enter.
WebAI analyzes the technical framework of each individual company website and maps the used web technologies in the TechStack indicator. Here you will find an extensive list of technologies you can query.
To filter for a specific technology from the TechStack, simply use the "techstack" search bar in the filters. You can also query for the supercategories, e.g. "contact form". We will then search for all companies whose techstack column contains the word "contact form" with any characters before and after it.
We confirm our query by pressing the Enter key et voilà we have found 2,655 companies that match our search criteria: Hotels from France that have contact forms on their websites and have an email address. Now all you have to do is download the data.
All you have to do now is click on the Download data
button below the preview. Before receiving the file, you will see how many credits this will cost you and confirm that you want to download the data.
After downloading, you can open the downloaded file with Excel, for example, or import it directly into the CMS.
All current and past changes and patch notes.
An example use case where we search for firms from the energy sector that produce heat pumps and that are based in the German State of Baden-Württemberg.
Searching Hesse-based firms that produce heat pumps.
Imagine you are a public institution that wants to find all companies in a specific region that are producing heat pumps. You and your team have chosen the German state Baden-Württemberg as your first target region. Because you know that companies that sell solar panels often mention heat pumps but don’t produce them themselves you want to exclude these companies. We will search for German companies therefore we also use German words.
Let's take a look at how you can compile and download a list of such leads in a few minutes so that you can analyse and contact them immediately.
We start with entering the following search term "wärmepumpe" and "hersteller" and not "solar" in the filters and press enter. After a few seconds, we have the result: almost 4,400 companies mention this combination of words on the website.
To make sure that we find companies that engage in energy production/information we can use our AI. For this, you can select "Show advanced filters", which provides access to our ISTARI.AI intensity scores, e.g. on the topic of "energy". You can then select the category which should be included in your search, e.g. "high". After a few seconds, we have reduced our result to 1,000 companies.
Using the state
option, we can select only firms located in "Baden-Württemberg". This will reduce our search results to 111 firms.
Now, we have found all the firms that correspond to our criteria. We can inspect our results more closely on the interactive map on the left or in the preview table on the right. With this, all you have to do is download the data.
All you have to do now is click on the Download data
button below the preview. Before receiving the file, you will see how many credits this will cost you and confirm that you want to download the data.
After downloading, you can open the downloaded file with Excel, for example, or import it directly into the CMS.
Manage your subscription, payment method and download invoices.
Manage your account in the webAI Platform Control Center by logging in with the email you used to create your account.
If you lost your access to your account, please contact: support@istari.ai
An example use case in which we analyse a region for industrial policy.
Assume you are someone working for a city or district, and you want to find out:
Which companies exist in your area overall?
Which companies in your area are working on specific topics?
Which companies worldwide specialise in a specific topic for which there is a gap in your regional innovation cluster that you would like to fill?
Supporting the local economy in your area typically requires you to explore various topics and company profiles. That's why the webAI tool offers several ways to search for these topics.
You can use filters to limit your search to a certain region. In your role, you usually want to look at your local region. This region can be a single area like a city like Frankfurt or an area made up of several districts, e.g. the Metropolitan Region Rhine-Neckar around Mannheim and Heidelberg.
Identify one specific region
For a specific area, simply use the drop-down filters to find your area, like in the case of “Frankfurt”. If you do not find it, try to find the next bigger area around it (e.g. you cannot find the french city of "Montpellier" directly in the “District” filter list, but you find its supercategory “Hérault”). You can also select multiple regions.
For more specific information about the regional units, see Administrative units.
Identify a region consisting of several administrative units
In some cases, like e.g. in the case of the Metropolitan Region Rhine-Neckar around Mannheim and Heidelberg, you will have to include several local entities. In this case, you need to choose “Mannheim, Stadtkreis”, “Heidelberg, Stadtkreis”, and “Rhein-Neckar-Kreis” from the District-drop-down menu.
By text
Let’s assume you want to know which companies use the word "hydrogen" or their German, Spanish or French equivalents on their website. To get the results, just enter ”hydrogen”
or “wasserstoff”
or “hidrógeno”
or “hydrogène"
into the filters.
By indicator
Now, take the example from above and assume that you only want to see the companies that have a high or very high sustainability indicator (click here for more information). In this case, you need to add another filter under "Show advanced filters". It is sufficient to use "high", since this will also include all firms that are classified as "very high".
If we combine all of these search criteria, we can identify all 32 companies that mention "hydrogen" in Frankfurt am Main and that have a high sustainability intensity. We can inspect our results more closely on the interactive map on the left or in the preview table on the right. With this, all you have to do is download the data.
All you have to do now is click on the Download data
button below the preview. Before receiving the file, you will see how many credits this will cost you and confirm that you want to download the data.
After downloading, you can open the downloaded file with Excel, for example, or import it directly into the CMS.
Pricing and functionality overview.
You may cancel your plan any time by .
Description
Search monthly updated companies by industry, technology and product portfolio. Leverage science-based business indicators to improve your search.
Annual costs
5,000€
Included webAI Credits
12,000
Search 20 million companies
✅
Free text search
✅
Science-based indicators
✅
Thousands of tracked web technologies
✅
Telephone numbers and emails
✅
Premium support and onboarding
✅
Upgrades
Download flat Academic discount Public institution discount Upload and enrich you own data sets
Designation of the responsible body
The controller responsible for data processing on this website is:
istari.ai GmbH
Responsible persons: Dr David Lenz and Dr Jan Kinne
Julius-Hatry-Straße 1 68163 Mannheim, Germany
Email: info@istari.ai Phone: 0621/72493433
VAT: DE337056158 Local court Mannheim: HRB 733173
The controller decides alone or jointly with others on the purposes and means of processing personal data (e.g. names, contact details, etc.).
For more information, see ISTARI.AI imprint.
Compilation of scientific studies and publications with webAI data.
The following scientific publications have used data by ISTARI.AI:
2023:
Abbasiharofteh, M., Krüger, M., Kinne, J., Lenz, D., & Resch, B. (2023). The digital layer: alternative data for regional and innovation studies. Spatial Economic Analysis, 18(4), 507–529. https://doi.org/10.1080/17421772.2023.2193222
Arifi, D., Resch, B., Kinne, J., & Lenz, D. (2023). Innovation in hyperlink and social media networks: Comparing connection strategies of innovative companies in hyperlink and social media networks. PLOS ONE, 18(3), e0283372. https://doi.org/10.1371/journal.pone.0283372
Liu, C., Peng, Z., Liu, L., & Li, S. (2023). Innovation Networks of Science and Technology Firms: Evidence from China. Land, 12(7), 1283. https://doi.org/10.3390/land12071283
2022:
Schmidt, S., Kinne, J., Lautenbach, S., Blaschke, T., Lenz, D., & Resch, B. (2022). Greenwashing in the US metal industry? A novel approach combining SO2 concentrations from satellite data, a plant-level firm database and web text mining. Science of the Total Environment, 835, 155512. https://doi.org/10.1016/j.scitotenv.2022.155512
Schwierzy, J., Dehghan, R., Schmidt, S., Rodepeter, E., Stömmer, A., Uctum, K., Kinne, J., Lenz, D. & Hottenrott, H. (2022). Technology mapping using WebAI: the case of 3D printing. arXiv.org. https://arxiv.org/abs/2201.01125
2021:
Kinne, J., & Lenz, D. (2021). Predicting innovative firms using web mining and deep learning. PLOS ONE, 16(4), e0249071. https://doi.org/10.1371/journal.pone.0249071
The approximate number of organizations per country. In total we cover about 37 million entities.
AD
Andorra
190
AE
United Arab Emirates
46696
AF
Afghanistan
216
AG
Antigua and Barbuda
62
AI
Anguilla
48
AL
Albania
6011
AM
Armenia
1488
AO
Angola
1264
AQ
Antarctica
2
AR
Argentina
98896
AS
American Samoa
5
AT
Austria
344635
AU
Australia
1789144
AW
Aruba
221
AX
Aland Islands
35
AZ
Azerbaijan
1515
BA
Bosnia and Herzegovina
11014
BB
Barbados
374
BD
Bangladesh
1458
BE
Belgium
378646
BF
Burkina Faso
224
BG
Bulgaria
58036
BH
Bahrain
3737
BI
Burundi
212
BJ
Benin
374
BL
St. Barths
29
BM
Bermuda
1879
BN
Brunei Darussalam
397
BO
Bolivia
2536
BQ
Bonaire, Saint Eustatius and Saba
35
BR
Brazil
772475
BS
Bahamas
963
BT
Bhutan
113
BW
Botswana
422
BY
Belarus
34760
BZ
Belize
232
CA
Canada
679748
CC
Cocos (Keeling) Islands
1
CD
DR Congo
356
CF
Central African Republic
19
CG
Congo Republic
162
CH
Switzerland
443765
CI
Cote d'Ivoire
1149
CK
Cook Islands
18
CL
Chile
96876
CM
Cameroon
986
CN
China
5598924
CO
Colombia
93643
CR
Costa Rica
5391
CU
Cuba
552
CV
Cabo Verde
211
CW
Curacao
383
CY
Cyprus
13040
CZ
Czechia
383614
DE
Germany
3191496
DJ
Djibouti
113
DK
Denmark
276853
DM
Dominica
36
DO
Dominican Republic
9103
DZ
Algeria
2641
EC
Ecuador
9437
EE
Estonia
51644
EG
Egypt
16837
ER
Eritrea
17
ES
Spain
699562
ET
Ethiopia
960
FI
Finland
265163
FJ
Fiji
182
FK
Falkland Islands
7
FM
Micronesia, Fed. Sts.
13
FO
Faroe Islands
44
FR
France
1097862
GA
Gabon
203
GB
United Kingdom
2485814
GD
Grenada
47
GE
Georgia
11866
GF
French Guiana
66
GG
Guernsey
1070
GH
Ghana
1090
GI
Gibraltar
616
GL
Greenland
9
GM
Gambia
100
GN
Guinea
198
GP
Guadeloupe
510
GQ
Equatorial Guinea
66
GR
Greece
127799
GT
Guatemala
4098
GU
Guam
689
GW
Guinea-Bissau
15
GY
Guyana
175
HK
Hong Kong
74265
HN
Honduras
446
HR
Croatia
63880
HT
Haiti
114
HU
Hungary
181901
ID
Indonesia
64425
IE
Ireland
93762
IL
Israel
69060
IM
Isle of Man
1264
IN
India
278496
IQ
Iraq
1053
IR
Iran
8300
IS
Iceland
9739
IT
Italy
1172260
JE
Jersey
214
JM
Jamaica
4543
JO
Jordan
3320
JP
Japan
713940
KE
Kenya
12692
KG
Kyrgyz Republic
664
KH
Cambodia
2256
KI
Kiribati
5
KM
Comoros
46
KN
St. Kitts and Nevis
64
KP
North Korea
3
KR
South Korea
356980
KW
Kuwait
4891
KY
Cayman Islands
2409
KZ
Kazakhstan
20901
LA
Laos
522
LB
Lebanon
5800
LC
St. Lucia
51
LI
Liechtenstein
3056
LK
Sri Lanka
6120
LR
Liberia
159
LS
Lesotho
1195
LT
Lithuania
63032
LU
Luxembourg
26639
LV
Latvia
40010
LY
Libya
311
MA
Morocco
8784
MC
Monaco
1063
MD
Moldova
11376
ME
Montenegro
4426
MF
Saint-Martin
46
MG
Madagascar
520
MH
Marshall Islands
88
MK
North Macedonia
3547
ML
Mali
256
MM
Myanmar
608
MN
Mongolia
555
MO
Macau
221
MP
Northern Mariana Islands
23
MQ
Martinique
1019
MR
Mauritania
125
MS
Montserrat
3
MT
Malta
7425
MU
Mauritius
2427
MV
Maldives
460
MW
Malawi
354
MX
Mexico
225012
MY
Malaysia
63073
MZ
Mozambique
1009
NA
Namibia
99
NC
New Caledonia
208
NE
Niger
41
NF
Norfolk Island
1
NG
Nigeria
4537
NI
Nicaragua
783
NL
Netherlands
1789447
NO
Norway
220779
NP
Nepal
3526
NR
Nauru
3
NU
Niue
1
NZ
New Zealand
152747
OM
Oman
5364
PA
Panama
4340
PE
Peru
34147
PF
French Polynesia
56
PG
Papua New Guinea
1206
PH
Philippines
24262
PK
Pakistan
8892
PL
Poland
710324
PM
St. Pierre and Miquelon
6
PN
Pitcairn
1
PR
Puerto Rico
3803
PS
Palestine
395
PT
Portugal
134735
PW
Palau
31
PY
Paraguay
3620
QA
Qatar
3766
RE
Reunion
569
RO
Romania
93943
RS
Serbia
39003
RU
Russia
696328
RW
Rwanda
553
SA
Saudi Arabia
14151
SB
Solomon Islands
14
SC
Seychelles
322
SD
Sudan
238
SE
Sweden
387522
SG
Singapore
113021
SH
St. Helena
2
SI
Slovenia
49376
SJ
Svalbard and Jan Mayen Islands
7
SK
Slovakia
124328
SL
Sierra Leone
94
SM
San Marino
195
SN
Senegal
998
SO
Somalia
72
SR
Suriname
107
SS
South Sudan
67
ST
Sao Tome and Principe
18
SV
El Salvador
2013
SX
Sint Maarten
59
SY
Syria
257
SZ
Eswatini
205
TC
Turks and Caicos Islands
62
TD
Chad
39
TG
Togo
220
TH
Thailand
55130
TJ
Tajikistan
120
TL
Timor-Leste
27
TM
Turkmenistan
108
TN
Tunisia
1539
TO
Tonga
23
TR
Türkiye
235745
TT
Trinidad and Tobago
545
TV
Tuvalu
1
TW
Taiwan
70158
TZ
Tanzania
1793
UA
Ukraine
72245
UG
Uganda
1836
UNK
UNK
680
US
United States
8565006
UY
Uruguay
7739
UZ
Uzbekistan
2529
VA
Vatican
3
VC
St. Vincent and the Grenadines
58
VE
Venezuela
8246
VG
British Virgin Islands
347
VI
United States Virgin Islands
217
VN
Vietnam
89163
VU
Vanuatu
59
WF
Wallis and Futuna Islands
2
WS
Samoa
36
XK
Kosovo
1862
YE
Yemen
229
YT
Mayotte
91
ZA
South Africa
163445
ZM
Zambia
960
ZW
Zimbabwe
1432