Frequently Asked Questions


Where does the data come from?

The base data source for the Salary Database is fed from the U.S. Bureau of Labor Statistic’s Occupational Employment and Wage Statistics data. Proprietary analytics and predictive modeling techniques are applied to cleanse, normalize, and fill in data gaps.

What variables are included in the data modeling?

The data model includes over 60 variables, including Job Name, Location, Industry, Year, etc. to build an accurate data model.

How many jobs are covered in the salary database?

Over 800 unique job names are included in the salary database. More information is available within the Job Name Definitions guide.

How many industries are covered in the salary database?

The salary database allows for 22 different industry selections. Additionally, the underlying dataset is broken out further by over 250 unique industries. More information is available within the Industry Definitions guide.

How many locations are covered in the salary database?

The salary database allows for up to 600 different location selections. Locations can be selected at the country level (US) or by state/territory. Additionally, the Locations Database provides additional detail into wages by cities and nonmetropolitan areas by state/territory. More information is available within the Locations Guide.

Are benefits included in the rates represented in the salary database?

The rates represent wages and salaries only, and do not include nonproduction bonuses or employer costs of nonwage benefits, such as health insurance or employer contributions to retirement plans.

What could be drive variances between my rate and the rates in the database?

Various factors could be driving variances between an individual rate and the market rates available in the database. Examples of factors that are not fully accounted for in the salary database include years of experience, personal performance, specific expertise, regulatory requirements, gender, and education level.

Where is the data hosted?

The transformed dataset is hosted in a MySQL database to allow for complex data connections. Additional predictive modeling techniques are applied through various products (e.g., RapidMiner, Azure Machine Learning, etc.).

Are any APIs or data feeds available?

Currently, no APIs are offered to connect to the database. Data exports can be arranged. Please use the Contact Us form to submit an inquiry.