Group Members
- Alina Vikhnevich
- Olivia Azevedo
- Alyssa Gurkas
- Musrat Jahan
Data Sources
- Computer and
Mathematical Occupations – Profile Data - This data source provides
detailed information on computer and mathematical occupations. In this
project, we may use the Bureau of Labor Statistics to identify
occupational codes related to data science, and relevant information
such as income.
- United Nations Standard
Products and Services Code - This data source includes information
about products and services and can be used to analyze company
expenditures.
- Projections
Central - This data source includes projections of industry and
occupational employment by state and the US. This could be used to
explore the projected outcome for certain occupations related to data
science such as data scientists, analysts, data engineers, and data
architects.
- O*Net
Database – The O*NET database outlines various information that
describe work and worker characteristics, including skill requirements
for many occupations. This data source may be used to explore skillsets,
applications, and programming languages used in occupations related to
data science.
Logical Model
The logical model represents the relationships between different
entities in the dataset, specifically focusing on skills, employment
statistics, and job market trends.
Entities and Attributes:
Jobs (job_id, title,
SOC_code, industry,
median_salary, projected_growth)
Skills (skill_id,
name, category, importance,
level)
Job_Skills (Bridge Table: job_id,
skill_id)
Industries (industry_id,
name, description)
Employment_Statistics (job_id,
industry_id, hourly_median_wage,
annual_median_wage, employment_count)
Entity Relationship Diagram
Entity Relationship Diagrams specify the nature of the relationship
between tables in a database.