A Breath of Fresh Air: AI Hackathon
This hackathon invites participants to create AI-powered solutions for air quality across the Wasatch Front addressing pressing challenges in predicting poor air quality either due to wildfire smoke or inversions, elucidating contributing factors, such as the Great Salt Lake, geography, weather, and human factors, and mitigating effects of poor quality. With a focus on leveraging data science, machine learning, and AI, participants will create tools and models to tackle real-world issues around air quality and help communities and policymakers make informed decisions to improve air quality and public health.
Event Structure
- Duration: 48-hour hackathon
- Location: This is a hybrid event. Join us online or at the Warnock Engineering Building.
- Team Size: 3-5 members per team, including at least 2 students and 2 Utahns
- Dates: June 4–5, 2025. Awards announced June 6, 2025.
- Registration Closes: April 15, 2025
- Team Building Kickoff (hybrid): May 1, 2025
Eligibility & Support
This event is targeted toward Utah college students, but all are welcome. No coding experience needed! Out-of-state registrants will be teamed with at least two Utahns.
We will hold teaming sessions to help match participants with other likeminded participants and projects. Prior to the event, in June, Amazon Web Services (AWS) will host enablement sessions to ensure participants are prepared to participate fully in the hackathon.
Each team will be supported by subject matter experts (SMEs) to support projects both in terms of domain expertise and technical expertise. AWS Solution Architects will be available for consult throughout the event. Teams will meet with their domain SME prior to the hackathon to help scope out the project. SMEs will also be available during the event to support emerging questions. Interested SMEs should email Penny Atkins.
Amazon Web Services (AWS) Enablement and Pre-Planning Sessions
From May to early June, Amazon Web Services (AWS) and the National Data Platform (NDP) are offering virtual enablement sessions to provide you with the skills you need to participate in the hackathon. RSVPs are required to ensure there’s enough interest (at least 20 participants) to hold each session.
Workshops last three hours and entail a lecture and hands-on training.
Office hours follow workshops. They last one hour and allow for Q&As with relevant experts.
- May 1, 11:30-1:00: Team Kickoff*
- May 6, 1:00-4:00: Virtual Workshop—AWS Basics*
- May 7, 12:00-1:00: Virtual Office Hours—Subject Matter Experts*
with Tom Becnel (TELLUS), Derek Mallia (UofU, Atmospheric Sciences), and Akhil Potla (UofU, Software Design Engineer CTSI) - May 8, 12:00-1:00: Virtual Office Hours—AWS Basics*
- May 12, 1:00-4:00: Virtual Workshop—SageMaker Canvas†
- May 13, 12:00-1:00: Virtual Office Hours—Subject Matter Experts*
with William Anderegg (UofU, Biological Sciences), Philip Harrison (Utah Department of Air Quality), John Lin (UofU, Atmospheric Sciences), and Rob Smart (UofU, Software Design Engineer CTSI) - May 14, 12:00-1:00: Virtual Office Hours—National Data Platform & Data Access Δ
- May 15, 12:00-1:00: Hybrid Office Hours—SageMaker Canvas†
- May 20, 1:00-4:00: Virtual Workshop—Bedrock‡
- May 20, 1:00-4:00: Virtual Workshop—QuickSight §
- May 22, 12:00-1:00: Virtual Office Hours—Bedrock & QuickSight‡§
- May 27, 1:00-4:00: Virtual Workshop—SageMaker Advanced / Unified Studio‡
- May 28, 11:00-12:00: Virtual Office Hours—Advanced Data Handling with NDP and SciDx Δ
- May 29, 12:00-1:00: Virtual Office Hours—SageMaker Advanced/Unified Studio‡
- June 3, 11:30-1:00: Virtual Session—Building Your One-Pager*
- June 3, 1:00-3:00: Virtual Office Hours—Account Setup*
Target Audience Key
* = Everyone
† = Teams with no code experience
‡ = Teams with code experience
Δ = Teams that have questions about using data
§ = Teams seeking tools for visualization
Challenge Tracks
We have identified four challenge tracks that have potential for innovative data-driven solutions. Teams can define projects in one of these tracks or at the intersection of multiple tracks.
Prediction Models
Mitigation Strategies
Understanding and Analysis
Air Quality Products for Health Outcomes
Data
In addition to data available through the National Data Platform, AWS provides access to relevant data sets through the AWS Open Data Program:
- The Sentinel-5P data set consists of observations from the Sentinel-5 Precursor (Sentinel-5P) satellite of the European Commission’s Copernicus Earth Observation Program that completes 14 orbits of the Earth a day.
- SPARTAN (Surface PARTiculate mAtter Network) measures and provides surface ambient particulate matter (PM2.5 and PM10) concentration and the chemical composition around the world, with the purpose of connecting ground-based PM2.5 and satellite remote sensing.
- The National Air Quality Forecasting Capability (NAQFC) dataset contains model-generated Air-Quality (AQ) forecast guidance from three different prediction systems.
-
CMAS Data Warehouse collects and disseminates meteorology, emissions and air quality model input and output for Community Multiscale Air Quality (CMAQ) Model Applications.
-
OpenAQ collects global, aggregated physical air quality data from public data sources provided by government, research-grade and other sources. These awesome groups do the hard work of measuring these data and publicly sharing them, and our community makes them more universally-accessible to both humans and machines.
- The Aurora Multi-Sensor Dataset is an open, large-scale multi-sensor dataset with highly accurate localization ground truth contains rich metadata, such as weather and semantic segmentation, and spans all four seasons, rain, snow, overcast and sunny days, different times of day, and a variety of traffic conditions.
Key Considerations for Responsible Solutions & Judging Criteria
-
Ensure the data sources are reliable and have sufficient historical records for training AI models.
-
Address privacy concerns, especially if using data that could identify individuals or specific locations.
-
Encourage teams to include members with diverse expertise, such as data scientists, environmental scientists, and public health experts.
-
Involve local communities and stakeholders to ensure the solutions are practical and address real-world needs.
-
Consider how the solutions can be scaled and sustained over time, including potential partnerships with local governments or organizations.
-
Potential to make a meaningful difference in air quality prediction and mitigation.
-
Originality and creativity in the approach and solution design.
-
Integration of advanced AI/ML techniques and data sources.
-
Quality of the final presentation, including clear articulation and demonstration of the solution.
-
Practicality and scalability of the proposed solution in real-world applications.
Prizes & Perks
-
Food and refreshments will be provided for in-person participants. In addition, all participants will have access to enablement sessions from AWS and a chance at one of four prizes which will be awarded to top teams:
-
1st Prize: $1500 cash, $1250 AWS credits
-
2nd Prize: $1000 cash, $750 AWS credits
-
3rd Prize: $500 cash
-
NVIDIA’s Choice Award: GeForce RTX 4090 GPU
-
Thank You to Our Sponsors