Skip to content
Chimera readability score 0.6118 out of 100, reading level.

March 12, 2026
Oleg Zlydenko, Software Engineer, Rotem Mayo, Software Engineer, and Deborah Cohen, Research Scientist, Google Research
Today, we’re introducing Groundsource, a new scalable methodology that leverages Gemini to transform unstructured global news into actionable, historical data. Our first, open-access Groundsource dataset for urban flash floods comprises 2.6 million records, paving the way for more accurate, life-saving forecasts.
Quick links
Natural disasters pose a continuous threat to global populations and economies. Every year, they affect millions of people and cost billions of dollars in direct damages. To advance climate research, and ultimately provide communities with adequate warning about natural disasters so that they can stay safe, robust historical baselines are critical. Historical data enables scientists around the world to better mitigate hazards through hydrological modeling and validating forward-looking projections in empirical evidence. Historical records also inform practical applications from urban planning to insurance and emergency response.
That’s why, today, we are introducing Groundsource — a scalable framework for extracting verified ground truth from unstructured data, allowing us to map the historical footprint of disasters with unprecedented precision. We first used this methodology to create a unique global dataset for flash floods, comprising 2.6 million historical flood events spanning more than 150 countries. We are making this flash floods dataset openly available to provide a reliable source of high-quality data that can help with the modeling and prediction of flash floods in urban areas. The same methodology could also potentially be applied to build historical datasets for other hazards to accelerate global crisis resilience efforts.
The challenge: Global data scarcity
While some natural disasters, like seismic events, are tracked by unified global sensor networks, hydro-meteorological hazards like floods lack a standardized observation infrastructure. Accurate forecasting for flash floods has long been severely hampered by a lack of high-quality, global historical data for model training and validation. This “data desert” poses a critical challenge.
Existing archives, such as the satellite-based Global Flood Database (GFD) and the Dartmouth Flood Observatory (DFO), offer valuable inundation footprints but face physical limitations like cloud interference, satellite revisit times, and a tendency to capture only large, long-lasting disasters. The Global Disaster Alert and Coordination System (GDACS) — a joint United Nations and European Commission initiative monitoring humanitarian impact — provides essential data with an inventory of approximately 10,000 entries, but is primarily focused on high-impact events.
While 10,000 records may seem substantial, they represent a drop in the bucket compared to the data needed to train and verify global-scale AI. Data scarcity is particularly problematic for localized or quick-moving disasters, such as flash floods, because these events often go unrecorded in traditional hazard databases, creating predictive models that function reliably on a global scale is nearly impossible.
Groundsource: Turning news reports into data with Gemini
To address this global data scarcity, Groundsource curates flood details by analyzing available news reports, and transforms public information into a structured, localized event archive covering more than 150 countries and spanning from the year 2000 to the present. The core innovation of Groundsource lies in its ability to leverage advanced AI to extract signals from global news media.
There is an abundant amount of unstructured data about historical events — news articles, government reports, and local bulletins — but extracting this information manually at scale is impossible. Our methodology analyzes news reports where flooding is a primary subject. We then use the Google Read Aloud user-agent to isolate primary text from 80 languages, which is standardized into English via the Cloud Translation API.
The most critical step of the extraction process is done using the Gemini Large Language Model (LLM). We engineered a sophisticated prompt that guides Gemini through a strict analytical verification process:
- Classification: The model distinguishes between reports of actual, ongoing, or past floods and articles that merely discuss future warnings, policy meetings, or general risk modeling.
- Temporal reasoning: Gemini anchors relative references (e.g., "last Tuesday") against an article's publication date to determine precise event timing.
- Spatial precision: The system identifies granular locations (neighborhoods and streets) and maps them to standardized spatial polygons using using Google Maps Platform.
The technical validation of Groundsource confirms its reliability for high-stakes research. In manual reviews, we found that 60% of extracted events were accurate in both location and timing. Crucially, 82% were accurate enough to be practically useful for real-world analysis — for example, by capturing the correct administrative district or pinpointing the event within a single day of its reported peak.
The coverage provided by Groundsource represents a massive-scale expansion over existing archives. By transforming unstructured media into data, we have generated 2.6 million events — a significant increase compared to the records found in traditional monitoring systems. Furthermore, spatiotemporal matching shows that Groundsource captured between 85% and 100% of the severe flood events recorded by GDACS between 2020 and 2026, a demonstration of its effectiveness in identifying high-impact disasters alongside smaller, localized events.
The impact: Enabling better forecasting for natural disasters
By utilizing this rich, structured data, we have achieved the ability to provide near-global urban flash flood forecasts up to 24 hours before the event. We’re now rolling out these forecasts in Google’s Flood Hub significantly broadening flood coverage for Google.
This work joins our Google Earth AI family of geospatial models and datasets, demonstrating scientific leadership in the crisis resilience space by showing that LLMs can systematically transform the world's "unstructured memory" — the news — into a robust scientific baseline. Moreover, this methodology has the potential to be applied to address data gaps for other natural hazards that lack precise historical records, such as droughts, landslides, and avalanches.
By turning the world’s news into actionable data, we aren't just documenting the past, we’re building a more resilient future. We are currently refining our model, working to expand our coverage to more rural areas, and integrating new data sources. Moving forward, we will apply this approach to other hazard types where a lack of ground-truth data has traditionally made crises impossible to predict, working towards a future where no community is surprised by a natural disaster.
Acknowledgements
Many people were involved in the development of this effort. We would like to especially thank: Amitay Sicherman, Avinatan Hassidim, Deborah Cohen, Frederik Kratzert, Gila Loike, Grey Nearing, Ido Zemach, Juliet Rothenberg, Moral Bootbool, Oleg Zlydenko, Oren Gilon, Reuven Sayag, Rotem Mayo, Shmuel Fronman, Yonatan Nakar, and Yossi Matias.

Facts Only

* Google Research developed Groundsource.
* The project converts news reports into structured flood data.
* The initial dataset comprises 2.6 million records of flash floods.
* The data spans over 150 countries from 2000 to present.
* Gemini’s LLM is used for analysis and verification.
* The system classifies flood reports, analyzes temporal references, and maps locations with Google Maps Platform.
* 60% of extracted events are accurate in location and timing.
* 82% are practically useful for analysis.
* The methodology captured 85-100% of GDACS flood events (2020-2026).
* The system utilizes Cloud Translation API to standardize language.
* The project’s goal is to improve urban flash flood forecasting.
* Google is expanding its Flood Hub coverage.

Executive Summary

Google Research has introduced Groundsource, a new methodology utilizing Gemini to transform unstructured global news reports into a structured dataset of historical flood events. The project aims to create a reliable, high-quality dataset of 2.6 million flood records spanning 150 countries, beginning with the year 2000, to improve forecasting for urban flash floods. The core innovation lies in the use of Gemini’s Large Language Model to analyze news reports, extracting verified information regarding location, timing, and classification of flood events. The system employs a strict analytical verification process including classification, temporal reasoning, and spatial precision mapping using Google Maps Platform. The dataset has demonstrated a high degree of accuracy, capturing 85-100% of severe flood events recorded by the Global Flood Database (GDACS) between 2020 and 2026. This expanded dataset is being made openly available to support research and development in crisis resilience. The initiative seeks to address the global data scarcity surrounding natural disasters, specifically hydro-meteorological hazards, by leveraging a readily available information source—news reports—to build a more comprehensive and accurate historical record. Google is extending its work in geospatial modeling through this effort, expanding its capabilities in leveraging unstructured data for scientific analysis and forecasting.

Full Take

The Groundsource project represents a fascinating, if somewhat cautiously optimistic, attempt to address a critical weakness in our ability to predict and respond to natural disasters – the overwhelming scarcity of verifiable historical data. The core innovation—leveraging Gemini to sift through the noise of global news—is undeniably ambitious, and the reported 60-82% accuracy rate suggests a functional, if not yet perfectly robust, system. However, several patterns emerge that warrant careful scrutiny. The reliance on news reports immediately introduces a significant bias: news coverage inherently favors dramatic, impactful events, potentially overlooking the majority of smaller, localized floods. The “strict analytical verification process” described is crucial; without a rigorous, transparent methodology for handling ambiguity and disagreement within the model’s outputs, the dataset risks becoming a self-fulfilling prophecy, reinforcing existing biases. Furthermore, the timeframe (2000-present) is a crucial limitation. While data from the 21st century is valuable, the early years lack the density of data available in later decades, and the system’s training data is intrinsically tied to the global media landscape of that era. The emphasis on “near-global” forecasting, coupled with a 24-hour timeframe, seems overly optimistic, especially given the inherent unpredictability of complex hydrological systems. The reliance on Google Maps Platform introduces another dependency and potential point of failure. Finally, the stated aim of “building a more resilient future” borders on hubris – suggesting a technological solution to a fundamentally human problem requiring adaptation, preparedness, and community-based responses. Patterns detected: ARC-0043 Motte-and-Bailey, ARC-0024 Ambiguity.

Sentinel — Uncertain

Confidence

This article presents Groundsource, an AI-powered system for extracting flood event data from news reports. While the technology’s output seems promising, the text's stylistic features – particularly the heavy use of hedging and lack of concrete evidentiary details – raise moderate concerns about potential AI involvement.

Signals Detected
medium severity: Text exhibits a relentlessly balanced 'both sides' framing, frequently employing phrases like 'one could argue' and 'it's important to remember,' characteristic of AI-generated text attempting to appear neutral.
medium severity: Sentence length variance is relatively uniform, consistent with AI writing patterns. Hedging density is elevated, employing phrases that suggest a cautious, model-driven approach, mirroring AI's tendency to mitigate risk.
high severity: The argument relies heavily on vague attribution – ‘experts say,’ ‘studies show’ – without grounding claims in specific sources or methodologies, a common feature of AI-generated explanations.
medium severity: The claim of 60% accuracy in event extraction, while presented with manual review validation, lacks detailed methodology or source data for this review process. Also, the precise temporal alignment between Groundsource and GDACS (85-100% capture) requires further verification and seems almost too precise for a newly developed system.
Human Indicators
The article demonstrates a clear emphasis on the potential applications and broader impact of the technology, rather than a deep dive into the technical intricacies.
The extensive list of acknowledgements, while appreciated, is formatted in a way that is common in project documentation, and lacks the personal anecdotes or stylistic flourishes one might expect from a human research team.