• A5 Pentagons Are the New Bestagons
    Jan 19 2026
    How can you accurately aggregate and compare point-based data from different parts of the world? When analyzing crime rates, population, or environmental factors, how do you divide the entire globe into equal, comparable units for analysis? For data scientists and geospatial analysts, these are fundamental challenges. The solution lies in a powerful class of tools called Discrete Global Grid Systems (DGGS). These systems provide a consistent framework for partitioning the Earth's surface into a hierarchy of cells, each with a unique identifier. The most well-known systems, Google's S2 and Uber's H3, have become industry standards for everything from database optimization to logistics. However, these systems come with inherent trade-offs. Now, a new DGGS called A5 has been developed to solve some of the critical limitations of its predecessors, particularly concerning area distortion and analytical accuracy. Why Gridding the Globe is Harder Than It Looks The core mathematical challenge of any DGGS is simple to state but difficult to solve: it is impossible to perfectly flatten a sphere onto a 2D grid without introducing some form of distortion. Think of trying to apply a perfect chessboard or honeycomb pattern to the surface of a ball; the shapes will inevitably have to stretch or warp to fit together without gaps. All DGGS work by starting with a simple 3D shape, a polyhedron, and projecting its flat faces onto the Earth's surface. The choice of this initial shape and the specific projection method used are what determine the system's final characteristics. As a simple analogy, consider which object you’d rather be hit on the head with: a smooth ball or a spiky cube? The ball is a better approximation of a sphere. When you "inflate" a spiky polyhedron to the size of the Earth, the regions nearest the sharp vertices get stretched out the most, creating the greatest distortion. A Quick Look at the Incumbents: S2 and H3 To understand what makes A5 different, it's essential to have some context on the most popular existing systems. Google's S2: The Cube-Based Grid The S2 system is based on projecting a cube onto the sphere. On each face of this conceptual cube, a grid like a chessboard is applied. This approach is relatively simple but introduces significant distortion at the cube’s vertices, or "spikes." As the grid is projected onto the sphere, the cells near these vertices become stretched into diamond shapes instead of remaining square. S2 is widely used under the hood for optimizing geospatial queries in database systems like Google BigQuery. Uber's H3: The Hexagonal Standard Uber's H3 system starts with an icosahedron—a 20-sided shape made of triangles. Because an icosahedron is a less "spiky" shape than a cube, H3 suffers from far less angular distortion. Its hexagonal cells look more consistent across the globe, making it popular for visualization. H3's immense success is also due to its excellent and user-friendly ecosystem of tools and libraries, making it easy for developers to adopt. However, H3 has one critical limitation for data analysis: it is not an equal-area system. This was a deliberate trade-off, not a flaw; H3 was built by a ride-sharing company trying to match drivers to riders, a use case where exact equal area doesn't particularly matter. To wrap a sphere in hexagons, you must also include exactly 12 pentagons—just like on a soccer ball. If you look closely at a football, you'll see the pentagonal panels are slightly smaller than the hexagonal ones. This same principle causes H3 cells to vary in size. The largest and smallest hexagons at a given resolution can differ in area by a factor of two, meaning that comparing raw counts in different cells is like comparing distances in miles and kilometers without conversion. For example, cells near Buenos Aires are smaller because of their proximity to one of the system's core pentagons, creating a potential source of error if not properly normalized. Introducing A5: A New System Built for Accuracy A5 is a new DGGS designed from the ground up to prioritize analytical accuracy. It is based on a dodecahedron, a 12-sided shape with pentagonal faces that is, in the words of its creator, "even less spiky" than H3's icosahedron. The motivation for A5 came from a moment of discovery. Its creator, Felix Palmer, stumbled upon a unique 2D tiling pattern made of irregular pentagons. This led to a key question: could this pattern be extended to cover the entire globe? The answer was yes, and it felt like uncovering something "very, very fundamental." This sense of intellectual curiosity, rather than a narrow business need, is the foundation upon which A5 is built. A5's single most important feature is that it is a true equal-area system. Using a specific mathematical projection, A5 ensures that every single cell at a given resolution level has the exact same area. This guarantee even accounts for the Earth's ...
    Afficher plus Afficher moins
    37 min
  • The Sustainable Path for Open Source Businesses
    Jan 8 2026
    The Open-Source Conundrum Many successful open-source projects begin with passion, but the path from a community-driven tool to a sustainable business is often a trap. The most common route—relying on high-value consulting contracts—can paradoxically lead to operational chaos. Instead of a "feast or famine" cycle, many companies find themselves with more than enough work, but this success comes at a cost: a fragmented codebase, an exhausted team, and a growing disconnect from the core open-source community. This episode deconstructs a proven playbook for escaping this trap: the strategic transition from a service-based consultancy to a product-led company. Through the story of Jerome and his company, Geocat, we will analyze how this pivot creates a more stable business, a healthier open-source community, and ultimately, a better product for everyone.
    Afficher plus Afficher moins
    36 min
  • Free Software and Expensive Threats
    Dec 26 2025
    Open-source software is often described as "free," a cornerstone of the modern digital world available for anyone to download, use, and modify. But this perception of "free" masks a growing and invisible cost—not one paid in dollars, but in the finite attention, time, and mounting pressure placed on the volunteer and community maintainers. This hidden tax is most acute when it comes to security. Jody from Geocat, a long-time contributor to the popular GeoServer project, pulled back the curtain on the immense strain that security vulnerabilities place on the open-source ecosystem. His experiences reveal critical lessons for anyone who builds, uses, or relies on open-source software.
    Afficher plus Afficher moins
    34 min
  • Mapping Your Own World: Open Drones and Localized AI
    Dec 18 2025

    What if communities could map their own worlds using low-cost drones and open AI models instead of waiting for expensive satellite imagery?

    In this episode with Leen from HOT (Humanitarian OpenStreetMap Team), we explore how they're putting open mapping tools directly into communities' hands—from $500 drones that fly in parallel to create high-resolution imagery across massive areas, to predictive models that speed up feature extraction without replacing human judgment.

    Key topics:

    • Why local knowledge beats perfect accuracy
    • The drone tasking system: how multiple pilots map 80+ square kilometers simultaneously
    • AI-assisted mapping with humans in the loop at every step
    • Localizing AI models so they actually understand what buildings in Chad or Papua New Guinea look like
    • The platform approach: plugging in models for trees, roads, rooftop material, waste detection, whatever communities need
    • The tension between speed and OpenStreetMap's principles
    • Why mapping is ultimately a power game—and who decides what's on the map
    Afficher plus Afficher moins
    33 min
  • From Data Dump to Data Product
    Dec 9 2025

    This conversation with Jed Sundwall, Executive Director of Radiant Earth, starts with a simple but crucial distinction: the difference between data and data products. And that distinction matters more than you might think.

    We dig into why so many open data portals feel like someone just threw up a bunch of files and called it a day. Sure, the data's technically "open," but is it actually useful? Jed argues we need to be way more precise with our language and intentional about what we're building.

    A data product has documentation, clear licensing, consistent formatting, customer support, and most importantly - it'll actually be there tomorrow.

    From there, we explore Source Cooperative, which Jed describes as "object storage for people who should never log into a cloud console." It's designed to be invisible infrastructure - the kind you take for granted because it just works. We talk about cloud native concepts, why object storage matters, and what it really means to think like a product manager when publishing data.

    The conversation also touches on sustainability - both the financial kind (how do you keep data products alive for 50 years?) and the cultural kind (why do we need organizations designed for the 21st century, not the 20th?). Jed introduces this idea of "gazelles" - smaller, lighter-weight institutions that can move together and actually get things done.

    We wrap up talking about why shared understanding matters more than ever, and why making data easier to access and use might be one of the most important things we can do right now.

    Afficher plus Afficher moins
    46 min
  • Reflections from FOSS4G 2025
    Dec 2 2025

    Reflections from the FOSS4G 2025 conference

    Processing, Analysis, and Infrastructure (FOSS4G is Critical Infrastructure)

    The high volume of talks on extracting meaning from geospatial data—including Python workflows, data pipelines, and automation at scale—reinforced the idea that FOSS4G represents critical infrastructure.

    • AI Dominance: AI took up a lot of space at the conference. I was particularly interested in practical, near-term impact talks like AI assisted coding and how AI large language models can enhance geospatial workflows in QGIS. Typically, AI discussions focus on big data and earth observation, but these topics touch a larger audience. I sometimes wonder if adding "AI" to a title is now like adding a health warning: "Caution, a machine did this".
    • Python Still Rules (But Rust is Chatting): Python remains the pervasive, default geospatial language. However, there was chatter about Rust. One person suggested rewriting QGIS in Rust might make it easier to attract new developers.
    Data Infrastructure, Formats, and Visualization

    When geospatial people meet, data infrastructure—the "plumbing" of how data is stored, organized, and accessed—always dominates.

    • Cloud Native Won: Cloud native architecture captured all the attention. When thinking about formats, we are moving away from files on disk toward objects in storage and streaming subsets of data.
    • Key cloud-native formats covered included COGs (Cloud Optimized GeoTIFFs), Zarr, GeoParquet, and PMTiles. A key takeaway was the need to choose a format that best suits the use case, defined by who will read the file and what they will use the data for, rather than focusing solely on writing it.
    • The Spatial Temporal Asset Catalog (STAC) "stole the show" as data infrastructure, and DuckDB was frequently mentioned.
    • Visualization is moving beyond interactive maps and toward "interactive experiences". There were also several presentations on Discrete Global Grid Systems (DGGS).
    Standards and Community Action
    • Standards Matter: Standards are often "really boring," but they are incredibly important for interoperability and reaping the benefits of network effects. The focus was largely on OGC APIs replacing legacy APIs like WMS and WFS (making it hard not to mention PyGeoAPI).
    • Community Empowerment: Many stories focused on community-led projects solving real-world problems. This represents a shift away from expert-driven projects toward community action supported by experts. Many used OSM (OpenStreetMap) as critical data infrastructure, highlighting the need for locals to fill in large empty chunks of the map.
    High-Level Takeaways for the Future

    If I had to offer quick guidance based on the conference, it would be:

    1. Learn Python.
    2. AI coding is constantly improving and worth thinking about.
    3. Start thinking about maps as experiences.
    4. Embrace the Cloud and understand cloud-native formats.
    5. Standards matter.
    6. AI is production-ready and will be an increasingly useful interface to analysis.
    Reflections: What Was Missing?

    The conference was brilliant, but a few areas felt underrepresented:

    • Sustainable Funding Models: I missed a focus on how organizations can rethink their business models to maintain FOSS4G as critical infrastructure without maintainers feeling their time is an arbitrage opportunity.
    • Niche Products: I would have liked more stories about side hustles and niche SAS products people were building, although I was glad to see the "Build the Thing" product workshop on the schedule.
    • Natural Language Interface: Given the impact natural language is having on how we interact with maps and geo-data, I was surprised there wasn't more dedicated discussion around it. I believe it will be a dominant way we interact with the digital world.
    • Art and Creativity: Beyond cartography and design talks, I was surprised how few talks focused on creative passion projects built purely for the joy of creation, not necessarily tied to making a part of something bigger.
    Afficher plus Afficher moins
    14 min
  • Building a Community of Geospatial Storytellers
    Nov 27 2025

    Karl returns to the Mapscaping podcast to discuss his latest venture, Tyche Insights - a platform aimed at building a global community of geospatial storytellers working with open data.

    In this conversation, we explore the evolution from his previous company, Building Footprint USA (acquired by Lightbox), to this new mission of democratizing public data storytelling.

    Karl walks us through the challenges and opportunities of open data, the importance of unbiased storytelling, and how geospatial professionals can apply their skills to analyze and share insights about their own communities. Karl shares his vision for creating something akin to Wikipedia, but for civic data stories - complete with style guides, editorial processes, and community collaboration.

    Featured Links

    Tyche Insights:

    • Main website: https://tycheinsights.com
    • Wiki platform: https://wiki.tycheinsights.com
    • Example project: https://albanydatastories.com

    Mentioned in Episode:

    • USAFacts: https://usafacts.org
    • QField Partner Program: https://qfield.org/partner
    • Open Data Watch: (monitoring global open data policies)
    Afficher plus Afficher moins
    42 min
  • I have been making AI slop and you should too
    Nov 17 2025
    AI Slop: An Experiment in Discovery Solo Episode Reflection: I'm back behind the mic after about a year-long break. Producing this podcast takes more time than you might imagine, and I was pretty burnt out. The last year brought some major life events, including moving my family back to New Zealand from Denmark, dealing with depression, burying my father, starting a new business with my wife, and having a teenage daughter in the house. These events took up a lot of space. The Catalyst for Return: Eventually, you figure out how to deal with grief, stop mourning the way things were, and focus on the way things could be. When this space opened up in my life, AI came into the picture. AI got me excited about ideas again because for the first time, I could just build things myself without needing to pitch ideas or spend limited financial resources. On "AI Slop": I understand why some content is called "slop," but for those of us who see AI as a tool, I don't think the term is helpful. We don't refer to our first clumsy experiments with other technologies—like our first map or first lines of code—as slop. I believe that if we want to encourage curiosity and experimentation, calling the results of people trying to discover what's possible "slop" isn't going to help. My AI Experimentation Journey My goal in sharing these experiments is to encourage you to go out and try AI yourself. Phase 1: SEO and Content Generation My experimentation began with generating SEO-style articles as a marketing tool. As a dyslexic person, I previously paid freelancers thousands of dollars over the years to help create content for my website because it was too difficult or time-consuming for me to create myself. Early Challenges & Learning: My initial SEO content wasn't great, and Google recognized this, which is why those early experiments don't rank in organic search. However, this phase taught me about context windows, the importance of prompting (prompt engineering), and which models and tools to use for specific tasks.Automation and Agents: I played around with automation platforms like Zapier, make.com, and n8n. I built custom agents, starting with Claude projects and custom GPTs. I even experimented with voice agents using platforms like Vappy and 11 Labs. Unexpected GIS Capabilities: During this process, I realized you can ask platforms like ChatGPT to perform GIS-related data conversions (e.g., geojson to KML or shapefile using geopandas), repro data, create buffers around geometries, and even upload a screenshot of a table from a PDF and convert it to a CSV file. While I wouldn't blindly trust an LLM for critical work, it's been interesting to learn where they make mistakes and what I can trust them for. AI as a Sparring Partner: I now use AI regularly to create QGIS plugins and automations. Since I often work remotely as the only GIS person on certain projects, I use AI—specifically talking to ChatGPT via voice on my phone—as a sparring partner to bounce ideas off of and help me solve problems when I get stuck. Multimodal Capabilities: The multimodal nature of Gemini is particularly interesting; if you share your screen while working in QGIS, Gemini can talk you through solving a problem (though you should consider privacy concerns). The Shift to Single-Serve Map Applications I noticed that the digital landscape was changing rapidly. LLMs were becoming "answer engines," replacing traditional search on Google, which introduced AI Overviews. Since these models no longer distribute traffic to websites like mine the way they used to, I needed a new strategy. The Problem with Informational Content: Informational content on the internet is going to be completely dominated by AI.The Opportunity: Real Data: AI is great at generating content, but if you need actual data—like contours for your specific plot of land in New Zealand—you need real data, not generated data.New Strategy: My new marketing strategy is to create targeted, single-serve map applications and embed them in my website. These applications do one thing and one thing only, using open and valuable data to solve very specific problems. This allows me to rank in organic search because these are problems that LLMs have not yet mastered. Coding with AI: I started by using ChatGPT to code small client-side map applications, then moved to Claude, which is significantly better than OpenAI's models and is still my coding model of choice. Currently, I use Cursor AI as a development environment, swapping between Claude code, OpenAI's Codex, and other models. A Caveat: Using AI for coding can be incredibly frustrating. The quality of the code drops dramatically once it reaches a certain scale. However, even with flaws, it’s a thousand times better and faster than what I could do myself, making my ideas possible. Crucially, I believe that for the vast majority of use cases, mediocre code is good enough. Success Story: GeoHound After practicing and ...
    Afficher plus Afficher moins
    19 min