Grafana Labs 2025 ⏲️, Python 3.15 🆕, Meta's RCA Platform 🔍

TLDR DevOps <dan@tldrnewsletter.com>

December 22, 12:10 pm

Together With Momentic

TLDR DevOps 2025-12-22

This AI tool writes, runs, and maintains E2E tests at Notion, Webflow, and Quora (Sponsor)

How much faster could you ship if you didn't have to babysit brittle Playwright tests? With AI, there's no excuse for testing bottlenecks - and the most forward-thinking teams have found a better way.

Momentic's AI lets you prompt in plain English, and it writes, runs, and maintains your entire E2E test suite. In Momentic:

🤕 Auto-healing locators find elements by descriptions, not XPath.

🌤️ Cover critical user flows up to 75% faster with plain English tests.

💨 Cut defect escape rates by an average of 89%.

See why the best product teams choose Momentic.

📱

News & Trends

Grafana Labs: Top 10 moments of 2025 (10 minute read)

Grafana Labs capped 2025 with Grafana 12, major open source milestones, AI-powered Grafana Assistant, Adaptive Telemetry, and expanded Grafana Cloud capabilities, alongside strong community growth, global expansion into Japan, industry recognition, and record revenue and customer adoption.

Shifting left at enterprise scale: how we manage Cloudflare with Infrastructure as Code (7 minute read)

Cloudflare transitioned its internal operations to an Infrastructure as Code (IaC) and "shift left" security model, managing hundreds of production accounts with Terraform and a custom CI/CD pipeline. This approach uses the Open Policy Agent (OPA) framework and Rego to define approximately 50 security policies, ensuring automated compliance checks and peer reviews before deployment to minimize human error.

What's new in Python 3.15 (2 minute read)

Python 3.15 adds a unified profiling module (including the new Tachyon sampling profiler), makes UTF-8 the default encoding, improves error messages, and significantly upgrades the JIT. It also includes many stdlib/C-API tweaks plus a wave of removals and deprecations for upcoming releases.
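
For code that has to behave the same before and after the UTF-8 default change, passing the encoding explicitly is the usual safeguard. The following is a minimal sketch, not taken from the release notes; the `roundtrip` helper and the sample file name are hypothetical.

```python
import sys
import tempfile
from pathlib import Path


def roundtrip(text, path):
    """Write and read text with an explicit encoding (hypothetical helper).

    Naming the encoding means the result does not depend on the
    interpreter's default, whatever Python version runs this.
    """
    path.write_text(text, encoding="utf-8")
    return path.read_text(encoding="utf-8")


with tempfile.TemporaryDirectory() as tmp:
    sample = "naïve café"
    result = roundtrip(sample, Path(tmp) / "sample.txt")
    print(result == sample)

# Reports whether this interpreter is running in UTF-8 mode.
print(sys.flags.utf8_mode)
```

Code that already passes `encoding=` everywhere should see no behavior change from the new default.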

🚀

Opinions & Tutorials

Optimizing Kyverno CLI performance: My LFX mentorship journey (5 minute read)

Exploring Kyverno through the LFX mentorship enabled significant open source contributions, including optimizing the CLI to reduce policy application time on 3,000+ resources from 15 minutes to 1–2 seconds using concurrent loading, namespace caching, and deferred RestMapper discovery.

DrP: Meta's Root Cause Analysis Platform at Scale (3 minute read)

Meta designed DrP, a root cause analysis (RCA) platform, to automate incident investigations, cutting mean time to resolve (MTTR) by 20-80%. More than 300 teams at Meta use the platform, which runs 50,000 analyses daily. It is slated to evolve into an AI-native system to advance the company's AI4Ops vision.

Logging sucks (12 minute read)

Traditional logs are noisy, low-context, and optimized for writing, not for answering real debugging questions in distributed systems. The fix is wide events (canonical log lines): emit one high-cardinality, high-dimensional, context-rich event per request (with tail sampling). This turns debugging from grep-based archaeology into fast, analytical queries.
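
The wide-event idea can be sketched in a few lines. This is an illustration under stated assumptions, not code from the linked article: the `handle_request` handler, its field names, the `/checkout` route, and the sampling thresholds are all hypothetical.

```python
import json
import random
import time
import uuid


def should_keep(event, success_rate=0.05, slow_ms=1000.0, rand=random.random):
    """Tail sampling: decide after the request finishes, when the outcome is known."""
    if event["status"] >= 500 or event["duration_ms"] >= slow_ms:
        return True  # always keep errors and slow requests
    return rand() < success_rate  # keep a small share of ordinary traffic


def handle_request(user_id, cart_items, fail=False, emit=print):
    """Hypothetical handler that builds one wide event instead of many log lines."""
    start = time.monotonic()
    event = {
        "request_id": str(uuid.uuid4()),
        "route": "/checkout",
        "user_id": user_id,  # high-cardinality field, kept on purpose
    }
    try:
        event["cart_size"] = len(cart_items)
        if fail:  # stand-in for a failing downstream call
            raise RuntimeError("payment provider timeout")
        event["status"] = 200
    except RuntimeError as exc:
        event["status"] = 500
        event["error"] = str(exc)
    finally:
        event["duration_ms"] = round((time.monotonic() - start) * 1000, 3)
        if should_keep(event):
            emit(json.dumps(event))  # one context-rich line per request
    return event


handle_request("user-123", ["sku-1", "sku-2"], fail=True)
```

Because every field lives on the same event, a question like "which users hit payment timeouts on checkout" becomes a filter over structured records rather than a search across scattered log lines.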
πŸ§‘β€πŸ’»

Resources & Tools

Skeptics welcome: Setting the gold standard of AI SRE (Sponsor)

Think AI SRE = hype? Engineering teams at Coinbase, DoorDash, MSCI, and Zscaler are seeing 70% faster MTTR, 30% fewer engineers pulled in per incident, and thousands of saved engineering hours using Resolve AI. Get the free AI SRE buyers guide to set your benchmarks or join the online session with a Staff Engineer (and ex-skeptic) from Coinbase on January 8.

Beowulf AI Cluster (GitHub Repo)

Beowulf AI Cluster was developed to deploy AI clusters using Ansible across diverse computers. It provides a flexible framework for testing distributed AI clustering tools and running various benchmarks.

Exo (GitHub Repo)

Exo is a platform that allows users to form an AI cluster at home by connecting everyday devices like phones and laptops, enabling faster execution of larger models with day-0 RDMA over Thunderbolt support. The system provides an API and a dedicated macOS app for managing this distributed AI processing.

🎁

Miscellaneous

Yelp Publishes Blueprint for Managing S3 Server-Access Logs at Massive Scale (7 minute read)

Yelp built a scalable, cost-efficient pipeline for S3 server-access logs by compacting raw logs into Parquet files, reducing storage by 85% and object counts by 99.99%, while enabling efficient querying for debugging, cost analysis, and compliance.

The future of AI-powered software optimization (and how it can help your team) (7 minute read)

GitHub's Continuous Efficiency combines AI-driven automation and green software practices to make codebases self-optimizing for performance, efficiency, and sustainability. Using agentic workflows, developers can author natural-language rules that AI agents apply to improve code quality, enforce standards, and iteratively enhance performance across heterogeneous repositories.

⚡

Quick Links

Planview Hub Bytes: 15-Minute Webinars on Jira, ServiceNow, and Custom Integrations (Sponsor)

Watch Planview's solution architects walk through the most requested integrations, with live Q&A. No hour-long time sink required. Register now

10 Years of Let's Encrypt Certificates (4 minute read)

Let's Encrypt has become the world's largest certificate authority in a decade.

What do people love about Rust? (14 minute read)

Rust is loved because its combination of reliability, performance, and strong tooling gives developers confidence to tackle problems across the entire stack.

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!
Track your referrals here.

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of DevOps professionals and decision makers, you may want to advertise with us.

Want to work at TLDR? 💼

Apply here, create your own role or send a friend's resume to jobs@tldr.tech and get $1k if we hire them! TLDR is one of Inc.'s Best Bootstrapped businesses of 2025.

If you have any comments or feedback, just respond to this email!

Thanks for reading,
Kunal Desai & Martin Hauskrecht


Manage your subscriptions to our other newsletters on tech, startups, and programming. Or if TLDR DevOps isn't for you, please unsubscribe.