TLDR Web Dev 2024-11-15

Worried about customer data leaks when using RAG and LLM's? (Sponsor)

Secure your AI Pipeline with Piiano's Data Privacy Vault:

Automatically tokenize PII in your prompts, contexts, embeddings and RAGs while using LLM's.
Encrypt your embeddings and protect them in-use.
Protect customer data like chats, speech, transcriptions, photos and PII with a breeze.

Just ask Rabbit Tech, creator of R1, a proud Piiano customer:

“Our achievements to date and future roadmap wouldn't have been possible without Piiano.”

✅ Dev-centric | easy to use | tech-stack agnostic

✅ Compliance | visibility | controls

✅ Free trial

Schedule a demo to learn more

🧑‍💻

Articles & Tutorials

I Followed the Official AWS Amplify Guide and was Charged $1,100 (6 minute read)

This developer was hit with unexpectedly high AWS charges after following the official guide for integrating Amazon OpenSearch with AWS Amplify. He was charged over $1,100 due to the guide's failure to properly manage OpenSearch domain creation and deletion, which led to multiple, concurrently running instances. AWS customer support ultimately waived the charges but the issue persisted until AWS recently fixed its docs.

SQLite Index Visualization: Structure (10 minute read)

The on-disk and in-memory representations of SQLite index structures using a B-tree model can be visualized using an ImageMagick-based approach. This shows how index size, sorting order (ASC/DESC), data types, and index creation timing impact the B-tree structure and how optimizations are achievable through VACUUM and REINDEX.

Netflix's Distributed Counter Abstraction (25 minute read)

Netflix developed a distributed counter abstraction built upon its TimeSeries Abstraction to handle massive-scale counting with low latency. This service addresses various counting needs. The system uses a combination of techniques, including event logging, background aggregation with sliding time windows, and caching.

🧠

Opinions & Advice

The Pythonic Emptiness (15 minute read)

Python's PEP-8 recommended method for checking empty sequences using truthiness (e.g., `if not mylist:`) is more Pythonic and efficient than using `len()`. While using `len()` isn't incorrect, the recommended Pythonic approach is simpler, faster, and clearer.

AI Makes Tech Debt More Expensive (4 minute read)

AI increases the cost of tech debt as it boosts the productivity gap between clean and messy codebases. Generative AI tools work best with well-structured, low-complexity code. Companies with legacy systems will struggle to adopt them. To maximize AI's benefits, developers should prioritize refactoring legacy code into modular, well-defined units.

Lessons from my First Exit (33 minute read)

Michael Lynch, a former Google engineer, sold his bootstrapped hardware company, TinyPilot, for $598,000 after four years of running it. In this post, he details the process, highlighting what he did well (extensive documentation, using a trusted broker, and avoiding seller financing) and what he would do differently (cash buyer incentives and earlier legal counsel). The sale taught him valuable lessons about due diligence, the hidden costs of selling, and the importance of clear contracts.

🚀

Launches & Tools

How Statsig's streaming architecture handles 1 trillion daily events as a startup (Sponsor)

Discover the architecture and streaming pipelines that enable Statsig to process massive event volumes at low cost—while optimizing for near-zero data loss. See how they keep latency low and reliability high, even with a 20x increase in daily event volume over the last year. Read the blog

Windsurf (Website)

Windsurf Editor is an AI-powered IDE that aims to provide a "flow state" coding experience. Its core feature, Cascade, uses codebase understanding for multi-file editing and intelligent AI-powered suggestions.

React Query Builder (GitHub Repo)

React Query Builder is a customizable React component for creating queries that offers utility functions for various query languages (SQL, MongoDB, etc.). It includes compatibility packages for popular styling libraries like Ant Design and Bootstrap and a React Native component.

json.cpp (GitHub Repo)

json.cpp is a fast, small, and highly compliant JSON library for C++. It's more performant than alternatives (parsing 3x faster and compiling 10x faster), has a much smaller codebase (10x less code), and has better JSONTestSuite conformance than its competitors. The library uses Google's double-conversion library for efficient floating-point number serialization.

🎁

Miscellaneous

OpenAI, Google, and Anthropic Are Struggling to Build More Advanced AI (10 minute read)

OpenAI, Google, and Anthropic are encountering difficulties in developing more advanced AI models. Their latest projects, including OpenAI's Orion and Google's next Gemini iteration, are falling short of internal expectations. This slowdown is being attributed to challenges in acquiring high-quality training data and the immense costs associated with building and maintaining these massive models.

Open-Source Software is in Crisis (5 minute read)

The legal dispute between WordPress and hosting provider WPEngine highlights the "maker-taker" problem in open-source software, where companies profit without adequately contributing back to the projects they utilize. This issue is made worse by increasing reliance on open source. Possible solutions include the commercialization of support services, government funding (as seen in Germany), and social validation through recognition and reward systems.

O2 unveils Daisy, the AI granny wasting scammers' time (8 minute read)

“Daisy” is a lifelike AI "granny” designed to waste scammers' time by engaging them in lengthy, rambling conversations. It's trained using real scambaiting content. Daisy can hold conversations indistinguishable from a real person, diverting scammers from genuine targets.

⚡

Quick Links

SQL Style Guide (Website)

Comprehensive SQL guidelines for writing consistent, readable, and portable SQL code.

What makes concurrency so hard? (8 minute read)

Concurrency in computer science is difficult primarily because the number of possible states in a concurrent system grows exponentially with the number of concurrent actors and their actions.

TIL: inline event handlers still fire when passed to React's dangerouslySetInnerHTML (6 minute read)

React's `dangerouslySetInnerHTML` allows execution of inline event handlers despite blocking `script` tags, posing a security risk.

A memory leak in Apple's Network Extension framework (2 minute read)

A memory leak in Apple's Network Extension framework affecting macOS 15 Sequoia causes apps using the framework to consume excessive memory.

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!

https://refer.tldr.tech/d76eafc3/3

Track your referrals here.

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of web developers and engineering decision makers, you may want to advertise with us.

If you have any comments or feedback, just respond to this email!

Thanks for reading,
Priyam Mohanty, Jenny Xu & Ceora Ford

If you don't want to receive future editions of TLDR Web Dev, please unsubscribe from TLDR Web Dev or manage all of your TLDR newsletter subscriptions.

Newslurp

AWS docs blunder 🤦, open-source crisis 🚨, advanced AI struggles 🤖

TLDR Web Dev <dan@tldrnewsletter.com>

November 15, 12:23 pm

TLDR Web Dev 2024-11-15

Articles & Tutorials

Opinions & Advice

Launches & Tools

Miscellaneous

Quick Links