AWS us-east-1 Outage, Production RAG at Scale, and AI-Driven UX

Widespread AWS us-east-1 Degradation Hits EC2, S3, Lambda, RDS
Around the web • October 20, 2025
AWS is reporting degraded operations in the N. Virginia (us-east-1) region, impacting 100+ services, including EC2, S3, DynamoDB, Lambda, RDS, Route 53, and CloudWatch. While some services are recovering, customers may see API errors, elevated latency, and console issues; monitor the Service Health and Personal Health dashboards, and consider multi‑region failover and more aggressive retry/backoff where possible.
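For teams riding out the incident, here is a minimal sketch of the retry/backoff suggestion, assuming boto3; the clients and settings below are illustrative defaults, not AWS guidance.

```python
import boto3
from botocore.config import Config

# Adaptive retry mode layers client-side rate limiting on top of
# exponential backoff with jitter; max_attempts raises the retry ceiling.
retry_config = Config(
    region_name="us-east-1",
    retries={"max_attempts": 10, "mode": "adaptive"},
)

s3 = boto3.client("s3", config=retry_config)
dynamodb = boto3.client("dynamodb", config=retry_config)

# Calls on these clients now retry throttling and transient 5xx errors
# automatically; pair this with application-level timeouts and
# multi-region failover for critical paths.
```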
AI Trends & Engineering
Production RAG at Scale: Hard-Won Lessons from 5M+ Docs
Around the web • October 20, 2025
Hands-on takeaways from deploying RAG over 5M+ documents: generate multiple semantic/keyword queries per prompt, aggressively rerank (50 chunks in → 15 out), invest heavily in custom chunking, inject metadata, and route non-RAG questions to lightweight APIs. The stack evolved from LangChain to LlamaIndex; vector stores from Azure to Pinecone to Turbopuffer; embeddings via text-embedding-3-large; reranking from Cohere 3.5 to Zerank; and LLMs (GPT‑4.1/5) served via Azure, optimizing for coverage, quality, and cost. The approach is open-sourced as agentset-ai/agentset (MIT), offering practical patterns for teams scaling enterprise RAG.
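A rough sketch of the retrieve, rerank, and generate flow summarized above; the callables are hypothetical stand-ins rather than the agentset-ai/agentset API, and the 50-in/15-out numbers mirror the article.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Chunk:
    text: str
    metadata: dict  # e.g. title, section, source URL injected at index time

def answer(
    question: str,
    is_rag_question: Callable[[str], bool],           # cheap router
    expand_queries: Callable[[str], list[str]],       # semantic + keyword variants
    search: Callable[[str, int], list[Chunk]],        # vector/keyword store lookup
    rerank: Callable[[str, list[Chunk], int], list[Chunk]],
    generate: Callable[[str, list[Chunk]], str],      # final LLM call
) -> str:
    # Route simple, non-document questions to a lightweight path instead of full RAG.
    if not is_rag_question(question):
        return generate(question, [])

    # 1. Fan the prompt out into several semantic/keyword queries.
    queries = expand_queries(question)

    # 2. Retrieve broadly across queries, dedupe, keep ~50 candidates.
    seen: set[str] = set()
    candidates: list[Chunk] = []
    for q in queries:
        for chunk in search(q, 10):
            if chunk.text not in seen:
                seen.add(chunk.text)
                candidates.append(chunk)
    candidates = candidates[:50]

    # 3. Aggressively rerank down to ~15 chunks before generation.
    top = rerank(question, candidates, 15)

    # 4. Generate with metadata-rich context.
    return generate(question, top)
```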
BERT as Diffusion: RoBERTa Generates Text in 10 Steps
Around the web • October 20, 2025
This piece argues that discrete text diffusion is effectively masked language modeling at varying mask rates and shows a simple method to repurpose a RoBERTa encoder for generation. Using Hugging Face Transformers on WikiText with a custom collator that samples mask percentages and preserves a 16‑token prompt, the model performs 10 iterative denoising steps to generate 256‑token blocks, yielding coherent text that is slightly slower to generate (~13s vs ~9s) and less consistent than GPT‑2's output. The result suggests encoder‑only LMs can act as diffusion generators without architectural changes, with potential gains from AR‑Diffusion and Skip‑Step Diffusion.
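A minimal sketch of the iterative unmasking loop in the spirit of the post, using Hugging Face's roberta-base; the confidence-based reveal schedule is an assumption, and special tokens are omitted for brevity.

```python
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForMaskedLM.from_pretrained("roberta-base").eval()

@torch.no_grad()
def diffusion_generate(prompt: str, total_len: int = 256, steps: int = 10) -> str:
    # Keep a 16-token prompt fixed; fill the rest of the block with <mask>.
    prompt_ids = tokenizer(prompt, add_special_tokens=False).input_ids[:16]
    ids = torch.full((1, total_len), tokenizer.mask_token_id, dtype=torch.long)
    ids[0, : len(prompt_ids)] = torch.tensor(prompt_ids)

    for step in range(steps):
        masked = ids[0] == tokenizer.mask_token_id
        if not masked.any():
            break
        logits = model(input_ids=ids).logits[0]      # (total_len, vocab)
        probs, preds = logits.softmax(-1).max(-1)    # confidence and argmax per slot
        # Reveal an equal share of the remaining masked slots at each step,
        # choosing the positions the model is most confident about.
        n_reveal = int(torch.ceil(masked.sum() / (steps - step)))
        conf = torch.where(masked, probs, torch.full_like(probs, -1.0))
        reveal = conf.topk(n_reveal).indices
        ids[0, reveal] = preds[reveal]

    return tokenizer.decode(ids[0], skip_special_tokens=True)

# Example: print(diffusion_generate("In recent years, machine learning has"))
```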
Google AI Mode Supercharges Search, Stumbles on UX and Discoverability
Nielsen Norman Group • October 17, 2025
NN/g’s usability review finds Google’s new AI Mode—an AI-powered search chat using query fan-out and RAG with Maps/Shopping/YouTube integrations—delivers strong synthesized answers and helpful follow-up prompts. But in a small study (n=7), users struggled with poor discoverability, confusion with Gemini and AI Overviews, verbose responses, hard-to-navigate long threads (especially on mobile), tab-model mismatches, and occasional hallucinations. For builders and SEOs, expect adoption friction until onboarding, scannability, and product differentiation improve.
UX Leadership & Practical Playbooks
MUXI: Embed UX in Product, Keep Accountability with UX Leaders
Nielsen Norman Group • October 17, 2025
The Managed-UX Integration (MUXI) model embeds UX practitioners in product teams for day-to-day work while maintaining a single reporting line to UX leadership, avoiding matrix-style dual accountability. This gives UX peer-level influence on discovery and prioritization, improves cross-functional communication, and reduces late rework; engineers and PMs collaborate with UX in standups, planning, and retros, while routing feedback through UX managers. The article outlines common pitfalls (manager visibility, accountability gaps, siloing, rigid resourcing) with mitigations and provides adoption guidance from startups to enterprises.
AI as Your UX Intern: Ship Research, Prototypes, and Content Faster
Smashing Magazine • October 17, 2025
Paul Boag proposes a pragmatic model, treating AI like an enthusiastic but inexperienced intern, and outlines tested workflows that accelerate UX research, design/dev, and content production. He details corpus-aware projects in ChatGPT/Claude; analytics Q&A via Microsoft Clarity Copilot and Triple Whale; functional prototypes with Relume and Bolt; rapid audits using Wevo Pulse, Baymard UX Ray, and Attention Insight; Midjourney+Gemini for precise imagery; and a stakeholder-bullets-to-AI-draft copy process. With strict oversight and source checks (not for final production code/design), teams can see roughly a 25–33% gain in throughput.
Turn UX Research Into Decisions: Triangulate Data, Lead With Story
Smashing Magazine • October 16, 2025
Facts rarely shift product direction on their own; this guide shows how to reconcile conflicting quantitative and qualitative findings through triangulation, then wrap them in an actionable narrative aligned to shared goals. In stakeholder forums, demonstrate validation, present concrete options with a clear path and impact, and humanize evidence (e.g., short customer clips) to build trust and commitment. For engineers and PMs, this approach accelerates prioritization, reduces rework, and clarifies next steps when analytics and usability tests disagree.