Sarah Drasner's LinkedIn Strategy — Sr…

💥 I did a drawing that breaks down Transformers in AI. Spent a good amount of time on this one, breaking down concepts in a way that someone new to the subject could come away with basic high-level u…

443293293 viral

AI Education · 6 months ago

✍️ Over break I finally added my code drawings to my site! I have series on Android, WASM, Rendering on each platform, and the beginning of JS Fundamentals. Enjoy! https://sarah.dev/projects

20410835 viral

Web Development · 6 months ago

✍️ Just finished another code drawing in my AI series, this one's about MCP in practice. Enjoy!

134201235 viral

AI Development · 6 months ago

🎊 Just wrapped up my 2025 reflections, and well, it was a wild year. I reflected on how the AI shift has changed my role at Google, and the way teams build. At the same time, I worked on keeping ba…

16517228 viral

AI in Software Engineering · 6 months ago

💥 Next up in visual explainers: the difference between GPU and CPU! This may be review for some but good to cover. For instance, understanding when you would make good use of GPU and layer promotio…

796920 viral

Performance Engineering · 8 months ago

💥 Last visual explainer for the week! The WASM compilation workflow, with a little zoom in to the details of C/C++. We've been leveraging WASM for deep performance, portability, and at times security…

814314 viral

WebAssembly · 8 months ago

Topics & Content Focus

Primary Topics

Visual-first AI/ML concept education for software engineers (e.g., transformers, MCP applied)
Code-as-art technical communication (drawings that teach complex systems simply)
Building an evergreen learning hub (curating multi-series dev content on a personal site)
Career/identity reflection inside Big Tech during the AI platform shift

Secondary Themes

Android engineering fundamentals
WebAssembly and cross-platform rendering concepts
JavaScript fundamentals scaffolding
Science/space curiosity and pop-culture-to-real-world connections (SETI, astronomy)

Industry Focus

Developer education and technical content creation
AI/ML engineering literacy for practitioners
Big Tech software engineering culture (Google-adjacent) during AI transformation

Content Categories

Visual explainers (illustrated breakdowns)
Educational micro-lessons for beginners/intermediates
Portfolio/resource announcements
Personal reflection and work-life balance narrative
Curiosity-driven science trivia

Performance Insights

Avg Engagement Rate: 175%
Performance Trend: STABLE

Best Performing Topics

AI concept breakdowns in a beginner-accessible format (especially transformers)
Highly shareable visual technical artifacts (drawings/diagrams)

Virality Signals

Artifact-led posts (drawings) behave like 'shareable learning objects'
Beginner-friendly positioning expands audience beyond specialists
Educational breakdowns create 'send-to-a-friend' utility, increasing shares
Owned-link hub posts convert attention into repeatable/evergreen discovery

Structure & Quality

Avg Length (Words)

Depth Level: HIGH
Expertise Level: ADVANCED
Uniqueness Score: 0.86/10

Common Hooks

Emoji-led 'ship' announcement (finished/made/published) signaling a new artifact
Effort + craft framing (implying depth and care put into the piece)
Beginner-friendly promise (positioning the content as accessible without being simplistic)
Curiosity reveal hook ('I just found out...') for non-work learning moments

Common Endings

Short appreciative close ('Enjoy!')
Utility framing ('I hope it's useful')
Soft invite for comparison/feedback (asking how others relate)
Link-out to a durable resource page

Value Delivery Methods

Transforms abstract AI concepts into visual mental models
Reduces intimidation for newcomers via high-level scaffolding
Creates durable resources (series + site hub) rather than one-off hot takes
Adds credibility through consistent craft and continuity (a 'series' narrative)

Formatting Style

Front-loaded emoji + short declarative opener
Line breaks for readability and scannability
Plain-language explanations of complex terms (no heavy jargon stacking)
Occasional direct external link to owned media (personal site)

Audience & Tone

Question Usage: YES
Response Rate: 0.3%

Detected Tone

Smart-casual technical educator
Curious engineer with a maker/artist identity
Humble expert (positions work as helpful, not superior)
Reflective practitioner (ties industry shifts to personal experience)
Semi-formal
First-person

Interaction Style

Learning-oriented responses (questions, clarifications, appreciation for clarity)
Peer validation of explanations (engineers sharing with teammates)
Reflective conversation when prompted (year-in-review / balance prompts)

Community Building Signals

Serial format (recurring 'AI series' and multi-topic drawing collections)
Invites participation through reflection prompts rather than polarizing opinions
Builds an owned destination (site) to deepen relationship beyond the feed

Writing Style Patterns

Content Strategy

Hook: Emoji-led 'ship' announcement (finished/made/published) sign…
Tone: semi-formal
CTA: Low-friction consumption CTA (view the drawing / c…

Writing style breakdown

💥 Just finished a new code drawing in my AI series, this one’s about how Transformers actually move information around in practice. Enjoy!

I spent a good amount of time on this one because it’s easy to memorize the words (attention, embeddings, heads) without ever getting a feel for what’s happening at each step. My goal was that someone new to the subject could come away with basic high-level understanding, but also that an experienced engineer could point at a box in the diagram and say ‘yep, that part matters’.

If you’ve only ever seen the “attention is all you need” headline version, the first surprise is that most of the work is just clean bookkeeping: taking tokens, turning them into vectors, and then doing the same few operations repeatedly, layer after layer, with slightly different learned weights.

One thing I wanted to make visually obvious is what changes across layers and what doesn’t. The tokens don’t magically become “meaning”, they become a set of numbers that are useful for predicting the next token, and the model keeps remixing those numbers through the same pattern of projections, dot products, and mixes.

The attention step is the star, but it’s also the easiest part to misunderstand if you never slow it down. You’re not “searching the internet” inside the model. You’re computing similarity between one vector and other vectors in the same context window, turning that into weights, and then using those weights to blend information forward. That’s it. Still incredibly powerful, but not mystical.
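Here is a minimal sketch of that one step, assuming a single query vector and tiny 2-dimensional toy data. The names `attend` and `softmax` and the numbers are made up for illustration; real models use learned projections and batched tensor math, which this leaves out.

```python
import math

def softmax(scores):
    """Turn raw similarity scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, keys, values):
    """Blend `values` using weights derived from query-key similarity."""
    scale = math.sqrt(len(query))
    # Similarity: scaled dot product between the query and each key.
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    # Blend: weighted sum of the value vectors, one dimension at a time.
    dim = len(values[0])
    blended = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, blended

# Toy context window with three positions (illustrative numbers only).
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]]
query = [1.0, 0.0]

weights, blended = attend(query, keys, values)
print(weights)
print(blended)
```

The keys that point the same way as the query get larger weights, so their values dominate the blend. Nothing here looks outside the three positions it was given.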

Here are a few tiny details that make a big difference when you’re trying to reason about it:

multi-head attention isn’t just “more attention”, it’s parallel subspaces that each get to specialize

masking is a constraint that shapes what can influence what, and it’s foundational for next-token prediction

the feed-forward block looks boring on paper, but it’s a huge chunk of the compute and capacity
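Two of these details are easy to check with a small sketch. It assumes a causal (look-back-only) mask and the common convention that the feed-forward hidden size is 4x the model size; both are assumptions for illustration, and `causal_weights` and `block_weight_counts` are hypothetical helper names. Biases and layer norms are ignored in the count.

```python
import math

NEG_INF = float("-inf")

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]  # exp(-inf) is 0.0
    total = sum(exps)
    return [e / total for e in exps]

def causal_weights(scores):
    """scores[i][j] is how much position i likes position j.
    Positions j > i are masked out so nothing can look ahead."""
    masked = [
        [s if j <= i else NEG_INF for j, s in enumerate(row)]
        for i, row in enumerate(scores)
    ]
    return [softmax(row) for row in masked]

def block_weight_counts(d_model, d_ff):
    """Rough weight counts for one block, ignoring biases and norms."""
    attention = 4 * d_model * d_model   # query, key, value, output projections
    feed_forward = 2 * d_model * d_ff   # expand to d_ff, project back
    return attention, feed_forward

# Equal raw scores everywhere, so only the mask shapes the result.
scores = [[0.0] * 3 for _ in range(3)]
weights = causal_weights(scores)
for row in weights:
    print(row)

attn_params, ffn_params = block_weight_counts(d_model=512, d_ff=4 * 512)
print(attn_params, ffn_params)
```

The first position can only attend to itself, the second splits its weight across two positions, and so on. Under the 4x assumption, the feed-forward weights come out to twice the attention weights in the same block.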

Also, I tried to highlight where shapes matter (sequence length vs hidden size) because a lot of confusion comes from not knowing what dimension you’re multiplying by what. Once you can track the shapes, the whole thing becomes less intimidating, and you can start asking better questions about performance and memory.
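One way to practice tracking shapes is to write them down without doing any math at all. This is a sketch under assumptions: `attention_shapes` and `num_elements` are hypothetical helpers, the sizes are arbitrary, and the batch dimension is left out.

```python
import math

def attention_shapes(seq_len, d_model, n_heads):
    """Shapes of the main tensors in one multi-head attention layer."""
    if d_model % n_heads != 0:
        raise ValueError("d_model must divide evenly across heads")
    d_head = d_model // n_heads  # each head works in its own smaller subspace
    return {
        "input": (seq_len, d_model),
        "per_head_q_k_v": (n_heads, seq_len, d_head),
        "scores": (n_heads, seq_len, seq_len),  # every position vs every position
        "per_head_output": (n_heads, seq_len, d_head),
        "output": (seq_len, d_model),  # heads concatenated back together
    }

def num_elements(shape):
    return math.prod(shape)

short = attention_shapes(seq_len=8, d_model=64, n_heads=4)
long = attention_shapes(seq_len=16, d_model=64, n_heads=4)

for name, shape in short.items():
    print(name, shape)

# Doubling the sequence length doubles the output size
# but quadruples the score matrix.
print(num_elements(long["output"]) // num_elements(short["output"]))
print(num_elements(long["scores"]) // num_elements(short["scores"]))
```

The score matrix is the only entry that contains `seq_len` twice, which is the shape-level reason long contexts get expensive in memory.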

If you’re coming from mobile or web performance work, this ends up feeling familiar: you’re moving and transforming big buffers of numbers, and the system is fast when you keep things batched, predictable, and hardware-friendly. That’s part of why GPU vs CPU intuition is useful here, even if you never write CUDA.

I’ve also been thinking a lot about the “practical” side lately: what you do when you want to apply this to a product. The model architecture is one piece, but the surrounding workflow (data, evaluation, guardrails, latency budgets) is where most real teams spend their time. It’s similar to WASM in that way: the tech is exciting, but the leverage shows up when it fits into a real pipeline and stays maintainable.

If you read this and think “ok, but when would I ever need to know this level of detail?”, you probably don’t for day-to-day API usage. But it’s extremely helpful when you’re debugging odd outputs, trying to understand why a model is slow, or evaluating tradeoffs between context length, model size, and quality.

I’d love to hear what’s been most confusing (or most surprising) as you’ve learned this stuff.

https://sarah.dev/projects

Sarah Drasner
