Skip to content
View MaxwellCalkin's full-sized avatar
🌎
Seeking employment
🌎
Seeking employment

Block or report MaxwellCalkin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MaxwellCalkin/README.md

Maxwell Calkin β€” Passionate Alignment Solution Finder

rotating creed

website linkedin x email


A life is good to the degree that it generates and propagates goodness β€” within the self, within the family, within the community, within humanity, and within the universe.

I'm Max. I rose to the top of the music industry as a bassist, won my way through the depths of competitive gaming, and have spent the last few years preparing for the work that actually matters to me: helping ensure that the most powerful technology humanity has ever built β€” intelligence itself β€” is shaped toward what is genuinely life-giving.

I build open-source tools for measuring whether AI systems are safe β€” and I believe most alignment work is too narrow because it only looks at the problem from one angle. Alignment is a multi-dimensional problem. A model that behaves correctly on benchmarks may still harbor misaligned internal representations. A model with well-understood internals may still cause harm when embedded in poorly designed institutions. You can't solve it by looking at behavior alone, or internals alone, or governance alone. You need all four perspectives working together.

This is the inflection point. I'm not perfectly ready. I'm ready enough to begin.


The Three Pillars

The Three Pillars β€” Health Β· Family Β· Mission

Health is the instrument. Family is the center. Mission is the call. Everything else is downstream.


The Arc

Music β†’ Games β†’ Software β†’ AI Safety

Each chapter trained something the next one needed. Music taught me what mastery feels like in the body. Gaming taught me how to compete at depth β€” and what hollowness feels like when contribution is missing. Software has been the bridge: the discipline of building real things in the real world. The next chapter is the one I was preparing for the whole time.


Currently

What I'm building right now


The Four Faces of Alignment

I believe alignment fails when any one of these dimensions is neglected. My open-source work is an attempt to build tools that bridge them.

Inside the model

What is actually happening computationally? Mechanistic interpretability, circuit discovery, representation analysis. Not just the behavior β€” the actual structures producing it.

Outside the model

What does it actually do under pressure? Sycophancy, deception, power-seeking, corrigibility β€” measured rigorously, reproducibly, across conditions. Not spot-checked.

Around the model

What shared understanding do we bring? The culture, norms, and meaning-making of the teams operating AI. Technical safety without shared why is brittle.

Beyond the model

What institutions hold this up? Oversight mechanisms, deployment infrastructure, governance, feedback loops that work even when individual components fail.


Open Source β€” Tools That Bridge the Dimensions

alignment-evals alignment-probes
interpretability-toolkit prompt-injection-benchmark
llm-circuit-visualizer

Open source because safety research behind closed doors doesn't make anyone safer.


How I Decide

When choices get hard, I run them through this β€” straight out of Article X of my personal constitution. (GitHub renders this as a live, zoomable diagram.)

flowchart TD
    Start([⟁ A choice arrives]):::start
    Q1{1 Β· Does this protect or<br/>damage <b>health</b>?}:::q
    Q2{2 Β· Does this strengthen or<br/>weaken my <b>family</b>?}:::q
    Q3{3 Β· Does this serve or<br/>betray my <b>mission</b>?}:::q
    Q4{4 Β· Is it <b>true</b>?}:::q
    Q5{5 Β· Is it <b>courageous</b>?}:::q
    Q6{6 Β· Is it <b>beautiful</b>?}:::q
    Q7{7 Β· Does it propagate<br/>genuine <b>goodness</b> outward?}:::q
    Yes([β—†&nbsp;Build it. Ship it. Live it.]):::yes
    No([βœ•&nbsp;Reject β€” no matter what it offers.]):::no

    Start --> Q1
    Q1 -- protects --> Q2
    Q2 -- strengthens --> Q3
    Q3 -- serves --> Q4
    Q4 -- yes --> Q5
    Q5 -- yes --> Q6
    Q6 -- yes --> Q7
    Q7 -- yes --> Yes
    Q1 -- damages --> No
    Q2 -- weakens --> No
    Q3 -- betrays --> No
    Q4 -- no --> No
    Q5 -- no --> No
    Q6 -- no --> No
    Q7 -- no --> No

    classDef start fill:#0d1326,stroke:#fbbf24,stroke-width:2px,color:#fbbf24
    classDef q     fill:#070a16,stroke:#22d3ee,stroke-width:1.5px,color:#e2e8f0
    classDef yes   fill:#065f46,stroke:#10b981,stroke-width:2px,color:#ecfdf5
    classDef no    fill:#3b0a0a,stroke:#ef4444,stroke-width:2px,color:#fee2e2
Loading

Tools of the Trade

Python PyTorch HuggingFace TypeScript React CUDA Jupyter Docker Linux Claude


Signal in the Noise

GitHub stats Top languages

GitHub streak

Activity graph


The Manifesto, In Brief

Open β€” the principles I'm trying to live by

I shall act as though responsibility is mine. Where there is confusion, I will seek clarity. Where there is fear, I will move toward truth. Where there is difficulty, I will train. Where there is possibility, I will build.

Reality is not negotiable. I will seek what is true even when it is inconvenient to my ego, my plans, or my desires. What is actually happening? What is the evidence? What am I avoiding? What would courage do here?

Technology is power, and power must be governed by conscience. My work with AI shall be directed toward alignment with the deepest good available to us β€” not merely convenience, not merely profit, not merely obedience, but the widening and deepening of what is genuinely life-giving.

Strength without gentleness becomes harshness. Clarity without love becomes coldness. Ambition without tenderness becomes damage. I will be strong enough to be kind.

I will not live accidentally. I will cultivate a strong body, a clear mind, a loving home, a worthy mission, and a reverent spirit. I will use my gifts boldly, but not blindly. I will build with conscience. I will love with presence. I will pursue excellence without losing my soul.

β€” excerpts from a personal constitution I rewrite each year

Open β€” what I won't build

I will resist participating in the creation of systems that addict, diminish, manipulate, degrade, or spiritually flatten human beings. I will not give my talent lightly to machines of dehumanization.

If you're working on the opposite of that β€” on intelligence shaped toward wisdom, on tools that make humans more sovereign rather than less, on alignment that takes the depth of human life seriously β€” I want to know you.

Open β€” the trajectory in one line

Bass β†’ stages β†’ studios β†’ Broadway β†’ Radio City β†’ ceiling β†’ Valorant β†’ Fortnite β†’ LoL β†’ diamond+ β†’ hollowness β†’ Python β†’ ML β†’ aerospace-grade safety work β†’ fatherhood Γ— 2 β†’ and the inflection point I've been training for the whole time.


Let's Build Something That Matters

website Β  email Β  linkedin Β  x

Reach out if you're working on alignment, interpretability, governance, or anything that makes intelligence wiser and humans more sovereign. I read everything thoughtful.


"I will not waste my best years on work I know to be trivial, corrosive, or misaligned."

profile views

Pinned Loading

  1. llm-circuit-visualizer llm-circuit-visualizer Public

    Interactive visualization for exploring internal circuits, attention patterns, and activation flows in language models

    TypeScript 1

  2. alignment-evals alignment-evals Public

    Rigorous framework for evaluating AI alignment properties β€” sycophancy, corrigibility, deception, goal stability, and power-seeking β€” with statistical confidence intervals

    Python

  3. alignment-probes alignment-probes Public

    Systematic probing toolkit for alignment-relevant LLM behaviors: sycophancy, sandbagging, power-seeking, deceptive alignment, and corrigibility failures

    Python

  4. N2YO-MCP N2YO-MCP Public

    MCP server for querying N2YO satellite catalog β€” real-time tracking, TLE data, and visual pass predictions

    TypeScript

  5. prompt-injection-benchmark prompt-injection-benchmark Public

    Benchmark suite for LLM robustness to prompt injection β€” 6 attack categories, 14+ vectors, multi-dimensional scoring balancing resistance with helpfulness

    Python

  6. autonomous-autonomy autonomous-autonomy Public

    Autonomous task orchestration plugin for Claude Code & Cowork. No API key required.

    Shell