On nuclear weapons policy, Biden beats Trump

There are a lot of uncertainties about the result of a nuclear war, but one thing seems clear: it would be bad. How bad depends on things like who we go to war with, the number of nuclear weapons used, weather patterns, and so on. Wikipedia documents estimates the US government produced during the Cold War saying that a nuclear war with the Soviet Union could kill 70% of all Americans. My wife and I have two kids. If 70% of us die, that’s three out of the four people in our family. My kids, my wife, gone.

Since the Cold War, the number of stockpiled nuclear weapons has been reduced by about 85%. That said, there are still over ten thousand nuclear weapons in the world, more than half of them operated by countries other than the US. Things would still be very bad if we got into a nuclear war.

In a lot of ways, I see nuclear war deterrence as one of the more important responsibilities of a US president. It doesn’t matter what other policies they put in place if they get us involved in a war that kills 70% of Americans. I want a president who will continue the drawdown of current nuclear stockpiles, prevent other countries from continuing their nuclear weapons programs, and provide stability to the international environment. That’s one part of why I’m going to vote for Biden.

As President, Donald Trump has spent the last four years making our country much less safe from nuclear war. It can be hard to evaluate some of the nuclear decisions his administration has made, given the international landscape. Unilateral disarmament seems likely to make the US less safe, and modernizing our nuclear arsenal improves reliability and safety. Because of that, we have to look at all of the administration’s decisions as a whole to determine if they’re improving the security of Americans.

The Trump administration has:

  • actively called for the development of new nuclear weapons (both in type and in quantity).
  • focused on increasing capabilities to match Russia and China, instead of negotiating for further international drawdown.
  • broadened the definition of “extreme circumstances” under which nuclear weapons could be used.
  • pulled out of the INF Treaty. This was done because Russia wasn’t complying with it, but the treaty governed more than just the US and Russia. Pulling out reduces leverage on other nuclear powers governed by the treaty, as well as on countries not officially party to it but who abided by it (like Germany and Slovakia).
  • introduced development of low-yield nuclear weapons.
  • looks likely not to extend the New START treaty, which places limits on the number of nuclear weapons Russia and the US can deploy. Russia is currently complying with that treaty.

In addition to leaving arms control treaties and calling for the development of new nuclear weapons, the Trump administration has also completely failed to rein in North Korea and Iran.

North Korea currently has around 30 nuclear weapons and can produce something like 6 more weapons each year. Several years ago, North Korea offered to start reducing nuclear capabilities in exchange for lifting of sanctions, but Trump walked away. A couple months ago, North Korea said that negotiating with Trump had been “a nightmare” and that they were going to increase their nuclear weapons stockpile.

Iran’s nuclear weapons development had, prior to Trump’s election, been governed by the JCPOA. The JCPOA was an agreement between Iran, the US, and several other nations. It governed how Iran was to eliminate its stockpile of enriched uranium. Iran would still be able to build nuclear power plants, but not to enrich enough uranium to build nuclear weapons. In 2018, Trump’s administration withdrew from that agreement over the protest of every other member (including Iran). After the US withdrew, Iran and the other countries in the JCPOA attempted to continue abiding by the agreement. The US’s withdrawal from the JCPOA led to a series of skirmishes that culminated in the US killing an Iranian major general. In response to that killing, Iran said that it would no longer abide by the JCPOA at all. Through these actions, the Trump administration increased the likelihood of Iran getting nuclear weapons.

Let’s contrast that with Biden. Biden wants to extend the New START treaty. He wants to draw down nuclear stockpiles. We have evidence from his decades in politics of him working to reduce the likelihood of nuclear war. Biden has also released a letter describing his past approach to nuclear disarmament treaties.

This quote in particular is why I think Biden will manage our nuclear policy well:

“Despite what some extreme voices argued at the time, the arms control agreements we hammered out with the Soviets were not concessions to an enemy or signs of weakness in the United States.
They were a carefully constructed barrier between the American people and total annihilation.”

Joe Biden

Bad Math: Tax Plan Tweet Edition

It’s pretty common these days to talk about how much Jeff Bezos could buy for us. Facebook and Twitter are both full of people saying things like this:

The idea being that if Jeff Bezos just paid his share of our country’s upkeep, we could have a utopia.

There’s a problem with this idea, but it’s not a political problem. It’s a math problem. It’s a problem with what people mean when they say “wealth” or “taxes”.

He doesn’t have the cash

First things first: Bezos has somewhere around $200 billion now (a quarter of his and MacKenzie Scott’s combined wealth went to her in the divorce). That wealth is not cash in a bank account; it’s Amazon stock that he owns. There’s no way for him to spend that much money, because he doesn’t have it as money.

If we wanted to take our 4.7% of his wealth, we’d have to start by selling about $10B worth of his Amazon stock. Selling that much stock would have an impact on Amazon’s stock price, so we’d likely have to sell more shares than you’d expect to reach that amount.

Bezos spends around $1 billion a year on his new space company, Blue Origin. He makes a big deal about that, I think, in large part because he has to justify selling that much stock every year. If Bezos just started selling Amazon stock for no reason, especially that amount, the price of the stock would plummet and he’d lose a lot of his wealth. People would assume that he knew something bad about Amazon’s business.

Now if he were selling the stock to pay for College For All, that would be a pretty good signal to the market that Amazon was still a stable company. The market price for the stock likely wouldn’t drop much from our $10B sale due to a lack of confidence.

I was originally going to write something here about $10B being a lot to sell on the stock market, but it turns out that’s not true. The NASDAQ (where Amazon is listed) clears over $100B per day. Bezos could probably find someone to buy his $10B of stock pretty easily.

Normal US Taxes don’t work like that

Let’s say that we decide it’s a good idea. That $10B that people are paying Bezos for his stock wasn’t doing us any good before (it was probably just wrapped up in some other big tech stock). We’re going to have Bezos fund our College.

The thing is, we can’t do this by just taxing Bezos normally. The US has an income tax. That’s a tax on income. Bezos actually doesn’t have much income; he just has assets that are worth a lot. This means that if President Biden (I hope that’s who we have next year!) says that 2021 will have a super high tax on everyone named Jeff Bezos, we wouldn’t actually get much money.

Jeff Bezos doesn’t pay any taxes on his stock until he sells it, and then he only pays taxes on the appreciation. Though given that he got the stock when it was likely worth a single dollar, the tax will apply to pretty much the entire amount he’s selling.

So if Bezos did sell $10B in stock next year, then we’d only be taxing that $10B. The current capital gains tax would be 20% for him, so we’d only get $2B from that sale.

In order to fix this, we’d need to tax wealth, not income. We could absolutely do that, and Piketty has advocated for that kind of tax to help deal with some of the social problems that we’re facing now. But I don’t really think that Jack Califano, from our original tweet up above, is thinking about it like that. Obviously I don’t know what his understanding of our current tax situation is, but if he were proposing something as radical as a wealth tax I’d have expected him to play it up.

Math vs. Politics

Enough of this tax bracket, stocks-vs-cash nonsense. We want our College For All; let’s get Bezos to pay for it. We’re going to pass a law that says Bezos has to sell enough stock every year to provide 4.7% of his wealth to the US government, effective immediately.

Tomorrow, Bezos sells enough stock to get a grand total of $10B from it (remember that his current wealth is around $200B, so 4.7% is slightly less than $10B). Now can we all have free college?

No, no we can’t. Because Bernie Sanders says that he needs $48B per year to pay for his College For All plan. I have no idea where Califano got the 4.7% number, but it makes no sense. Bezos could only pay for College For All for four years, even if we gave him a wealth tax of $50B/yr.
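Here’s that back-of-the-envelope arithmetic in one place, as a rough Python sketch using the approximate figures quoted above (not precise tax math):

```python
# Rough numbers from the discussion above (all approximate).
bezos_wealth = 200e9              # ~$200B, almost all of it Amazon stock
tweet_fraction = 0.047            # the 4.7% figure from the tweet
capital_gains_rate = 0.20         # top long-term capital gains rate
college_for_all_per_year = 48e9   # Sanders's ~$48B/yr estimate

stock_sold = bezos_wealth * tweet_fraction
print(f"4.7% of his wealth: about ${stock_sold / 1e9:.1f}B of stock sold")

# An income-style tax only touches the sale, and only at the capital gains rate.
tax_collected = stock_sold * capital_gains_rate
print(f"Capital gains tax on that sale: about ${tax_collected / 1e9:.1f}B")

# Even confiscating the entire fortune only funds College For All briefly.
years_funded = bezos_wealth / college_for_all_per_year
print(f"Whole fortune covers about {years_funded:.1f} years of College For All")
```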

Bezos has a lot of money, but he doesn’t have that much.

Why

I’m sympathetic to Califano’s argument. It would be really nice for Bezos to pay to improve America. Hell, it would be nice for him to pay to improve Seattle (more directly than he’s doing by building nice buildings and bringing in employees who spend money). People have been trying to get Amazon to pay more Seattle taxes for a long time, and I hope that they’ve finally managed it.

I’m a bit more reluctant about the idea of a wealth tax, but I could be convinced by the right arguments and experiments.

I want free college (and free healthcare, and affordable housing) for everyone in America. I want it so bad that I’m willing to pay attention to reality to find ways to get it. This is why I get so frustrated when people start talking about having Bezos pay for everything.

Bezos has a lot of money, but he doesn’t have that much. Simple “tax the rich” schemes also generally won’t access most of his money, so saying that we should tax Bezos without specifying that you want to totally change the way taxes are done in America seems disingenuous.

I want people to work towards a better world together. To do that, we all need to know what direction a better world is. If we just ignore the math, I’m afraid of where we’ll actually end up.

Life Events Are Community Events

I read Achtung Baby, by Sara Zaske, when my kids were about six months old. The book is part travelog, part parenting advice, by a woman who moved her family from the US to Germany. She raised kids for five years there, came back to the states, and wrote a book on the difference in parenting cultures.

One of the biggest things that stuck with me about this book was the discussion of the Einschulung, and of life-milestone parties more generally. An Einschulung is a party thrown for a kid when they first go off to school, and it is a big deal. Everyone is invited: family, extended family, friends, neighbors. Everyone comes out to celebrate and acknowledge this major milestone for the kid.

The Einschulung is compared with two other major parties that Germans have in their lifetimes: their Jugendweihe celebrating entrance into young adulthood and their wedding. These are the three parties that define the arc of many a German’s life, and they help tie the person into their community.

Community

What struck me most about the Einschulung, Jugendweihe, and wedding is how they were seen in Germany. Or perhaps how Sara Zaske saw them. These were parties that were in honor of a given person, but not always for them. Often the parties were for the community that person belonged to as much as they were for the person themselves.

This really appealed to me when I was reading it as a new parent. I still had fresh memories of my own wedding two years before, and how my wedding had changed my views on weddings overall. Prior to my own wedding, I’d often viewed wedding invites with distaste. When some friends got married, I’d feel obligated to go in order to show my support, but I often felt pretty isolated at weddings. I didn’t know how to interact with the event, or the people at the event.

When planning our wedding, my wife and I wanted to really emphasize the family and community aspects. We ended up doing a lot of non-standard wedding things, but the closer we got to the wedding, the less those seemed to matter to me compared to having people there to witness our love. I think several of our plans for our wedding were amazing, and a couple fell kind of flat, but the thing that was most meaningful to me was just having so much family and so many friends there with us as we said we loved each other.

I really understood then that our wedding wasn’t just for us, it was for our community as well.

This was a pretty new experience for me. I was a lonely kid, and had a lot of trouble making friends. It wasn’t until I was in my mid-twenties that I understood that it was possible to have a social interaction with someone new that wasn’t emotionally painful. My wedding was the final push I needed to see community in a different way.

Reading about Einschulung in Achtung Baby got me wondering if I could have had a much more socially comfortable adolescence if I were raised in a culture that emphasized community in a more formal way. It’s really too bad the US is the way that it is, but maybe I could do something like that for my kids anyway.

Other states

I was pretty surprised to discover that, at least in one US state, there are parties like Einschulung. Several Michiganders that I know recently told me about the tradition of high school graduation parties there.

In Michigan, everyone apparently throws a high school graduation party. You invite your family, your friends, your friends’ families, your neighbors, your parents’ coworkers. It’s a regular community ritual. It’s such a big deal that high schoolers carefully schedule their graduation parties to not overlap with their classmates, so that everyone can go to everyone else’s parties.

I was immediately excited about this, and wondered whether Michiganders felt less isolated and more tied in to their community than I had as a high schooler.

Not Always Good

It turns out that one of the people who was telling me about this tradition had hated their graduation party. They felt that it was a party for their parents, not for them, and that they were being forced to go. In fact, the way they described it reminded me a lot of how I felt about other people’s weddings before I’d had my own.

This raises some interesting questions. Why hadn’t I felt like I was a part of a community when I was going to other people’s weddings before I had my own? Why did my friend wish they hadn’t had a high school graduation party, instead of feeling like it was their community supporting them?

It’s hard to speculate on my own about what someone else was feeling at some party, but I think my own lack of community feeling was related to a sense of support going one way only. I went to weddings of friends and families because I wanted to be there for them, which is actually a large part of what I consider to be a good community. But I didn’t feel communal about it; I felt like it was an obligation. I was doing it to avoid being cast out of my community, not because I wanted to build a stronger community. I don’t think I even had a sense that community could benefit me in any way.

This is not to say that community never benefited me before I got married (though the amount of community support I received after my wedding was mind boggling). Looking back, I see many times when my community was supporting me throughout childhood and young adulthood. My experience of that support at the time was confused though. People would do something nice for me, and I wouldn’t understand why. I’d feel like I had to pay them back immediately, or that they were condescending to me. It also wasn’t clear to me if someone was “in my community” or not (a question not at all helped by the introduction of Facebook).

This makes me wonder if my wedding was a turning point because it really shoved my face in the idea that other people were helping me because they wanted me to be happy. Planning and throwing a wedding is not an easy thing, and there’s often a ton of family stress on top of that. We never would have been able to throw the wedding that we did without the help of a huge number of people.

I think this is why the other two parties described in Achtung Baby sounded so good to me. If I’d had the idea that other people might genuinely want to help me when I was a kid, or that they might actively want me to be a part of their community, then I would have had a much happier childhood. The Einschulung seems like a very stark demonstration of that fact to a kid when they could most use it: right before joining a huge group of people they’ve never met before to do something totally new.

Celebrating With My Kids

Thinking about the reasons behind community parties suggests some ways to do similar things for my kids as they age:

  1. throw parties at milestones that are a Schelling Point for the community that you’re in
  2. let people help with party set up in a way that’s visible to others
  3. invite everyone that’s in your community, but not everyone you know
  4. make sure the party will actually be enjoyable for the kids

I think point 4 is pretty important. I really enjoyed my wedding, and I suspect I’d be feeling differently about community if I hadn’t (even if my community had been exactly as helpful). I also suspect that this was what went wrong for my friend who hated their own graduation party. If the party had been structured more to their liking, then they could have recognized the community aspects of it more easily.

I also think point 1, about choosing milestones that make sense for your community, is important. This makes me think that birthdays matter more than I had previously given them credit for, and going forward I plan to place more importance on my kids’ birthdays and on my own.

Knowledge Bootstrapping Experiment 2

Last month, I started experimenting with AcesoUnderGlass’s Knowledge Bootstrapping method. I started out with a small project learning some facts about radiation and electronics. That worked well, so I then went to learn about something a little less straightforward: GPT-3’s likely impact on AI safety.

I have to be honest, selecting this topic may have been a bit of a mistake. I was seeing a lot of headlines and posts about GPT-3, and I had a pretty immediate emotional reaction of “GPT-3 isn’t a big deal and people don’t know what they’re talking about.”

I had a lot of fun writing this post, but I’m less happy with the final product than I expected to be. The thing is, I had that original emotional reaction to a bunch of headlines. I literally hadn’t read the articles before I had decided to try to rebut them. When I went to read the articles themselves, they were different from what lots of the headlines and Twitter hype had implied (shocking, I know). As I read more about GPT-3, I ended up changing my mind several times about my thesis. The post I wrote ended up very different from the one I had planned.

In a lot of ways, this is great. I learned a lot about the current state of AGI research, and some of the current major players in AI safety. Deciding (before doing any research) to write a post about the topic is what gave me the motivation to actually read all those articles, and then read the background, and then read even more background. I haven’t really kept up with these things for the past three years, so things had changed a lot since I had last looked into it. This project gave me the push I needed to finally learn how the transformer architecture really worked, as well as uncovering some of what DeepMind has been doing. I hadn’t even known that MuZero existed before starting on this project.

Motivation

All of this leaves me still excited about the Knowledge Bootstrapping method, but I’m also noticing that keeping my motivation up for a research project is hard. When I have a blog post that I’m excited about writing, it’s easy to put in effort to learn and write. When someone is wrong on the internet, of course I’ll be burning to write about it. The more I wrote my post, the clearer it became that I was the one who was wrong on the internet.

That started sapping my motivation to write, even though the things that I was writing changed enough that I still stand by their accuracy.

As I closed in on answering most of the questions that I had come up with in my original question decomposition, I had such a different understanding of the topic that I realized I had an enormous number of new questions. I answered those questions, and then the questions that followed from that. Eventually, I came to the point where I thought I had a decent stance on my original safety question. At that point, I also realized how much detail there was to making a decent prediction about GPT-3’s implications for future safe AI. And much of that detail was (and is) still unknown to me.

As I began to realize how much I’d have to research in order to do the topic justice, I could feel my excitement fade. Given that I’ve had a very stuttering relationship with this blog over the past decade or so, I could recognize that if I let my excitement about the topic drive me into perfectionism I wouldn’t post anything. I also recognized that if I didn’t post that blog entry, I’d feel like a failure and there would be a long drought in me posting anything at all.

I decided that I had enough for a high level post and wrote it, but I ended up writing a more milquetoast thesis than I had originally intended.

The most important thing for me in any kind of learning project is keeping up motivation. For work related topics, there’s enough external motivation that I can power my way to a solution one way or another. For personal projects, even personal projects that could help me out at work, I need to stay interested throughout the process to have any hope of success.

My first experience of Knowledge Bootstrapping showed me that an emphasis on questions could help me keep my motivation up. By keeping my thoughts close to my original questions, it was easy to remember why I was doing the thing. This second experience of the process showed me that the blog output itself is still a big part of my motivation, and I’ll need to plan around that in future projects.

Question Decomposition

I still view question decomposition as one of the more important components of Knowledge Bootstrapping. My original project had a very straightforward set of questions, and after I decomposed them it was easy to pull answers out of the sources I found. The hardest part of my Radiation+Electronics mini-project was finding sources that went deep enough to truly answer my questions.

The GPT-3/AI-safety mini-project was much different. When I first started decomposing questions (before I started doing much reading) I had a ton of trouble figuring out what my primary question even was. Then I had trouble breaking that down into questions that reading books/papers could answer. I did my best to decompose the questions, then went and tried to answer them. That helped me orient myself to the field again, and when I came back to try answering my original questions I could clearly see some better question decompositions.

I ended up iterating this process several times, and I think for difficult or new topics this is probably crucial.

Elizabeth says that if you’re not sure what notes to take when you’re reading a source, you should go look at your questions again. That isn’t great advice if you’re having trouble with the decomposition step itself. I tried to address this by emphasizing the difference between what I was reading and what I already thought, and writing that down. That also helped me to figure out what my questions were, as I would sometimes realize I disagreed with something but be uncertain why.

Pre-Read, Brain-Dump

Elizabeth emphasizes doing a brain dump of what you think about any given source before you really start reading it. I didn’t do this very much in my first mini-project, but I did it for every source in this project.

I now think that my radiation+electronics mini-project didn’t need much of the brain-dump step because I’d been thinking about the topic on and off for several years. I pretty much knew what I already knew. I also had a mindset that was focused on fact acquisition and model building, but I didn’t have to worry much about conflicting information or exaggeration.

With GPT-3 and AI safety, there’s no settled science about the topic. Everything is new, so people are all very excited. That meant that I had to be more careful with what sources I was using. I also didn’t have a good handle on what questions I was trying to answer at the beginning, which meant that it was harder for me to notice what was important about each source’s content.

This is where the pre-read brain-dump really shines. Before I did an in-depth read of any source, I’d free-write for a while about what I expected the source to say. I’d also write about what I personally thought about the expected content of the source. Then when I went to read the source, it was easy for me to notice myself being surprised. That surprise (or disagreement, or whatever) was the trailhead for the questions that I should have been asking at the beginning.

Interestingly, this seems to be the exact opposite of the reason that Elizabeth does it. She talks about how, if she didn’t get her brain dump on paper, those thoughts would be floating around her head interrupting her reading process.

When I don’t do the brain dump, I don’t have any of those thoughts floating around my head as I read. That makes it really hard for what I read to latch on to what I already know. I’ll sometimes read something and feel like I understand it, but then be unable to recall it even ten minutes (or ten pages) later. By brain-dumping, I prime my mind with all those thoughts so that I’m actually engaging with and thinking about the content in the source.

(Though Elizabeth also talks about this a bit here, where she says breaking the flow of a book is a sign of engagement).

In the past I’ve tried to address this with Anki. When I was reading textbooks cover to cover, I’d create flash cards of the major things I learned. This was generally very effective, but I’ve ended up with a truly enormous number of cards. I haven’t kept up on my Anki training for the past couple of weeks, and I now have hundreds of cards in my backlog. It’s also pretty slow to do this, and really takes me out of the flow of reading.

A good future workflow might be something more like:

  1. question decomposition
  2. source selection
  3. brain-dump
  4. read and note take
  5. post-process notes and write blog post
  6. generate more focused Anki cards

Tools

One of the things that held me back during my first Knowledge Bootstrapping mini-project was being unfamiliar with some of the markdown features that Elizabeth makes common use of. Because of that, my writing project was slower and more awkward than I think is Elizabeth’s experience.

I took some time (really just ten minutes or so) to look up some of the markdown features that I had wanted to use in my first project. Using those made this second project a lot easier. I was a lot more comfortable drafting the post and referring to each source. I’m beginning to see how the process itself could become more natural and get in the way less.

I still feel pretty curious about Elizabeth’s actual workflow during note-taking and synthesis though. She described it at a high level in her post, but I’m more interested in the nitty-gritty at this point. What does she make into a tag, and why? How does she manage her tags? Does she really use that many of them?

Math Puzzle: 2D planes in N-D spaces

I was playing around with robot localization the other day, and realized that the angular degrees of freedom a robot has follow an interesting pattern. A robot that can just move around a floor has only one degree of angular freedom; it can rotate to the left or right. A flying robot, on the other hand, has three angular degrees of freedom: it can pitch, roll, or yaw.

That made me curious how the number of angular degrees of freedom is related to the number of spatial degrees of freedom. If we could build a robot that could move in the 4th spatial dimension, how would it rotate?

A robot that can only move along one line is the degenerate case. This one dimensional bot has a single binary degree of freedom in rotation. It can point either forward or backwards.

A robot that can move in a single plane, like a roomba, has a single angular degree of freedom in rotation. It can rotate however it wants as long as it remains parallel to the floor.

A robot moving freely in 3-space has the three traditional angular degrees of freedom: roll, pitch, and yaw.

But what about a robot in arbitrary spatial dimensions? What would a robot’s degrees of freedom look like in 4-D space, or n-D space?

As usual, the number of linear degrees of freedom equals the number of dimensions of the space, so in n-D space there are n linear degrees of freedom.

At first, I naively thought that the number of rotations would also equal the number of axes, one rotation around each axis. After all, in three spatial dimensions there’s one orthogonal axis for each plane the robot rotates in. But for higher-dimensional cases this doesn’t quite follow. The “plane” defined by being orthogonal to one axis is actually a hyperplane: it’s made up of all (n-1) dimensions at a right angle to the axis of rotation. If our robot sensors are still just 2D or 1D devices, then we probably want to be more precise about which plane they’re rotating in.

What we’re interested in for robot rotations is (I think) the number of 2D planes the robot could rotate in. Just by happenstance, the number of such planes in 3-space is the same as the number of axes. But for higher-dimensional spaces we actually have to define each 2D plane by choosing two orthogonal vectors and taking their span. We can find a sufficient set of planes by choosing pairs of orthogonal vectors that are each aligned with an axis of the space.

So the number of 2D planes of rotation in an n-D space is \binom{n}{2} = \frac{n!}{2!\,(n-2)!} = \frac{n(n-1)}{2}. Here’s a list of the number of 2D planes of rotation for a few different numbers of spatial dimensions (with a quick code check after the list):

  • 2D = 1 plane of rotation
  • 3D = 3 planes of rotation
  • 4D = 6 planes of rotation
  • 5D = 10 planes of rotation
  • and so on
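The pattern above is just the binomial coefficient, which a few lines of Python can confirm:

```python
from math import comb

# Each plane of rotation is spanned by a pair of orthogonal axis directions,
# so count the ways to choose 2 axes out of n.
for n in range(2, 7):
    print(f"{n}D space: {comb(n, 2)} planes of rotation")
```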

GPT-3, MuZero, and AI Safety

Edited 2020/08/31 to remove an erroneous RNN comment.

I spent about six months in middle school being obsessed with The Secret Life of Dilly McBean by Dorothy Haas. It’s a book about an orphan whose mad scientist parents gave him super magnetism powers before they died. When the book opens, he’s been adopted and moved into a new town. Many magnetism adventures follow, including the appearance of some shadowy spy figures.

After many exasperating events, it’s finally revealed that (spoiler) the horrible no-good Dr. Keenwit is trying to take over the world. How, you might ask? By feeding all worldly knowledge into a computer. Dr. Keenwit would then be able to ask the computer how to take over the world, and the computer would tell him what to do. The shadowy spy figures were out collecting training data for the computer.

Middle school me got a great dose of schadenfreude at the final scene where Dilly runs through the rooms and rooms of magnetic tape drives, wiping all of Dr. Keenwit’s precious data with his magnetism powers and saving the world from a computer-aided dictator.

GPT-3

Dr. Keenwit would love GPT-3. It’s a transformer network that was trained on an enormous amount of online text. Given the way the internet works these days, you could round it off to having been trained on all worldly knowledge. If Dr. Keenwit had gotten his evil hands on GPT-3, would even Dilly McBean have been able to save us?

The internet has been flooded with examples of what GPT-3 can (and can’t) do. Kaj Sotala is cataloging a lot of the more interesting experiments, but a few of the biggest results are:

How is it doing those things? Using all of that source text, GPT-3 was trained to predict new text based on whatever came before it. If you give it the first half of a sentence, it will give you the second half. If you ask it a question, it will give you the answer. While it’s technically just a text prediction engine, various forms of text prediction are the same as conversation. It’s able to answer questions about history, geography, economics, whatever you want.
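GPT-3 itself is only reachable through OpenAI’s API, but the same “predict the next bit of text” idea can be poked at with the much smaller, publicly available GPT-2 via Hugging Face’s transformers library (the answers will be far worse than GPT-3’s; this is just to show the mechanic):

```python
# GPT-2 as a small, freely available stand-in for the same text-prediction idea.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

# Framing a question as a prompt turns "predict the continuation" into "answer".
prompt = "Q: What is the capital of France?\nA:"
result = generator(prompt, max_length=30, do_sample=False)
print(result[0]["generated_text"])
```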

Even Tyler Cowen has been talking about how it’s going to change the world. Tyler is careful to reassure people that GPT-3 is no SkyNet. Tyler doesn’t mention anything about Dr. Keenwit, but I have to guess that he’s not worried about that problem either.

GPT-3 isn’t likely to cause the end of the world. But what about GPT-4? Or GPT-(N+1)?

GPT-3, on its own, just predicts text. It predicts text like a human might write it. You could give it 1000 times more NN parameters and train it on every word ever written, and it would still just predict text. It may eventually be good enough for Dr. Keenwit, but it’ll never be a SkyNet.

Agents

We don’t have to worry about a GPT SkyNet because GPT isn’t an agent. When people talk about agents in an AI context, that means something specific. An agent is a program that interacts with an environment to achieve some goal. SkyNet, for example, is interacting with the real world in order to achieve its goal of world domination (possibly as an instrumental goal to something else). Dr. Keenwit is interacting with society for the same goal. All people are agents, not all programs are agents.

This isn’t to say that GPT-N couldn’t be dangerous. A nuke isn’t an agent. Neither is an intelligence report, but that intelligence report could be very dangerous if read by the right person.

But GPT is a bit more worrisome than an intelligence report or a history book. GPT can interact with you and answer questions. It has no goal other than predicting text, but in the age of the internet text prediction can solve an enormous number of problems. Like writing working software.

If you give GPT an input, it will provide an output. That means that you could feasibly make it into an agent by piping its text output to a web browser or something. People are already proposing additions to GPT that make it more agent-y.

The thing is, GPT still has only one goal: predicting human generated text. If you give it access to a web browser, it’ll just output whatever text a human would output in response to whatever is on the page. That’s not something that’s going to make complicated take-over-the-world plans, though it might be something that talks about complicated take-over-the-world plans.

What if we build structure around GPT-n to turn it into an agent, and then tweak the training objective to do something more active? Do we have SkyNet yet? Steve2152 over at LessWrong still doesn’t think so. He comes up with a list of things that an Artificial General Intelligence (like SkyNet) must have, and argues that GPT will never have them due to its structure.

Steve2152’s argument hinges on how efficient GPT can be with training data. The GPT architecture isn’t really designed for doing things like matrix multiplication or tree search. Both of those things are likely to be important for solving large classes of problems, and GPT would be pretty inefficient at doing it. The argument then analogizes from being inefficient at certain problems to being unable to do other problems (similar to how standard DNNs just can’t do what an RNN can do).

Instead of using a transformer block (which GPT uses), Steve2152 would have us use generative-model based AIs. In fact, he thinks that generative-model based AI is the only thing that could possibly reach a generalized (AGI) status where it could be used to solve any arbitrary problem better than humans. His generative-models seem to just be a group of different networks, all finding new ideas that explain some datapoint. Those models then argue among each other in some underspecified way until one single model emerges the winner. It sounds a lot like OpenAI’s debate methods.

I’m not convinced by this generative-model based argument. It seems too close to analogizing to human cognition (which is likely generative-model sub-agents in some way). Just because humans do it that way doesn’t mean it’s the only way to do it. Furthermore, Steve2152’s argument equates GPT with all transformer architectures, and the transformer can be used in other ways.

Transformers, more than meets the eye

Obviously an AI trained to generate new text isn’t going to suddenly start performing Monte Carlo Tree Search. But that doesn’t mean that the techniques used to create GPT-3 couldn’t be used to create an AI with a more nefarious agent-like structure. Standard DNNs have been used for everything from object recognition to image generation to movie recommendations. Surely we can reuse GPT techniques in similar ways. GPT-3 uses a transformer architecture. What can we do with that?

It turns out we can do quite a lot. Nostalgebraist has helpfully explained how the transformer works, and he’s also explained that it can model a super-set of functions described by things like convolutional layers. This means we can use transformers to learn even more complicated functions (though likely at a higher training expense). The transformer architecture is much more generalizable than models that have come before, which I think largely explains its success.

If we wanted SkyNet, we wouldn’t even necessarily need to design control logic ourselves. If we connect up the output of the GPT-3 architecture to a web browser and tweak the cost function before re-training, we could use the same transformer architecture to make an agent.

It’s not even clear to me that the transformer will never be able to do something like tree search. In practice, a transformer only outputs one word at a time. When you want more than one output word, you just repeat the output portion of the transformer again while telling it what it just output. (You can get a good example of what that looks like in this explainer). If you train a transformer to output sentences, it’ll do it one word at a time. You just keep asking it for more words until it says that it’s done by giving you an “end” symbol. It seems possible to use this structure to do something like tree search, where the output it gives includes some kind of metadata that lets it climb back up the tree. You’d never get that with the training corpus that GPT-3 uses, but with the right training data and loss function it seems feasible (if very inefficient).
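Here’s a minimal sketch of that one-token-at-a-time loop, with `model` standing in for a hypothetical function that scores every vocabulary token given the text so far (this is the shape of autoregressive decoding, not GPT-3’s actual code):

```python
def generate(model, prompt_tokens, end_token, max_steps=100):
    """Autoregressive decoding: repeatedly ask the model for the next token,
    feeding everything generated so far back in, until it emits the end token.
    `model` is a hypothetical callable returning a score for each vocab token."""
    tokens = list(prompt_tokens)
    for _ in range(max_steps):
        scores = model(tokens)  # one forward pass over the whole prefix
        next_token = max(range(len(scores)), key=scores.__getitem__)  # greedy choice
        if next_token == end_token:
            break
        tokens.append(next_token)
    return tokens
```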

But if we’re really worried about being able to do tree search (or some other specific type of computation) in our future SkyNet, then maybe we can just put that code in manually.

AlphaGo to MuZero

Hard-coded agent-like structure is a large part of what made DeepMind’s AlphaGo and its descendants so powerful. These agents play games, and they play them well. AlphaGo and AlphaZero set world records in performance, and are able to play Go (a famously hard game) at superhuman levels.

The various Alpha* projects all used a combination of the game rules, a hand-coded forward planning algorithm, and a learned model that evaluated how “good” a move was (among other things). The planning algorithm iteratively plans good move after good move, predicting the likely end of the game. The move that is predicted to best lead to victory is then chosen and executed in the actual game. In technical terms, it’s doing model based reinforcement learning with tree-search based planning.
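A heavily simplified sketch of that plan-then-act idea, with the real Monte Carlo Tree Search replaced by a short brute-force lookahead; `rules` and `value_net` are hypothetical stand-ins for the hand-coded game rules and the learned move-goodness model (and it ignores the opponent’s turn for brevity):

```python
def plan_move(state, rules, value_net, depth=2):
    """Model-based planning: simulate candidate moves with the known game rules,
    score the resulting positions with a learned value network, and play the
    move whose best continuation looks most valuable."""
    best_move, best_value = None, float("-inf")
    for move in rules.legal_moves(state):
        value = lookahead(rules.next_state(state, move), rules, value_net, depth - 1)
        if value > best_value:
            best_move, best_value = move, value
    return best_move

def lookahead(state, rules, value_net, depth):
    # Stop at a fixed depth (or the end of the game) and ask the value network
    # how good this predicted position looks.
    if depth == 0 or rules.is_terminal(state):
        return value_net(state)
    return max(lookahead(rules.next_state(state, m), rules, value_net, depth - 1)
               for m in rules.legal_moves(state))
```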

By changing what game rules AlphaZero used, it could be trained to superhuman levels on Chess, Go, or Shogi. But each game needed the game rules to be manually added. When it needs to know where a knight would be allowed to move, AlphaZero could just consult the rules of chess. It never had to learn them.

Now DeepMind has released a paper on MuZero, which takes this to a new level. MuZero learns the game rules along with goodness of moves. This means that you can train it on any game without having to know the rules of the game yourself. MuZero achieves record breaking performance on board games and Atari games after automatically learning how the game is played.

With MuZero, the game rules are learned as a hidden state. This is pretty different from prior efforts to learn a game model from playing the game. Before this, most efforts emphasized recreating the game board. Given a chess board and a move, they’d try to predict what the chess board looks like after the move. It’s possible to get decent performance doing this, but a game player built this way is optimizing to produce pictures of the game instead of optimizing to win.

MuZero would never be able to draw you a picture of a predicted game state. Instead, its game state is just a big vector that it can apply updates to. That vector is only loosely associated with the state of the actual game (board or screen). By using an arbitrary game state definition, MuZero can represent the game dynamics in whatever way lets it win the most games.

MuZero uses several distinct neural nets to achieve this. It has a network for predicting hidden game state, a network for predicting game rules (technically, game dynamics), and a network for predicting a move. These networks are all hand-constructed layers of convolutional and residual neural nets. DeepMind in general takes the strategy of carefully designing the overall agent structure, instead of just throwing NN layers and compute at the problem.
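In outline, those three networks fit together something like this; the function names are my paraphrase of the paper’s representation/dynamics/prediction split, not DeepMind’s code:

```python
def imagined_return(observation, actions, representation, dynamics, prediction):
    """Roll a candidate action sequence forward entirely in MuZero's learned
    hidden state, never consulting the real game. The planner scores many such
    imagined futures and picks the action whose future looks best."""
    hidden = representation(observation)           # observation -> hidden state
    total_reward = 0.0
    for action in actions:
        hidden, reward = dynamics(hidden, action)  # learned "game rules" step
        total_reward += reward
    _policy, value = prediction(hidden)            # move probabilities + value
    return total_reward + value                    # predicted return of this future
```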

I’m a lot more worried about MuZero as a SkyNet progenitor than I am about GPT-3. But remember what we learned from Nostalgebraist above? The transformers that GPT-3 are based on can be used to learn more general functions than convolutional nets. Could GPT and MuZero be combined to make a stronger agent than either alone? I think so.

It’s interesting to note here that MuZero solves one of the most common complaints from writers about GPT-3. GPT-3 often loses the thread of an argument or story and goes off on a tangent. This has been described as GPT not having any internal representation or goal. Prosaically, it’s just generating text because that’s what it does. It’s not actually trying to tell a story or communicate a concept.

MuZero’s learned hidden state, along with a planning algorithm like MCTS, is able to maintain a consistent plan for future output over multiple moves. Its hidden state is the internal story thread that people are wanting from GPT-3 (this is a strong claim, but I’m not going to prove it here).

I like this plan more than I like the idea of plugging a raw GPT-3 instance into a web browser. In general, I think making agent structure more explicit is helpful for understanding what the agent is doing, as well as for avoiding certain problems that agents are likely to face. The hand-coded planning method also bootstraps the effectiveness of the model, as DeepMind found when they trained MuZero with planning turned off and got much worse performance (even compared to MuZero trained with planning turned on and then run with planning turned off).

Winning

The main follow-on question, if we’re going to be building a MuGPT-Zero3 model, is what “winning” means to it. There are a lot of naive options here. If we want to stick to imitating human text, it sure seems like a lot of people treat “getting other people to agree” as the victory condition of conversation. But the sky is the limit here; we could choose any victory condition we want. Text prediction is a highly underconstrained problem compared to Go or Atari.

That lack of constrained victory condition is a big part of the AGI problem in the first place. If we’re going to be making AI agents that interact with the real world to achieve goals, we want their goals to be aligned with our own human goals. That’s how we avoid SkyNet situations, and we don’t really know how to do it yet. I think lack of knowledge about useful value functions is likely the biggest thing keeping us from making AGI, aligned or not.

If we ask whether we can get AGI from GPT or MuZero, then we get into all sorts of questions about what counts as AGI and what kind of structure you might need to get that. If we just ask whether GPT and MuZero are a clear step towards something that could be dangerous on a global level (like SkyNet), then I think the answer is more clear.

We’re getting better at creating models that can answer factual questions about the world based on text gleaned from the internet. We’re getting better at creating models that can generate new text that has long duration coherency and structure. We’re not yet perfect at that, but the increase in capability from five years ago is stunning.

We’re also getting better at creating agents that can win games. As little as six years ago, people were saying that a computer beating a world champion at Go was a decade away. It happened five years ago. Now we have MuZero, which gets record-breaking scores on Atari games after learning the rules through trial and error. MuZero can match AlphaGo’s Go performance after learning Go’s rules the same way. This is also a stunning increase in game-playing ability.

We don’t have a good way to constrain these technologies to work for the good of humanity. People are working on it, but GPT-3 and MuZero seem like good arguments that capabilities are improving faster than our ability to align AI to human needs. I’m not saying that we need to run through the datacenters of DeepMind and OpenAI deleting all their data (and Dilly McBean’s magic magnetism powers wouldn’t work with contemporary storage technology anyway). I am saying that I’d love to see more emphasis on alignment right now.

There are a few different organizations working on AI alignment. OpenAI itself was originally formed to develop AI safely and aligned with human values. So far most of the research I’ve seen coming out of it hasn’t been focused on that. The strongest AI safety arguments I’ve seen from OpenAI have been Paul Christiano basically saying “we should just build AGI and then ask it how to make it safe.”

In all fairness to OpenAI, I haven’t tracked their research agenda closely. Reviewing their list of milestone releases reveals projects that seem to emphasize more powerful and varied applications of AI, without much of a focus on doing things safely. OpenAI is also operating from the assumption that people won’t take them seriously unless they can show they’re at the cutting edge of capabilities research. By releasing things like GPT, they’re demonstrating why people should listen to them. That does seem to be working, as they have more political capital than MIRI already. I just wish they had more to say about the alignment problem than Paul Christiano’s blog posts.

In fairness to Paul Christiano, he thinks that there’s a “basin of attraction” for safety. If we build a simple AI that’s in that basin, it will be able to gradient descend into an even safer configuration. This intuitively makes sense to me, but I wouldn’t bet on that intuition. I’d want to see some proof (like an actual mathematical proof) that the first AGI you build is starting in the basin of attraction. So far I haven’t seen anything like that from Paul.

DeepMind, on the other hand, was founded to create a general purpose AI. It wasn’t until Google bought it that they formed an internal ethics board (which apparently has a secret membership). They do have an ethics and society board (separate from their internal ethics board) that is also working on AI safety and human alignment (along many different axes). It seems like they’re taking it seriously now, and they have a fairly active blog with detailed information.

MIRI is working exclusively on AI safety, not on capabilities at all. They’ve released some papers I find pretty intriguing (especially related to embedded agency), but publicly visible output from them is pretty sporadic. My understanding is that they keep a lot of their work secret, even from other people that work there, out of fear of inspiring larger capability increases. So I basically have no idea what their position is in all this.

All of this leaves me worried. The ability to create AGI seems closer every year, and it seems like we’re making progress on AGI faster than we are making progress on friendly AI. That’s not a good place to be.

On Knowledge Bootstrapping v0.1

Over the last few weeks, AcesoUnderGlass has been posting a series about how to research things effectively. This culminated with her Knowledge Bootstrapping Steps v0.1 guide to turning questions into answers. To a first approximation, I think this skill is the thing that lets people succeed in life. If you know how to answer your own questions, you can often figure out how to do any of the other things you need to do.

Given how important this is, it seemed totally worth experimenting with her method to see if it would work for me. I picked a small topic that I’d been meaning to learn about in detail for years. I used the Knowledge Bootstrapping method to learn about it, and paid a lot of attention to my experience of the process. You can see the output of my research project here. Below is an overly long exploration of my experience researching and writing that blog post.

How I used to learn vs Knowledge Bootstrapping

My learning method has changed wildly over the years. When I was in undergrad, I thought that going to lectures was how you learned things and I barely ever studied anything aside from my own notes. This worked fine for undergrad, but didn’t really prepare me to do any on-the-job learning or to gain new skills. I spent an embarrassingly long time after undergrad just throwing myself really hard at problems until I cracked them open. If I was presented with a project at work that I didn’t immediately know how to do (and that a brief googling didn’t turn up solutions for), then I would just try everything I could think of until I figured something out. That usually worked eventually, but took a long time and involved a lot of dumb mistakes. And when it didn’t work I was left stuck and feeling like I was a failure as a person.

When I went back to college to get a Master’s degree, I knew I couldn’t keep doing that. I had visions of getting a PhD., so I thought I’d be doing original research for the next few years. I had to get good at learning stuff outside of lectures. My approach to this was to say: what did my undergrad professors always tell me to do? Read the textbooks.

So in grad school I got good at reading textbooks, and I always read them cover-to-cover. I didn’t really get good at reading papers, or talking to people about their research or approaches. Just reading textbooks. This was great for the first two years of grad school, which were mostly just taking more interesting classes. I did a few research projects and helped out in my lab quite a bit, but I don’t think I managed to contribute anything very new or novel via my own research. I ended up leaving after my Master’s for a variety of reasons, but I now wonder if going through with the PhD would have forced me to learn a new research method.

Since grad school, my approach to learning new things, answering questions, and doing research has been a mix of all three methods I’ve used. I’ll read whatever textbooks seem applicable from cover to cover, I’ll throw myself at problems over and over until I manage to beat them into submission, and I’ll watch a lot of lectures on youtube. All of these methods have one thing in common: they take a lot of time. Now that I have a family, I just don’t have the time that I need to keep making the progress I want.

This is why I was so interested in AcesoUnderGlass’s research method. If it worked, it would make it so much easier to do the things I was already trying to do.

Knowledge Bootstrapping Method

My (current) understanding of her method is that you:

  1. figure out what big question you want to answer
  2. break that big question down into smaller questions, each of which feed into the answer to the big question
  3. repeat step 2 until you get to concrete questions that you could feasibly answer through simple research
  4. read books that would answer your concrete questions
  5. synthesize what you learned from all the books into answers to each of your questions, working back up to your original big question

This is a very question-centered approach, which contrasts significantly with my past approaches. It seems obvious that breaking a problem down like this would be helpful, and honestly I do a lot of problem-reductionism during my beat-it-into-submission attempts. Is this all there is to her super-effective research method?

Elizabeth spends a lot of time on the right way to take notes, going so far as to show templates for how she does it. When I first saw this, I thought it would be useful but not critical. As I’ll discuss below, I now think some of the templating does a lot of heavy lifting in her method.

Furthermore, she based her system around Roam. I’ve been hearing an enormous amount about Roam over the past year or so. People in the rationality community seem enamoured of it. At this point it mostly seems like a web-based zettelkasten to me, and I already use Zettlr. Zettlr is also free, and stores data locally (my preference). It’s intended to be zettelkasten software, but I mostly just use it to journal in and I’m not very familiar with many of its zettelkasten features. I decided to use that for my project, since it’s already where I write most of my non-work writings and it seems comparable to Roam.

Elizabeth shared several sample pages from her own Roam research. When I browsed Elizabeth’s Roam it seemed super slow. I assume there’s something about loading pages from somebody else’s account that makes it slower, because the responsiveness seemed unusable to me.

Ironically, Zettlr crashed on me right after I posted my research results to my blog. I ended up having to uninstall it completely and install a new version to get it working again.

Questions

For my test project, I wanted to do a small investigation to see what Knowledge Bootstrapping was like. Elizabeth gives a couple of her own examples that involve answering pretty large and contentious questions. I picked something small just to get a sense for the method, and decided to learn how people protect electronics from radiation in space. I’m interested in this topic just as a curiosity, but it’s also useful to have a good understanding of it for my job (even though I’m mostly doing software in space these days).

I want to talk a bit about why that choice ended up being better than any alternative that past-me could have chosen.

The rationality and effective-altruism communities have infected me with save-the-world memes. People deeper in those communities than me seem to express those memes in different ways, but there’s definitely a common sense of needing to work on the biggest and most important problems.

This particular meme has been a net-negative for me so far. Over the past few years, I sometimes asked myself what I should do with my life, or what my next learning project should be, or what my five year plan should be. I approached those questions from a first principles mindset. I would basically say to myself “what is utopia? How do I get there? And how do I do that?” and try to backwards chain from this very vague thing that I didn’t really understand.

This never worked, and I’d always get stuck trying to sketch out how the space economy works in 2200 instead of chaining back to a question that’s useful to answer now. Because I was approaching this project with the mindset of “let’s experiment with something small to see how Elizabeth’s advice works”, I just took a question I was curious about and that would be useful to answer for work. That was amazingly helpful, and I now think that when choosing top level questions I should go with what I’m curious about and what feels useful, and just avoid trying to come up with any sort of “best” question.

The crucial thing here seems to be learning something that feels interesting and useful, instead of learning something that you feel like you should learn. I still think doing highly impactful things is important, but I’m left a bit confused about how to do that. The thing I’ve been doing to try to figure out what’s most important has been sapping my energy and making me feel less interested in doing stuff, which is obviously counterproductive.

Question Decompositions

Decomposing my main question into sub-questions was a straightforward process. It took about five minutes to do, and those sub-questions ended up guiding a lot of my research and writing.

One failure mode that I have with reading books and papers is that it can be hard to mentally engage with them. This is one of the reasons that I have tended to read textbooks cover-to-cover. It makes it easier to engage with each section because I know how it relates to the overall content of the book. When I’ve tried reading only a chapter or section of a book, new notation and terminology has often frustrated me to the point of giving up or just deciding to read the whole thing.

Having the viewpoint of each of my sub-questions let me side-step that issue. For each paper, I could just quickly skim it to find the points that were relevant to my actual questions. Unknown notation and terminology became much easier to handle, because I knew I didn’t have to handle all of it. If something didn’t bear directly on one of my sub-questions (say because it dealt more with solar cycles than with IC interactions), I could safely skip it. If it was important, I knew I only had to read enough to understand the important parts, and that bounded task helped me to keep my motivation up.

When I finished reading a paper, it was always clear what my next step was: I’d go back to my list of questions and see which ones still weren’t answered. Sometimes the answer to one would create a few new questions, which I’d just add to my list.

This also explains why breaking the question down into parts at the beginning is more useful than the decomposition I do when I’m debugging something. By starting with a complete structure of what I think I don’t know, I have context for everything I read. That lets me pick up useful information more quickly, because it’s more obvious that it’s useful. I’ve had numerous debugging experiences where I realized that a blog post I’d read a week ago actually contained an unnoticed solution to my problem. By starting with a question scaffold, I think I could speed that process up.

Sources

Elizabeth emphasizes the use of books to answer the questions you come up with. She spends at least one blog post just covering how to find good books. I suspect that this is a bigger problem for topics that are more contentious. The question that I was trying to answer is mostly about physics, and I didn’t have to worry too much about people trying to give me wrong information or sell me something.

I also didn’t particularly want to read 12 books to answer my question, so I decided at the beginning that I’d focus on papers instead. Those tend to be faster to read, and I thought they’d also be more useful (though if I could find the exact right book, it might have answered all my questions in one fell swoop).

I did have some trouble finding solid papers. Standard Google searches often turned up press releases or educational materials that NASA made for 6th graders. Those didn’t have the level of detail I needed to really answer my low-level questions.

So my method for finding sources was mainly to do scattershot Google searches, and then Google Scholar searches. I refined my search terms as I read, and tweaked them depending on which specific sub-question I was trying to answer. When I found a good paper, I would sometimes look at the papers it cited (though honestly I did this less than would have been useful).

In general I think I learned the least from this aspect of the project. Part of this might be that my question just didn’t require as much information seeking expertise as some of the questions that Elizabeth was working on. Part of me wants to do another, slightly larger, Knowledge Bootstrapping experiment where I address a question that is less clear cut or more political.

One thing I did notice while doing the research was that a part of me sometimes didn’t want to look things up. It wanted to answer the questions I posed from first principles, and the idea of just looking up a table of data seemed like cheating. This reluctance may come from a self-image I have as someone who can figure things out; looking things up may challenge that self-image, leading me to think less of myself. I think this is a pretty damaging strategy, though it may explain a bit of my old beat-it-into-submission method of solving technical problems. It might be useful to explicitly identify to myself whether I’m trying to finish a project or to challenge myself. If I’m facing a challenge question, then working it through on my own is noble. If I’m trying to finish a project, then I’m just wasting time. I’d like to not have moral or shame associations in either case.

Reading and Notes

Reading and note-taking are definitely where the Knowledge Bootstrapping process really shines. Efficiently pulling information out of a text is hard, and Elizabeth uses even more structure for this than I think she realizes (or at least more than Knowledge Bootstrapping makes explicit).

Her strategy for note-taking is:

  1. make a new page for each source using a specific template
  2. fill in a bunch of meta-data about the source
  3. brain-dump everything you already think about the source
    • the explicit purpose for this is that it gets the thoughts out of your head, letting you actually focus on the source information
    • I suspect that a large part of the benefit is that you explicitly predict what the source will say, making it easier to notice when it says something different. That surprise is likely the key to new information
  4. fill in the source’s outline (I never did this step)
  5. fill in notes for each section
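Since I’m using Zettlr rather than Roam, I ended up wanting a way to stamp out that structure quickly. Here’s a minimal sketch of what that could look like as a small Python script that writes a fresh markdown note per source. The section names are my own reconstruction of the steps above, not Elizabeth’s exact template, and the file-naming scheme is just something I made up.

```python
# Sketch only: stamps out a blank source-note in markdown, following the
# five steps listed above. Field names and file layout are my own guesses.
from datetime import date
from pathlib import Path

NOTE_TEMPLATE = """# Source: {title}

## Metadata
- Author: {author}
- Year: {year}
- Started reading: {started}

## Brain dump (everything I already think about this source)
-

## Outline
-

## Notes by section
-
"""

def new_source_note(title: str, author: str, year: int, notes_dir: str = "notes") -> Path:
    """Create a fresh markdown note for a source and return its path."""
    path = Path(notes_dir) / f"{year} {author} - {title}.md"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(NOTE_TEMPLATE.format(
        title=title, author=author, year=year, started=date.today().isoformat()))
    return path

# Example call (hypothetical source):
# new_source_note("Radiation Hardening Overview", "Somebody", 2019)
```

Having the brain-dump section sit at the top of every new note is the part I most want to keep, since that’s the step that forces the prediction.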

Elizabeth’s recommendation is that if you’re not sure what to write down in the notes, you should go back to your questions and break them down further. I can confirm that after I had all my questions broken out, it was very easy to figure out what was relevant. This also made it easier to skim the source and to skip sections. I knew at a quick glance whether a chapter or section was related to my main question and had no compunctions about skipping around.

This is a pretty big difference from how I normally read things. I tend to be a completionist when I read, and I definitely feel an aversion to skimming or skipping content. In the past, I’d feel the need to read an entire source document in order to say whether it was “good” or whether I “liked it”. I had a sense that if I didn’t read every word, I couldn’t tell people that I’d read it. And if I couldn’t tell people that I’d read it, I wouldn’t get status points for it or something. Maybe there’s something here about reading not for knowledge but for status and identity.

The knowledge that I was reading for a specific purpose was very freeing, and I felt much more flexible with what I could read or not read.

In any case, I felt comfortable reading the sources just to pick out information, and I felt comfortable with my ability to pick out whatever information was important. What I was less comfortable with was recording that information in a useful way.

This is the main place where I would have liked more information from Knowledge Bootstrapping. When I looked at Elizabeth’s Roam examples, I was blown away by the structure of her notes. They’re not just well organized at the section level; each individual paragraph is tagged with claim/synthesis/implication annotations. She also carefully records page numbers from the book for everything, and there are lots of searchable tags that link different books together.

I don’t use Roam, so its unfamiliarity is probably part of why I find this so impressive. The amount of effort she puts into her notes is kind of staggering, and I find them much closer to literal art than my own stream-of-consciousness rambling.

The thing is, my own terse, stream-of-consciousness notes are driven by a desire for efficiency. I don’t particularly want to stop and write down page numbers every couple of pages of a book. I’m certain that she gets a lot of benefit from it when reviewing things later; I’m just not sure it would be worth it for me.

This is where my inexperience with my own tools, Zettlr and markdown, really hampered me. I’m pretty sure I could get Zettlr to do most of what Roam was doing, and maybe even do it efficiently and speedily. To get there, though, I would have had to pause this research project and start another one on just using Zettlr.

I would love to watch Elizabeth take notes on a chapter in real time, to see more of what her actual workflow is like. How much effort does she really expend in those notes, and does it seem worth it to me? Would it seem worth it to me for a more ambitious project? I think watching that would also help me learn the method a lot better than reading about how she does notes, as it would be directly tied to a research project already.

Synthesis

Synthesizing notes into answers to questions was conceptually easy, but logistically I was limited by the same inexperience with my tools as I was when I was taking the notes themselves. Before I do another research project, I want to learn more about using Zettlr (or another tool if I choose to switch) to make citations and cross-post connections.

During this research project, I would often take notes on a source into two documents at once. I’d put my notes directly into the notes doc for that source, then switch to my questions doc and immediately start adding data there.

I noticed while doing my research project that I at times wanted to construct an argument between a couple of sources. “X says x, Y says y, how do I use both these ideas to answer my question?” I found actually doing this to be annoying, and I ended up not really doing much of it in my notes or in my synthesis.

That type of conversation between sources is one of the great strengths of the erstwhile Slate Star Codex (as well as many other blogs I love), so I want to encourage it in my own writing. I don’t do it by default, so the fact that the method made it feel desirable seems like a strength of the method. Before I do something like this again, I’d want to remove whatever barriers made me averse to that kind of synthesis.

This is the first time that I’ve appreciated the qualitative differences between different citation styles. Prior to this, when I would write a paper or report, I’d just throw a link or title into a references section while I was writing and clean it up later. I’d pick whatever citation style was called for by the journal/class that I’d be submitting the paper to. I treated citations (and citation style) like something that was getting in the way of writing a paper and figuring things out.

Taking notes (and later synthesizing them) from a question-centered perspective showed me why citations are useful beyond just crediting others. If I were comfortable with an easy-to-use citation style (AuthorYear?), I could refer to sources that way in my notes and synthesis docs and more easily create the kind of “X says, Y says” conversations between sources that I think are so useful.

That seems to be the root of my aversion to doing this type of source vs source conversation. I knew I was going to post a blog with my synthesis, and the idea of going back and fixing all the citations into a coherent style made me not want to do it in the first place.

Elizabeth recommends writing out the answers to your sub questions in the same doc as the questions themselves. Step 9 of her extended process description is just “Change title of page to Synthesis: My Conclusion” because your questions now all have answers. I found this advice to be very helpful. I would sometimes get tired of just reading and note taking, and feel like I should be done. Then I’d go write up the answers to my questions, and in doing so I’d come to a point that I couldn’t really explain yet. That would re-energize me, as suddenly there was an interesting question to address. The act of organizing all the things I’d read about helped me focus on why I was interested in the question in the first place as well as what I still didn’t understand. This aspect of the process, creating questions and then long-form answering them in my own words, seems to cause me to automatically do the Feynman Technique.

KB and me

I liked this experiment. I learned what I wanted to learn on an object level, and doing it felt more free and curiosity driven than a lot of my reading and learning. I think regardless of what I do for future learning projects, I’ll definitely do the question decomposition part of KB again. I’m not quite sure about using the note-taking structure of the method; I’ll need to experiment with it a bit more.

I do think that I’d want to know better how to use my tools before I do the next project like this. For this first project, I had the excitement of doing a new thing a new way to keep me doing the method. I think once the excitement of a new method wears off, the friction of notetaking could stop me from doing it if I didn’t get good at markdown and zettlr/Roam first.

Honestly, I think one of the greatest benefits of this project was the introspection of trying to figure out how well I was learning. If I hadn’t been paying attention to my own thoughts and motivations, this project could have produced a similar understanding of my original question without giving me any information about how I learn or what emotional blocks were contributing to poor learning methods. That’s not really a part of Knowledge Bootstrapping on its own, but maybe having a backburner process running in my mind asking how my learning is going would help me more than it would slow me down.

Hardening electronics for space radiation

It’s tradition when talking about space electronics to open with Telstar. Telstar 1 was the first commercial satellite to fail due to radiation effects. The US exploded a nuke in Earth’s upper atmosphere the day before Telstar was launched. That nuclear blast knocked out streetlights, set off burglar alarms, and damaged telecoms infrastructure in Hawaii, 900 miles away from the detonation. The charged particles from that explosion hung around in orbit and created man-made radiation belts that lasted for more than five years. At least six satellites failed due to the additional radiation from the blast.

Most spacecraft don’t have to deal with nuclear fallout (floatup?), but the radiation environment outside of Earth’s atmosphere is still a fearsome thing. A high energy particle can destroy sensitive electronics, and even low energy particles can eventually sandblast a circuit into submission. So how do we trust the satellites we have in orbit now? How can we send robots to Mars and expect them to work when they get there?

Before we can say how people build reliable computers in space, we need to know exactly what might happen to them there. There are an enormous number of engineering constraints on space hardware, ranging from thermal management to power management to vibration. Most of these concerns have terrestrial analogues; car makers in particular have gotten good at a number of them. I’m going to focus here only on the impacts of space radiation, as they are the most different from what you might worry about when designing hardware for Earth.

Space Weather? More like space bullets

Empty space is full of charged particles. It’s like putting electronics in a very, very diffuse dust storm and letting that dust bombard them. But instead of dust, the particles in space are generally individual electrons or atomic nuclei. Many of these come from the sun. In addition to putting out the photons that drive all life on Earth, our star also spits out electrons, protons, helium nuclei, and even heavier ions.

The number and type of particles that our sun spits out is variable and somewhat unpredictable. There’s an 11-year solar cycle, where the sun puts out more particles for a while and then fewer. That cycle is only 11 years long on average though; it could be as short as 9 or as long as 13. That means if you’re planning a mission that’s 5 to 10 years away, you can only make a good guess about what the solar cycle will be doing at launch. Aside from the cycle, you also have to worry about coronal mass ejections and solar flares, where the sun just burps out a ridiculously huge amount of radiation all at once. These ejections have shut down satellites, broken national electrical grids, and generally made life harder for electrical engineers on Earth and in space.

Aside from our star, you also have to worry about every other star in the universe. The particles that don’t come from our sun are generally lumped into one category called Galactic Cosmic Radiation (why is it galactic and cosmic at the same time?). If Galactic Cosmic Radiation makes it near Earth, it’s probably pretty energetic. Lots of energy means lots of chances to hurt our stuff, so we definitely need to watch out for it.

Regardless of the source of the particle, there are really only three types of particles that we have to worry about: electrons, protons, and heavy nuclei.

Electrons are very, very small. They don’t have much mass, so they often don’t carry much energy, and they only have a charge magnitude of 1 unit. There are a ton of these in space, so they’re the sand that you generally get sandblasted with. Each individual particle doesn’t do much, but they add up.

Protons are pretty big; they’re roughly 1,800 times as massive as an electron. Even though they have an electrical charge of the same magnitude as an electron’s, they’ll often carry more energy due to their larger mass. You can think of these like gravel in a sandstorm. There are fewer of them, but they hit you harder. And some of them have enough energy to really hurt you all on their own.

Finally, there’s the heavy nuclei. A proton is just a hydrogen atom without the electron bound to it. Heavy nuclei are the same thing, but for larger atoms like helium. With Galactic Cosmic Rays, you may even be hit by uranium nuclei. Each of these particles has more mass and more electric charge than a proton, so they’ll generally carry more energy. It’s like being hit by a rock. There are far fewer of these heavy nuclei though. Galactic Cosmic Radiation, for example, is 85% protons, 14% helium nuclei, and only 1% heavier nuclei.

There’s a lot to be said about space weather here. Where you are in space, whether you’re in low earth orbit or on your way to the moon, has a huge impact on how much radiation you see. So does the solar cycle, and even the specific orbital plane you may be in when orbiting the Earth. These details are highly important if you’re going to be designing hardware for a real mission, as they’ll drive the radiation tolerance specification that your hardware has to meet. For now, we don’t need to care about that. We don’t have a real mission to design, we’re just trying to understand the general risks to hardware.

So let’s say you’ve got some particle flying around space. It’s probably a tiny electron, but there’s a small possibility it could be an enormous uranium nucleus or something. What happens when that particle hits your electronics?

Circuit Collisions

An atom is like a chocolate covered coffee bean. You’ve got the small and hard nucleus with a ton of contained energy, and that’s surrounded by a soft and squishy electron shell. Maybe back in high school you saw a drawing of an atom that was like an electron-planet orbiting a nucleus-sun? That’s not accurate. The electrons really do form a (delicious, chocolatey) shell around the nucleus due to quantum something-something (look, this isn’t supposed to be a physics post).

This is important because when a charged particle from the depths of space hits our coffee-bean atom, it’s going to bounce off of that electron shell. When it does, it’ll deposit some of its energy into the electron. Depending on how much energy it leaves behind, it could knock an electron free of the atom it hit. If you look at a chip that’s been hit by a cosmic ray, you’ll often detect tracks of electrons that have been knocked free from their atoms as the incoming particle bounces around like a pachinko ball in the atomic lattice of the chip.

What do these extra free charges in your circuit actually do? That depends completely on where they are. Lots of them may do nothing, especially if the particle hits copper or PCB substrate. If the particle passes through a transistor or a capacitor, the extra charges can do more damage, especially as they build up.

Chips that are rated for use in space have a specification called Total Ionizing Dose (TID), measured in rads. This is the amount of radiation the chip is rated to withstand before failure. Each incoming particle adds a bit more to the total dose the chip has received, increasing the number of extra electrons (and, technically, holes as well). These extra charges can build up in transistors and diodes, changing switching characteristics or power draw. Eventually, the buildup can force transistors into an always-on state, and then your Central Processing Unit can’t Process anymore.

There’s another failure mode, too. You might have a particle hit your chip with enough energy that that single event causes a problem. These events, creatively called Single Event Effects, can range in impact. You might be looking at something as small as a bit flip or as large as a short from power to ground. Even a bit flip isn’t necessarily small, if it changes the operation of a critical algorithm at the wrong moment.
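To make the “even a bit flip isn’t small” point concrete, here’s a toy sketch (not taken from any real flight code; the setpoint and the choice of bit are made up): flipping a single bit in the stored representation of a number can change it by a huge amount.

```python
import struct

def flip_bit(value: float, bit: int) -> float:
    """Flip one bit in the IEEE-754 representation of a 32-bit float."""
    word = struct.unpack("<I", struct.pack("<f", value))[0]
    return struct.unpack("<f", struct.pack("<I", word ^ (1 << bit)))[0]

altitude_setpoint_km = 400.0               # hypothetical value held in RAM
print(flip_bit(altitude_setpoint_km, 23))  # one exponent bit flips: 400.0 -> 200.0
```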

The effects of an SEE depend completely on how your chip was made, and I haven’t had any luck finding information about how to predict that kind of thing from first principles. In practice, I think chip makers just build their chip and then shoot radiation at it to see what happens. If they don’t see any SEEs, they call it good. When manufacturers test their chips for susceptibility to SEEs, they use a measurement called Linear Energy Transfer (LET), which tells you how much energy gets transferred from the incoming particle to the atoms it hits. Higher LET values mean more damage to the chip. While TID is about the slow buildup of defects in the chip, SEEs are dictated by whether a particle with a specific energy can cause your chip to fail on its own. So if you test all the parts of a chip up to the energy level you might see in flight, you can have confidence that it won’t fail. In one 1996 experiment, a strong falloff in space particle counts was seen for LET values above about 10 MeV-cm^2/mg. If you don’t see any SEEs for impacts around that level or below, you might consider yourself kind of safe.

You might also ask: what happens if an incoming particle hits the coffee bean at the center of all that chocolate? I mean: what happens if the particle hits the atomic nucleus of an atom in a semiconductor lattice? This can and does happen, and the impacts are pretty similar to what was described above for impacts to the electron shell. In general it’s less likely. Electrons usually don’t carry enough energy to do much to an atomic nucleus. Heavier nuclei have more electric charge, so they get pushed away harder by the nucleus and generally can’t make it in either. It’s usually the protons that hit a nucleus, pushing it out of position or causing it to decay into other elements. Functionally, though, you’ve got the same TID and SEE failure modes.

A similar failure mode to Total Ionizing Dose is called Displacement Damage. This is caused not by freed electrons, but by nuclei that have been pushed out of their lattice. In practice this can have similar effects to TID.

Reliability at the IC level

The best way to avoid radiation problems in an electronic circuit is to build your circuit out of parts that don’t have problems with radiation. This is what government satellites have always done, and it’s what commercial satellites mostly did until very recently. You can pick CPUs, RAM, etc. that have been designed to withstand radiation. These parts are likely to be larger, more expensive, and less capable than their non-radiation-hardened equivalents.

It takes a lot of money and time to design a chip to be rad-hard, and there aren’t really a lot of customers out there that buy them. That means that chip manufacturers are likely to keep old technologies around for a while to pay off their capital investments. Satellite designers generally don’t mind this, because if they take a risk on a newer part that doesn’t have “space heritage”, their satellite might die before it manages to pay off its own capital costs. Many large satellite designers won’t switch to a new chip unless they literally have to in order to hit their functional requirements. Conservative design decisions dominate the space industry.

So when you are designing your spacecraft, you’ll probably start by making a list of all the parts you want. For each of those parts, you can look for rad-hard versions of them. You’ll look for chips that have a TID that’s above what your mission might experience (including some safety margin). You’ll also look for chips that have been tested and shown not to experience severe SEEs at LET levels below what you might experience in your mission.
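As a sketch, the screening step amounts to a simple filter over your parts list. The part names, TID numbers, margin factor, and LET threshold below are all made up for illustration; real values come from your mission’s radiation analysis and the manufacturers’ test reports.

```python
# Sketch of the part-screening step described above. Part names, TID ratings,
# margin factor, and LET threshold are all made up for illustration.
MISSION_TID_KRAD = 20.0     # assumed mission dose inside the enclosure
DESIGN_MARGIN = 2.0         # a commonly used radiation design margin
MISSION_LET_MAX = 37.0      # assumed worst-case LET, MeV-cm^2/mg

candidate_parts = [
    {"name": "CPU-A (rad-hard)", "tid_krad": 100.0, "see_let_onset": 75.0},
    {"name": "CPU-B (commercial)", "tid_krad": 15.0, "see_let_onset": 8.0},
]

def acceptable(part):
    # Survive the mission dose times the margin, and show no destructive
    # SEEs below the worst-case LET we expect to see.
    return (part["tid_krad"] >= MISSION_TID_KRAD * DESIGN_MARGIN
            and part["see_let_onset"] >= MISSION_LET_MAX)

for part in candidate_parts:
    print(part["name"], "passes" if acceptable(part) else "rejected")
```

The margin factor is there because both the environment models and the test data carry real uncertainty.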

If you’re really serious, you’ll only buy chips from manufacturers that spot test every lot of parts that they make. Some parts are only tested during the design phase, and then you assume that their production runs are good enough. It turns out that there’s enough variance in performance from one production lot to the next that, if you have a critical component in a high radiation mission, using a part made one week after another can make a crucial difference. If that mission needs that level of reliability, you pay for the extra testing.

But let’s say that your mission actually isn’t that critical. You want to put a phone in low Earth orbit for a few days to take some pictures. If it fails, nobody dies and nobody loses their job. In this case, a literal off-the-shelf phone might be good enough. Most commercial components can handle somewhere between 1 krad and 30 krad of TID. That means that if your mission is short enough, you might be able to ignore TID effects entirely. SEE is still a crapshoot though, and there’s always the possibility that some commercial part is on the low end of that TID range and you’re in a solar maximum.
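To put rough numbers on “short enough”: the dose rate below is a placeholder I made up (real rates depend entirely on orbit, solar cycle, and how much structure sits around the part), but the arithmetic shows why a 1 krad part and a 30 krad part are in very different situations.

```python
# Placeholder arithmetic only; the dose rate is an assumption, not flight data.
assumed_dose_rate_krad_per_year = 5.0   # hypothetical dose behind thin enclosure walls

for rating_krad in (1.0, 30.0):
    months = 12 * rating_krad / assumed_dose_rate_krad_per_year
    print(f"{rating_krad:4.0f} krad part: ~{months:.0f} months to reach its TID rating")
```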

If you want to save money, but you also want more reliability than smartphone components will give you, you can use a rad-hard management CPU to control a higher performance commercial CPU. The rad hard CPU is expensive, so you buy a super low performance one and just use it to watch the high performance CPU for errors. If the rad-hard CPU sees an error, it can reset things to a good state (probably, depending on the error). That leads us to our next method of dealing with space radiation.

Active Error Checking

The next thing you’ll want to do to make your spacecraft robust is to actively look for errors at runtime and try to correct them on the fly. This often gets rounded down to “make critical systems redundant”, but there’s actually a lot more to it. The aerospace industry has developed an enormous set of methods for doing this, ranging from error-correcting codes on any data stored in RAM to running every algorithm on multiple CPUs and comparing results. It’s also very common to give critical systems a watchdog: if those systems don’t pat the dog often enough, the watchdog resets the system. Another common technique is to put current monitors on all the power rails. If there’s a sudden current spike, you might have an SEE that’s causing a short. If the current monitor can trigger a reset fast enough, you might not lose the hardware.
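Here’s a minimal sketch of the “run it on multiple CPUs and compare” idea, reduced to a majority vote over three redundant results. This isn’t any particular flight library’s API, just an illustration of the voting logic.

```python
from collections import Counter

def tmr_vote(results):
    """Majority-vote the outputs of three redundant computations.

    If a bit flip corrupts one copy, the other two outvote it. If no two
    copies agree, the safe move is to flag an error and recompute or reset.
    """
    value, count = Counter(results).most_common(1)[0]
    if count < 2:
        raise RuntimeError("no two copies agree; recompute or reset")
    return value

print(tmr_vote([42, 42, 1099]))  # corrupted third copy is outvoted -> 42
```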

This section is short mainly because these techniques are so varied and so application-dependent. When it comes to designing a system for space, more design time is probably spent on active error checking than on any other mitigation method.

Shielding

Finally, you’re going to want to shield your electronics. If you can just keep the particles from hitting your chips, then you don’t have to worry. Almost all electronics in space are shielded in some way or another. Often designers will include an aluminum box around all their electronics.

A note on material. When I first started learning about this, I was comparing space radiation shielding to the lead aprons in a dentist’s office. I started out thinking aluminum was used in space because it was lighter than lead (cheaper to launch), and that as launch costs dropped we’d change materials. This turns out not to be true. If you bombard lead with protons or heavy nuclei, it can break apart into showers of secondary particles that damage your circuit even more than the original cosmic ray would have. Lead works in the dentist’s office because you’re just trying to block X-rays, which aren’t going to break the lead atoms apart. Aluminum is much less likely to produce those damaging secondaries when it’s hit by a GCR, so it makes a much more effective shield.

In fact, it turns out that materials with lower atomic numbers generally do best for radiation shielding (in terms of shielding per unit mass). This is such a large factor that NASA is looking into ways to use hydrogen plasmas to shield against radiation on manned missions. With electronics, we generally want a conductive shield because that gives better EMI performance. There’s also the fact that electronics are usually much more space-constrained than humans, so we probably want a denser material to save volume. This does leave me wondering why aluminum is used so often instead of magnesium. My guess is that, since you pretty much only get magnesium as MgO, it’s not conductive enough to make a good EMI can.

Now we know that we want a shield that’s a light element (low atomic number). We also know that we need a conductive shield to meet our other product requirements. This explains why we pick aluminum. But then we ask: how thick should our shield be?

One shielding experiment was performed on a variety of materials to look at how much radiation made it through.

[Figure: ShieldingMaterialExperiments.png, showing transmitted radiation vs. shield areal density (g/cm^2) for several shielding materials]

The experiments there show thickness measured in g/cm^2. This weird unit makes sense when you consider that you pay launch costs by the kilogram. Two shielding materials may be very different thicknesses, but you really want to compare them by mass (assuming you have the volume for them, which electronics probably don’t). Let’s just focus on the data for aluminum, which has a density of 2.7 g/cm^3. That means we can calculate real-life thicknesses by dividing the areal densities from the plot’s x-axis by 2.7.
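The conversion is simple enough to do in a couple of lines; the two areal densities below are chosen to line up with the 4 cm and 11 cm cases discussed next.

```python
# Convert the plot's areal densities (g/cm^2) into physical thicknesses of aluminum.
AL_DENSITY_G_PER_CM3 = 2.7

for areal_density in (10.8, 30.0):  # g/cm^2
    thickness_cm = areal_density / AL_DENSITY_G_PER_CM3
    print(f"{areal_density} g/cm^2 of aluminum is about {thickness_cm:.1f} cm")
```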

These plots show that for high-energy particles of 1 GeV or more, we don’t get much shielding unless our aluminum box is more than 11 cm thick. That’s probably too thick for anything outside of the Apollo Guidance Computer. What about the more common energies around 500 MeV? For those, an enclosure thickness of 4 cm cuts out all the primary particles and many of the secondary particles created by collisions. That’s still pretty thick, but feasible for some missions.

This also helps show why shielding primarily helps with Total Ionizing Dose. A thick enough shield can cut out most of your low-energy particles, but the higher a particle’s energy, the more likely it is to make it through your shield with only a slight energy decrease. That means shielding is not a great way of solving SEEs, though you could make them less common with a thick enough shield.

Experiments with single energies may not give you the information you want regarding TID. One thing that’s been done is putting radiation sensors in space surrounded by varying thicknesses of aluminum. You can then look at the TID after some given duration. Kenneth LaBel shows some of this data in a presentation from 2004.

These numbers depend a great deal on the specific orbit that the experiment was carried out in, but they do give you a sense for what to expect. For an orbit like this (ranging from 200km to 35790km), there are obvious decreasing marginal gains for shielding thicknesses above about a third of an inch.

Zap

Space. The final frontier. Full of radiation. And that radiation is just particles zipping about. If they hit your circuit, you might get very sad. In general, you can predict how much radiation a spacecraft might see on any given mission, and you can design in mitigations for that radiation. If you’re willing to pay the price in dollars and mass and design time, you can make systems that should survive for quite some time in space (assuming they don’t have a nuke shot at them).


Police Violence Through Schneier’s Lens

Bruce Schneier’s book Liars and Outliers has a security framework that’s been helping me understand police brutality lately. Schneier, one of the top computer security researchers in the world, wrote that book to describe a holistic view of how societies prevent people from harming each other. He breaks harm-prevention into four different categories: moral, social, institutional, and security.

Let’s go through what those different methods are, and how they are (not) used to keep the police from hurting the public.

Moral Pressure

Moral pressure is the internal feeling that individuals have about right and wrong. It’s when someone thinks “it’s wrong to steal,” or “killing people is bad.” When people act on these principles, that is the effect of a moral framework that stops people from wanting to hurt others in the first place.

When officers are required to get implicit bias training, that training is seeking to give the officers an additional moral sense: the sense that certain actions that might look innocuous are actually racist and harmful to society. I think implicit bias is real (though less powerful than most of its proponents believe).

Most police become officers out of a desire to help people. There are exceptions, and white supremacist groups have a history of infiltrating the police. When you actually ask officers why they became police, 68% cite a desire to help people. The same survey also shows that 41% of police officers “see injustice in the world and want to correct it.”

I am worried that only 68% of the officers in that survey said they became police officers in order to help people. As I understand it, the survey allowed respondents to agree with multiple options, so the idea that 32% of police didn’t care enough about their community to even check that box does worry me. Given the role that police play in our society, I’d prefer more emphasis on that particular trait when recruiting.

In fact, I worry that the current police situation is making recruiting moral officers more difficult. If someone right now cares about helping their community or righting injustice, I doubt that they’d think going into the police is the best way to do it. Systemic issues in policing could be actively pushing away the people that we most want to do the work, slowly making police departments worse for their communities.

Societal Pressure

If Moral Pressure doesn’t stop someone from considering a harmful act, Societal Pressure often will. We live in a society, and people want their peers to think well of them. They don’t want to be shunned, whether literally or figuratively. They want to be respected. Unfortunately, this harm prevention method has been totally twisted by our modern policing systems.

Who are a police officer’s main peers? Other police.

Many officers in large cities can’t afford to live in those cities, and so they actually live in the suburbs. This further separates the officers’ peer groups from the people that they directly impact.

Not only that, police (seemingly driven largely by union leadership) have a strong culture of protecting their own. That means that societal pressure is pushing police to ignore violations from within the police, instead of pushing them to avoid those violations. Watch the video of the police pushing a 75-year-old man to the ground, seriously injuring him. Several officers in the group around the man look worried, like they want to help him. The official report (before the video came out) was that the man had tripped. That means that everyone on that team chose not to correct the official report, including the ones who looked worried about him during the event. That’s the force of societal pressure, causing these officers to protect their own instead of doing what’s right.

This severe perversion of social pressure within police departments means that many externally imposed methods of fixing police violence will be swept aside. The desire of humans to be an accepted member of their group is the strongest force in the known universe, and that force is currently working to protect corrupt police officers.

Institutional Pressure

If someone’s morals won’t keep them from violence, and their social peers won’t ostracize them for it, then it’s up to the institutions they’re a part of to impose official sanctions. These institutional sanctions are most of what we talk about when we talk about law and order. The official laws that people are supposed to follow, along with real consequences when the laws aren’t followed.

People (including police) respond to their incentives. Without real consequences for their actions, the police will continue to hurt others.

A lot has been shared recently about all of the failures of institutional pressure against the police. I’ll just mention what I see as the biggest two right here.

The first is qualified immunity. Police who harm others in the course of their job (whether maliciously or not) are protected from the consequences that a member of the general public would face. This protects police from consequences for more than just the egregious murders committed by officers. There are examples of police stealing hundreds of thousands of dollars, shooting children, killing pets, and not being punished for any of it. The effect of qualified immunity is that police face no real consequences for many major incidents of violence.

Secondly, even when an officer is sanctioned for poor behavior, many police unions have negotiated loopholes with local municipal governments. An officer may be sanctioned and lose their job, only to be hired back once their records are sealed shortly afterwards. Records may be kept secret from independent watchdog groups, preventing patterns of behavior from being seen (and thus preventing sanctions from properly escalating).

All of this means that bad behavior among the police is tolerated at an institutional level. It’s prohibitively difficult to use law and sanction to punish police officers that abuse the public. Officers who notice that they can get away with little violations will be more willing to commit larger violations.

Security

Security is what is supposed to save you after morality, social pressure, and institutional sanctions have failed. If someone is trying to hurt you, you use cameras to observe them, you use locks to keep them out, you use guards to physically stop them.

That’s what’s happening right now. Morality, social pressure, and the law have all failed to keep Americans safe from their police. Now we have people filming police behavior and sharing it online. We have people physically putting themselves between people of color and the police trying to hurt them. We have protesters explicitly calling police out on their behavior, demanding change.

Security is the last line of defense because it’s very expensive. I’m really glad the protests are happening now because things need to change. Unfortunately, one reason the protests are so strong right now is that so many people are out of work. Their opportunity cost for protesting is low, so they can put in the effort that’s needed now.

We need to fix the other aspects of our police system so that people don’t keep dying after America fully returns to work. We need to make that change now, and we need to make it last.

Lifeboat

Over on SSC, Nick D. and Rob S. wrote an article about whether we should colonize other planets as part of an existential risk mitigation strategy. They both think that having “lifeboats” of people in case of human extinction level disasters will help to allow humans to recover. This seems like a great idea, but I found their assertion that closed system lifeboats were the best mix of cost, feasibility, and effectiveness to be hilariously optimistic.

N&R describe a lifeboat as a place where “a few thousand humans survive in their little self-sufficient bubble with the hope of retaining existing knowledge and technology until the point where they have grown enough to resume the advancement of human civilization, and the species/civilization loss event has been averted.”

What Makes a Lifeboat?

There are three requirements to this lifeboat idea:

1. closed off from the outside world
2. self-sustaining
3. able to retain existing technology (presumably at the current level)

Each of these three requirements could be satisfied partially. They aren’t yes/no options.

At one extreme, a closed system could just be a town with strict immigration control. Not letting new people in dramatically drops the possibility of an external plague infecting the town. At the other extreme, something like Biosphere 2 is closed off in air, water, and natural resources as well. A lifeboat could be made anywhere in that range.

Similarly, lifeboats could have varying levels of self-sustenance. A lifeboat at the high end would be capable of producing all of its own food and energy, processing all of its waste, and fixing all of its equipment as fast as it breaks. This is a _very_ high bar, but that lifeboat could last indefinitely. The other end of the spectrum is basically what most preppers have: a cache of supplies that will be drawn down steadily until sustenance is no longer possible.

Finally, retaining existing technology is actually very hard. Unless you have an active semiconductor fab, you aren’t maintaining existing technologies. This is why there are only about 5 groups in the world today that are maintaining our current level of technology. That’s the number of fully industrialized nations that are able to rebuild their own technology stack. All the other countries in the world just import (making them even less closed systems).

Without being able to make everything you use, at best you’re just keeping the knowledge alive by teaching people about it. At worst you have a bunch of books and manuals that you’ll have to puzzle over as you try to put technologies back into practice after everyone who had worked with them has died off.

Maximum lifeboaty-ness

My main issue with Nick and Rob’s article is that they assumed they wanted the most extreme form of each of the lifeboat requirements, but they didn’t take seriously the difficulty of achieving them. Their discussion of lifeboat options assumes complete self-sustenance and (generally) complete closure.

This is most evident in their (lack of) discussion about retaining technology. They seem to assume that as long as you have a copy of wikipedia stored somewhere, you’ll be able to retain the current standard of technology. The problem with that is that so much of current human technology depends on experiential knowledge that is difficult to transmit. Youtube is actually helping a lot with this, but it remains a huge problem.

Space colonies are interesting because, in such a harsh environment, you need an enormous amount of technology to survive. This means that a self-sustaining space-colony is able to retain existing technologies by default. If they weren’t, they couldn’t maintain their quality of life. Space colonies also are closed by default, and have to make an active effort to take in resources from the rest of humanity.

Contrast that with a farmer in Kansas. That farmer can just keep farming, even if they lose the ability to make an iPhone or synthesize polymers. A Martian growing hydroponics would die if their society lost the ability to synthesize polymers. Casey Handmer discusses this at length in his excellent book “How to Industrialize Mars.”

Elon Musk wants to create a society on Mars that is self-sustaining, retains modern technologies, and only has contact with Earth every two years. In order to do this, he plans to send over a million people to Mars. That’s larger than most cities. It’s three times the size of the city I live in now. One million people is the minimum needed to make the Mars colony self-sustaining. All of those people will be working different jobs: farming, mining, making solar panels, programming, caring for children, etc. Fewer people means the Mars city isn’t self-sustaining anymore. Fewer people means that everyone dies if the shipments from Earth stop.

Closure on Earth

Later on in the article, N&R seem to implicitly abandon the idea of fully self-sustaining or fully tech-retaining lifeboats. They describe a couple of options for single building city-states that could be populated primarily by programmers. These city states could be easily sealed in case of disaster, thus saving the human race. They wouldn’t maintain our current technical standard, but they’d be closed and (mostly) self-sustaining.

N&R helpfully do some math and show that a building like the Pentagon could easily house 4,000 families, giving each family space equivalent to a 2-bedroom apartment. If a 2-bedroom apartment is only 500 to 1,000 square feet, then that’s true. But I have a question for them: what happens to all of the residents’ crap?

Literally: when the residents crap, where does that crap go? And where do they get their food? Let’s assume amazing solar panels on the roof can solve the electricity problem, but how do they fix those solar panels when they break? The Pentagon only works as a building because it has the infrastructure of a nation behind it.

The cost of a lifeboat

Nick and Rob don’t consider the vastly different costs of achieving their lifeboat requirements. At some point in their article, N&R arbitrarily say that they would only send 16,000 people to a hypothetical Mars lifeboat. The only argument they give for this is that it allows an “apples to apples” comparison with their terrestrial lifeboats. This is a horrible idea.

Let’s say you want to buy a drill, but you’re not sure if you want a battery powered drill or a drill with a cable. N&R would tell you to take the motor out of the battery-powered drill so that the two weigh the same amount, and then compare prices after that.

When buying lifeboats, as when buying drills, you want to identify the specs for each option along with the prices. Arbitrarily changing some parts of an option is going to affect the specs as well as the price. You need to take that into account.

I think that cost comparisons for lifeboats are likely to be deceptive until there are concrete designs. Even once realistic designs are available, apples to apples comparisons won’t be helpful. Different designs will be able to meet different levels of closure/sustenance/tech-retention. Trading off just on cost doesn’t let you make good decisions about what you’re really buying.

Sailing Onwards

I don’t want to harp on Nick and Rob too much. I think having lifeboats is a really good idea, and I think space-based lifeboats are worth thinking critically about. But I worry that the article they wrote together will give people an overly rosy picture of the prospect of “backing up” human civilization. Doing so is hard, and we’re not going to get a backup that has perfect closure, self-sustenance, and tech-retention.

Instead of trying to get a bunch of perfect lifeboats in every country, I’d rather we focus on having lifeboats that fall on many different places in the closure/sustainability/retention parameter space. Perhaps there are a few different designs, each of which would hedge against different types of threats. More closure would hedge against plague and nuclear fallout. More self-sustenance would hedge against large-scale economic collapse or war. More tech-retention would hedge against luddism (a la the Khmer Rouge).

Say you want all three. Say that you want perfect closure, perfect self-sustenance, and perfect tech retention. In that case, I think your only choice is going to be a Mars colony. Building such a Mars colony will take us an enormous amount of money. Getting it to self-sustaining will probably take at least 100 years of concerted effort. I think that’s something to work towards, but let’s go for the lower hanging lifeboat fruit first.