Nine years ago, my sister discovered lumps in her neck and arm and was diagnosed with cancer.
9年前,我妹妹在她的脖子和手臂上发现了肿块,随后她被诊断出患有癌症。
From that day, she started to benefit from the understanding that science has of cancer.
从那天起,她开始受益于关于癌症的科学理论。
Every time she went to the doctor,
每次她去看医生,
they measured specific molecules that gave them information about how she was doing and what to do next.
医生们会通过测量她体内的特定分子来获得她的生理状态信息,并决定接下来应该做什么。
New medical options became available every few years.
每隔几年就会有新的医疗手段可供选择。
Everyone recognized that she was struggling heroically with a biological illness.
每个人都意识到她在英勇地与一种生理疾病作斗争。
This spring, she received an innovative new medical treatment in a clinical trial.
今年春天,她在一次临床试验中接受了一种新的疗法。
It dramatically knocked back her cancer.
该疗法显著地击退了她的癌症。
Guess who I'm going to spend this Thanksgiving with?
猜猜我今年要跟谁一起过感恩节?
My vivacious sister, who gets more exercise than I do, and who, like perhaps many people in this room,
我那活泼的妹妹,她的运动量比我都多,而她,可能跟在座的很多人一样,
increasingly talks about a lethal illness in the past tense.
会越来越多地谈论过去的那种致命疾病。
Science can, in our lifetimes -- even in a decade -- transform what it means to have a specific illness.
科学可以在我们的有生之年--甚至在十年之内--改变患特定疾病的含义。
But not for all illnesses. My friend Robert and I were classmates in graduate school.
但不是所有的疾病。我和罗伯特是研究生同学。
Robert was smart, but with each passing month, his thinking seemed to become more disorganized.
罗伯特很聪明,但随着时间的流逝,他的思考似乎变得杂乱无章。
He dropped out of school, got a job in a store ... But that, too, became too complicated.
他退了学,在一家商店找了份工作。但那里的环境也让他觉得越来越难以应付。
Robert became fearful and withdrawn.
罗伯特变得恐惧和孤僻。
A year and a half later, he started hearing voices and believing that people were following him.
一年半后,他开始听到声音,确信有人在尾随他。
Doctors diagnosed him with schizophrenia, and they gave him the best drug they could.
医生诊断他患有精神分裂症,给了他当时能提供的最好的药物。
That drug makes the voices somewhat quieter, but it didn't restore his bright mind or his social connectedness.
这个药物让他脑中的声音变轻了,但并没有恢复他聪明的头脑或社会联系。
Robert struggled to remain connected to the worlds of school and work and friends.
罗伯特很难与学校、工作和朋友的世界保持连接。
He drifted away, and today I don't know where to find him. If he watches this, I hope he'll find me.
他走失了,今天我都不知道去哪里找到他。如果他看到这个演讲,我希望他会来找我。
Why does medicine have so much to offer my sister,
为什么医学可以对我妹妹有如此大的帮助,
and so much less to offer millions of people like Robert? The need is there.
但对像罗伯特那样的数百万人却无能为力呢?需求就在那里。
The World Health Organization estimates that brain illnesses like schizophrenia, bipolar disorder and major depression
世界卫生组织估计,像精神分裂症、双相情感障碍和重度抑郁症之类的大脑疾病,
are the world's largest cause of lost years of life and work.
是世界上导致寿命折损和工作能力丧失的最主要原因。
That's in part because these illnesses often strike early in life, in many ways, in the prime of life,
一定程度上是因为这些疾病经常发生在生命早期,往往也是生命的黄金时期,
just as people are finishing their educations, starting careers, forming relationships and families.
就在人们完成学业,开始职业发展,形成稳定关系和家庭的时期。
These illnesses can result in suicide; they often compromise one's ability to work at one's full potential;
这些疾病会引发自杀;它们经常会导致人们无法充分发挥自己的潜能;
and they're the cause of so many tragedies harder to measure:
它们也导致了很多难以估量的悲剧:
lost relationships and connections, missed opportunities to pursue dreams and ideas.
失去人际关系,错过追求梦想和实现理想的机会。
These illnesses limit human possibilities in ways we simply cannot measure.
这些疾病限制了人类的可能性,我们却无法衡量这种损失。
We live in an era in which there's profound medical progress on so many other fronts.
我们生活的时代,在很多其他领域已经实现了巨大的医疗进步。
My sister's cancer story is a great example, and we could say the same of heart disease.
我妹妹的故事就是一个很好的例子,对于心脏病,我们同样可以做到。
Drugs like statins will prevent millions of heart attacks and strokes.
像他汀类的药物可以预防数百万例心脏病发作和中风。
When you look at these areas of profound medical progress in our lifetimes, they have a narrative in common:
当你留心观察我们生活中这些有着深远的医疗进步的领域,它们都有一个共同的特点:
scientists discovered molecules that matter to an illness,
科学家发现了与疾病有关的分子,
they developed ways to detect and measure those molecules in the body,
发明了检测和测量体内分子的方法,
and they developed ways to interfere with those molecules using other molecules -- medicines.
并开发了用其他分子,也就是药物,来干扰这些分子的方法。
It's a strategy that has worked again and again and again.
这是一种不断重复的策略。
But when it comes to the brain, that strategy has been limited,
但涉及到大脑时,这个策略的作用受到了限制,
because today, we don't know nearly enough, yet, about how the brain works.
因为今天,我们对大脑如何工作的了解还很有限。
We need to learn which of our cells matter to each illness, and which molecules in those cells matter to each illness.
我们需要知道哪个细胞与疾病有关,这些细胞中的哪些分子对哪种疾病起到了关键作用。
And that's the mission I want to tell you about today.
这就是我今天要向各位介绍的使命。
My lab develops technologies with which we try to turn the brain into a big-data problem.
我的实验室开发了可以把大脑问题转变为大数据问题的技术。
You see, before I became a biologist, I worked in computers and math, and I learned this lesson:
在成为生物学家前,我的工作围绕着电脑和数学,从中我有了这样的收获:
wherever you can collect vast amounts of the right kinds of data about the functioning of a system,
只要你能收集到关于某个系统功能的大量正确的数据,
you can use computers in powerful new ways to make sense of that system and learn how it works.
你就可以在电脑上用强大的新方法搞清楚该系统及其工作原理。
Today, big-data approaches are transforming ever-larger sectors of our economy,
今天,大数据方法正在改变我们经济中规模越来越大的部门,
and they could do the same in biology and medicine, too.
它们也可以在生物和医学上有所作为。
But you have to have the right kinds of data.
但你必须得有正确的数据。
You have to have data about the right things.
你必须得到真正想要的数据。
And that often requires new technologies and ideas.
而这通常依赖于新的技术和想法。
And that is the mission that animates the scientists in my lab.
这就是我实验室里的科学家们的使命。
Today, I want to tell you two short stories from our work.
今天,我想要告诉各位我们工作中的两个小故事。
One fundamental obstacle we face in trying to turn the brain into a big-data problem
在试图把大脑转化为大数据问题时,摆在我们面前的一个基本障碍是,
is that our brains are composed of and built from billions of cells.
我们的大脑由数十亿细胞组成。
And our cells are not generalists; they're specialists.
这些细胞不是多面手,它们是专家。
Like humans at work, they specialize into thousands of different cellular careers, or cell types.
就如人们在工作中一样,它们分别擅长于成千上万不同的细胞职业,或细胞类型。
In fact, each of the cell types in our body could probably give a lively TED Talk about what it does at work.
事实上,我们可以围绕身体的每个细胞类型,在TED上做一场有关它们工作原理的生动的演讲。
But as scientists, we don't even know today how many cell types there are,
但作为科学家,我们今天甚至还不知道总共有多少细胞类型,
and we don't know what the titles of most of those talks would be.
也不知道大部分演讲的标题是什么。
Now, we know many important things about cell types.
我们已经了解了关于细胞类型的很多重要的信息。
They can differ dramatically in size and shape.
它们的大小和形状有很大的差异。
One will respond to a molecule that the other doesn't respond to, they'll make different molecules.
有些会对某个分子产生反应,另一些则不会,它们会制造不同的分子。
But science has largely been reaching these insights in an ad hoc way, one cell type at a time, one molecule at a time.
但是科学在很大程度上是以一种特别的方式来得到这些见解的,一次一种细胞类型,一次一种分子。
We wanted to make it possible to learn all of this quickly and systematically.
我们希望能够快速、系统地学习所有这些知识。
Now, until recently, it was the case that if you wanted to inventory all of the molecules in a part of the brain or any organ,
一直以来,如果你想要对大脑或任何器官的所有分子进行编目,
you had to first grind it up into a kind of cellular smoothie. But that's a problem.
你得先把这些细胞研磨成奶昔一样的浆状。但这就是问题了。
As soon as you've ground up the cells, you can only study the contents of the average cell -- not the individual cells.
一旦你已经把细胞磨碎了,你就只能在平均水平上研究细胞--无法得到单个细胞的信息。
Imagine if you were trying to understand how a big city like New York works,
假设你想搞清楚像纽约这样的大城市是如何运转的,
but you could only do so by reviewing some statistics about the average resident of New York.
但只能通过查看纽约居民的平均统计数据。
Of course, you wouldn't learn very much,
当然,这样一来你得到的信息就很有限了,
because everything that's interesting and important and exciting is in all the diversity and the specializations.
因为有趣、重要、让人激动的一切都蕴藏在多样性和专门化中。
And the same thing is true of our cells.
我们的细胞也同样如此。
And we wanted to make it possible to study the brain not as a cellular smoothie but as a cellular fruit salad,
我想要让研究大脑不像研究奶昔那样,而像研究水果沙拉,
in which one could generate data about and learn from each individual piece of fruit.
这样就能从每一片水果中得到数据进行学习。
So we developed a technology for doing that. You're about to see a movie of it.
于是我们为此开发了一种技术。下面展示的就是关于它的影片。
Here we're packaging tens of thousands of individual cells, each into its own tiny water droplet for its own molecular analysis.
我们打包了成千上万的单个细胞,每一个都拥有包裹自身的小水滴,以用来做自身的分子分析。
When a cell lands in a droplet, it's greeted by a tiny bead, and that bead delivers millions of DNA bar code molecules.
当一个细胞降落在一个小液滴上时,就会接触到一个小珠子,而这个珠子能传递数百万个DNA条码分子。
And each bead delivers a different bar code sequence to a different cell.
每一个珠子都向不同的细胞传递不同的条形码序列。
We incorporate the DNA bar codes into each cell's RNA molecules.
我们将DNA条码整合到每个细胞的RNA分子中。
Those are the molecular transcripts it's making of the specific genes that it's using to do its job.
这些是它用来完成工作的特定基因的分子转录信息。
And then we sequence billions of these combined molecules
然后我们对数十亿的组合分子进行测序,
and use the sequences to tell us which cell and which gene every molecule came from.
并利用这些序列来了解每个分子分别来自于哪个细胞和哪个基因。
We call this approach "Drop-seq," because we use droplets to separate the cells for analysis,
我们称这种方法为“液滴测序”,因为我们使用液滴分离细胞来做分析,
and we use DNA sequences to tag and inventory and keep track of everything.
我们使用DNA序列来标记、编目和追踪所有信息。
And now, whenever we do an experiment, we analyze tens of thousands of individual cells.
每次做实验,我们都会分析数以万计的单细胞。
And today in this area of science,
当今,在这个科学领域,
the challenge is increasingly how to learn as much as we can as quickly as we can from these vast data sets.
我们面临的挑战是如何尽可能多,且尽可能快地从这些海量数据集中学习。
When we were developing Drop-seq, people used to tell us,
当我们发明液滴测序时,人们告诉我们,
"Oh, this is going to make you guys the go-to for every major brain project."
“哦,这将使你们的工作成为每个主要大脑项目的首选。”
That's not how we saw it. Science is best when everyone is generating lots of exciting data.
我们不是这样看的。当每个人都产生大量令人兴奋的数据时,科学就是最好的手段。
So we wrote a 25-page instruction book, with which any scientist could build their own Drop-seq system from scratch.
于是我们写了25页的指南,任何科学家都可以借此从零开始,开发他们自己的液滴测序技术。
And that instruction book has been downloaded from our lab website 50,000 times in the past two years.
这个指南过去两年在我们实验室网站的下载次数为5万次。
We wrote software that any scientist could use to analyze the data from Drop-seq experiments,
每个科学家还可以用我们编写的软件来分析通过液滴测序得到的实验数据,
and that software is also free, and it's been downloaded from our website 30,000 times in the past two years.
而这个软件也是免费的,过去两年在我们的网站被下载了3万次。
And hundreds of labs have written us about discoveries that they've made using this approach.
数百家实验室给我们写信,介绍了他们使用这种方法得到的发现。
Today, this technology is being used to make a human cell atlas.
今天,这项技术已经被用来制作人类细胞图谱。
It will be an atlas of all of the cell types in the human body and the specific genes that each cell type uses to do its job.
它是人体所有细胞类型,以及每个用来完成其工作的细胞类型的特定基因的图谱。
Now I want to tell you about a second challenge that we face in trying to turn the brain into a big data problem.
现在我想谈一下把大脑问题转变为大数据问题所面临第二个挑战。
And that challenge is that we'd like to learn from the brains of hundreds of thousands of living people.
那就是,我们需要研究成千上万活人的大脑。
But our brains are not physically accessible while we're living.
但是我们还没有办法接触活体大脑。
But how can we discover molecular factors if we can't hold the molecules?
如果我们不能控制分子,要如何发现分子因子呢?
An answer comes from the fact that the most informative molecules, proteins, are encoded in our DNA,
答案来自于信息最丰富的分子,蛋白质,它们编码在我们的DNA中,
which has the recipes our cells follow to make all of our proteins.
DNA携带了我们的细胞所遵循的食谱,用来制造我们所有的蛋白质。
And these recipes vary from person to person to person in ways
这些食谱的内容因人而异,
that cause the proteins to vary from person to person in their precise sequence and in how much each cell type makes of each protein.
所制造的蛋白质会根据不同人的精确序列而变化,而且每个细胞类型对每种蛋白质的影响程度不同。
It's all encoded in our DNA, and it's all genetics, but it's not the genetics that we learned about in school.
这些信息全都编码在我们的DNA中,都是可遗传的,但这不是我们在学校学到的遗传。
Do you remember big B, little b? If you inherit big B, you get brown eyes? It's simple.
你还记得大B,小b吗?如果继承了大B,你就有棕色的眼睛?原理很简单。
Very few traits are that simple. Even eye color is shaped by much more than a single pigment molecule.
很少有这样简单的特征。即便塑造眼睛颜色的因素也要比单一色素分子多很多。
And something as complex as the function of our brains is shaped by the interaction of thousands of genes.
像我们大脑功能那样复杂的东西是由上千个基因的相互作用塑造的。
And each of these genes varies meaningfully from person to person to person,
每一个基因在人与人之间都有显著的差异,
and each of us is a unique combination of that variation. It's a big data opportunity.
我们每个人都是这种变异的独特组合。这是大数据的机会。
And today, it's increasingly possible to make progress on a scale that was never possible before.
今天,我们越来越有可能在史无前例的规模上取得进展。
People are contributing to genetic studies in record numbers,
参与遗传研究的人数创下了记录,
and scientists around the world are sharing the data with one another to speed progress.
全球各地的科学家彼此分享数据以加速取得进展。
I want to tell you a short story about a discovery we recently made about the genetics of schizophrenia.
我想通过一个简短的故事介绍一下我们最近在精神分裂遗传学方面的发现。
It was made possible by 50,000 people from 30 countries, who contributed their DNA to genetic research on schizophrenia.
该发现包含了来自30多个国家的5万人贡献的DNA,用来进行精神分裂的遗传研究。
It had been known for several years that
很多年前我们就知道,
the human genome's largest influence on risk of schizophrenia comes from a part of the genome
人类基因组对患上精神分裂症风险的最大影响来自我们的部分基因组,
that encodes many of the molecules in our immune system.
这些基因组编码了我们免疫系统中的很多分子。
But it wasn't clear which gene was responsible.
但目前还不清楚哪个基因起了作用。
A scientist in my lab developed a new way to analyze DNA with computers, and he discovered something very surprising.
我实验室的科学家开发了一个使用电脑分析DNA的新方法,他发现了一些让人惊讶的事情。
He found that a gene called "complement component 4" -- it's called "C4" for short
他发现了一个被称为补体成分4的基因--简称为C4,
comes in dozens of different forms in different people's genomes,
在不同人的基因组中有几十种不同的形式,
and these different forms make different amounts of C4 protein in our brains.
这些不同的形式会产出我们大脑中不同数量的C4蛋白质。
And he found that the more C4 protein our genes make, the greater our risk for schizophrenia.
他发现我们的基因产生的C4蛋白质越多,患精神分裂的风险就越高。
Now, C4 is still just one risk factor in a complex system.
目前,C4只是一个复杂系统中的风险因素之一。
This isn't big B, but it's an insight about a molecule that matters.
这不是大B,但这是一个对重要分子的洞察。
Complement proteins like C4 were known for a long time for their roles in the immune system,
像C4那样的补体分子因它们在免疫系统中的角色很早就被人了解,
where they act as a kind of molecular Post-it note that says, "Eat me."
它们扮演着类似便利贴的角色,写着,“吃我”。
And that Post-it note gets put on lots of debris and dead cells in our bodies and invites immune cells to eliminate them.
这些便利贴被放在我们身体的很多废弃物和死细胞上,邀请免疫细胞去清除它们。
But two colleagues of mine found that the C4 Post-it note also gets put on synapses in the brain and prompts their elimination.
但我们的两个同事发现C4便利贴也被贴到了大脑的突触上面,促进了这些突触连接消失。
Now, the creation and elimination of synapses is a normal part of human development and learning.
突触的创造和消除是人类发展和学习的正常部分。
Our brains create and eliminate synapses all the time.
我们的大脑一直在创造和消除突触。
But our genetic results suggest that in schizophrenia, the elimination process may go into overdrive.
但我们的遗传研究结果表明,在精神分裂过程中,这个清除可能在超速运行。
Scientists at many drug companies tell me they're excited about this discovery,
很多医药公司的科学家告诉我,他们对这个发现感到非常兴奋,
because they've been working on complement proteins for years in the immune system, and they've learned a lot about how they work.
因为他们在免疫系统的补体分子上已经花费了数年功夫,对这些分子的工作原理也有了更深入的了解。
They've even developed molecules that interfere with complement proteins,
他们甚至开发了分子来干预补体分子,
and they're starting to test them in the brain as well as the immune system.
并在大脑和免疫系统中进行测试。
It's potentially a path toward a drug that might address a root cause rather than an individual symptom,
这可能是一种去除根本病因的药物,而不只针对单个症状,
and we hope very much that this work by many scientists over many years will be successful.
我们非常希望许多科学家多年来所做的工作能够成功。
But C4 is just one example of the potential for data-driven scientific approaches to open new fronts on medical problems that are centuries old.
但C4只是数据驱动的科学方法的一个例子,有可能在存在了几个世纪的医疗问题上开辟新的战线。
There are hundreds of places in our genomes that shape risk for brain illnesses,
在我们的基因组中有数百个地方存在影响大脑疾病的风险,
and any one of them could lead us to the next molecular insight about a molecule that matters.
它们中的任何一个都能带给我们关于下一个重要分子的洞见。
And there are hundreds of cell types that use these genes in different combinations.
有数百种细胞类型在不同的组合中使用这些基因。
As we and other scientists work to generate the rest of the data that's needed and to learn all that we can from that data,
我和其他科学家合作生成了能够让我们获得所有信息的余下的部分数据,
we hope to open many more new fronts.
我们希望开辟更多的新战线。
Genetics and single-cell analysis are just two ways of trying to turn the brain into a big data problem.
遗传和单细胞分析只是试图将大脑转化为大数据问题的两种方式。
There is so much more we can do.
我们能做的事情太多了。
Scientists in my lab are creating a technology for quickly mapping the synaptic connections in the brain
我实验室的科学家正在开发一种技术来快速绘制大脑中的突触连接,
to tell which neurons are talking to which other neurons and how that conversation changes throughout life and during illness.
以辨别哪些神经元在与其他神经元交流,以及这些交流在衰老和疾病中是如何变化的。
And we're developing a way to test in a single tube how cells with hundreds of different people's genomes respond differently to the same stimulus.
我们正在开发一种方法,在单管道中测试包含上百种人类基因的细胞如何对同样的刺激做出不同的反应。
These projects bring together people with diverse backgrounds and training and interests
这些项目将拥有不同背景,不同教育和兴趣,
biology, computers, chemistry, math, statistics, engineering.
如生物、计算机、化学、数学、 统计学、工程学的人吸引到一起。
But the scientific possibilities rally people with diverse interests into working intensely together.
科学的可能性让兴趣各异的人聚集到一起,共同努力工作。
What's the future that we could hope to create? Consider cancer.
我们期待的未来是什么样呢?想想癌症。
We've moved from an era of ignorance about what causes cancer,
我们已经走出对癌症致因一无所知的时代,
in which cancer was commonly ascribed to personal psychological characteristics,
那时癌症常被归因于个人的心理特征,
to a modern molecular understanding of the true biological causes of cancer.
而今天,对引发癌症真正的生物学原因,我们已经有了现代分子层面的认识。
That understanding today leads to innovative medicine after innovative medicine, and although there's still so much work to do,
这种认识引领了不断创新的医学,尽管仍然有很多的工作要做,
we're already surrounded by people who have been cured of cancers that were considered untreatable a generation ago.
我们周围已经有很多人的癌症被治愈了,而在一代人以前,这些癌症还被认为是无药可治的。
And millions of cancer survivors like my sister
数百万像我妹妹那样的癌症幸存者
find themselves with years of life that they didn't take for granted and new opportunities for work and joy and human connection.
发现自己拥有了意外得到的若干年生命,以及工作、快乐和建立人际关系的新机遇。
That is the future that we are determined to create around mental illness
这就是我们决心围绕精神疾病去创造的未来,
one of real understanding and empathy and limitless possibility. Thank you.
一个充满着真正的理解、共情和无限可能的未来。谢谢。