Are there drawings with more detail? I'm unclear on where the gate dielectric and the channel are. Is the silicon doped differently "inside" the gate to form the channel? That seems hard to fabricate. (I searched around a bit but couldn't find a better diagram.)
Just wanted to point out that 3nm or 2nm are nothing but marketing terms. The physical channel length will remain at around 10 nm throughout this decade, irrespective of the device architecture.
See Fig. 1.2 of my PhD thesis: https://etheses.whiterose.ac.uk/22492/1/Novel%20Approaches%2...
Why is that? What’s the point in calling it “3nm” when it’s not?
<rant>
Also, how is that not false advertising then? If I’m told a device is 5nm, and that traditionally meant the gate length, why isn’t it a lie when they’re actually 10nm or whatever? We need to stop calling it “marketing terms” and call it what it is: lies. Just because everyone does it doesn’t make it right.
That is not a great measurement. Transistors can be of different types, each with different value in terms of making products work. Other things matter too: metal, pads, die separation saw cuts, e-fuses, diodes, resistors, and capacitors. Nobody cares about transistors that can't be hooked up to anything.
Well, because of tradition. Historically it made sense to define a new technology node in terms of physical gate length. Now, even though that's no longer viable, they keep the naming convention, for better marketing I guess.
But if we’re not basing it on reality, when does “5nm” become “3nm”? And the fact that it’s just marketing makes comparing across manufacturers (eg. Intel vs. TSMC) impossible.
Short answer is tradition. Historically they were able to stay on Moore's law, with each new technology node representing the physical gate length. Currently, even though they can't continue on that trajectory, they still name each new technology node as if they could.
One thing I've always wondered about when it comes to new process design at smaller scales: how much actual quantum mechanics is needed to get the job done?
And ... if the answer is, as I suspect, a lot, what kind of numerical methods and processes are used to design and simulate these tiny quantum mechanical machines?
[EDIT] I mean, when taking a basic QM course, there are a lot of contortions to try and find analytical solutions to the Schrödinger equation, but as soon as you have three particles interacting with each other, analytical methods run into a wall.
Am I right to think that sub-10nm process design is all done numerically?
Anyone who happens to work on this type of problem care to give pointers?
Depends on what you mean by get the job done. (Rather which job)
If you are doing research into designing advanced transistors with new geometries or new materials (which is what I did my graduate research in), you would be using something like DFT (density functional theory) for equilibrium analysis and NEGF, Hückel theory etc. for simulating current. These methods only realistically work on ~500-1000 atom systems, beyond which the simulation takes too long to run even on supercomputers (which is what I was using). I think GPUs would be very useful here, but there weren't any tools at the time that were seriously optimized for GPU. The codes I was using were SIESTA/TransSIESTA, Atomistix, QuantumEspresso and others.
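To give a flavour of what the NEGF part looks like, here's a toy one-dimensional tight-binding chain between two semi-infinite leads, with the transmission computed as T = Tr[Γ_L G Γ_R G†]. This is nothing like the production codes above, and every parameter is made up:

```python
# Toy 1-D tight-binding NEGF transmission calculation (illustrative only).
import numpy as np

t   = 1.0    # hopping energy, arbitrary units (assumed)
N   = 20     # number of device sites (assumed)
eta = 1e-9   # small imaginary part for the retarded Green's function

# Device Hamiltonian: a uniform chain; in real work this would come from DFT.
H = np.zeros((N, N))
for i in range(N - 1):
    H[i, i + 1] = H[i + 1, i] = -t

def surface_g(E):
    """Retarded surface Green's function of a semi-infinite 1-D lead."""
    z = E + 1j * eta
    r = np.sqrt(z * z - 4 * t * t + 0j)
    g = (z - r) / (2 * t * t)
    return g if g.imag <= 0 else (z + r) / (2 * t * t)  # pick the causal root

def transmission(E):
    g = surface_g(E)
    sigma_L = np.zeros((N, N), complex); sigma_L[0, 0]   = t * t * g
    sigma_R = np.zeros((N, N), complex); sigma_R[-1, -1] = t * t * g
    G = np.linalg.inv((E + 1j * eta) * np.eye(N) - H - sigma_L - sigma_R)
    gamma_L = 1j * (sigma_L - sigma_L.conj().T)
    gamma_R = 1j * (sigma_R - sigma_R.conj().T)
    return np.trace(gamma_L @ G @ gamma_R @ G.conj().T).real

for E in (-1.0, 0.0, 1.0):
    print(f"T({E:+.1f}) = {transmission(E):.3f}")  # ~1 inside the band for a uniform chain
```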
For simulating multiple transistors, or transistors with a large geometry (for example 14nm gate length), you would use TCAD simulators that use FEM + measured parameters to simulate the transistors. The equations behind these are traditional semiconductor equations with a bunch of heuristics and curve fitting. The main tool I used was Sentaurus TCAD.
For simulating larger circuits, say a low-noise amplifier or maybe a small DAC, you would use tools like Cadence Virtuoso + the provided PDK from your foundry. The equations here are simpler than the ones used in Sentaurus and they are also calibrated to measurement.
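To show how much simpler the circuit-level models are, here's the textbook long-channel square-law model, which is only a sketch of the idea behind the measurement-calibrated compact models (e.g. BSIM-class) a real PDK ships; every parameter below is invented:

```python
# Textbook long-channel MOSFET square-law model -- far simpler than what a
# real PDK uses, but the same idea: a compact analytic I-V expression whose
# parameters are fitted to measurement.
def nmos_id(vgs, vds, vth=0.4, k=2e-4, lam=0.05):
    """Drain current (A). vth (V), k (A/V^2) and lam (1/V) are assumed, fitted values."""
    vov = vgs - vth
    if vov <= 0:
        return 0.0                              # cut-off (ignoring subthreshold leakage)
    if vds < vov:
        return k * (vov * vds - vds**2 / 2)     # triode / linear region
    return 0.5 * k * vov**2 * (1 + lam * vds)   # saturation, with channel-length modulation

for vgs in (0.3, 0.6, 0.9):
    print(f"Vgs={vgs:.1f} V -> Id={nmos_id(vgs, vds=0.9)*1e6:.1f} uA")
```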
Just as a note, GPU-supported code only now seems to be coming online, with e.g. the Quantum Espresso GPU version getting an alpha in 2019. I think a large part of that is due to the 64-bit precision requirement, which makes the speedup on GPUs less impressive (and largely rules out the consumer-level cards that are popular for classical molecular dynamics software like GROMACS). Plus Gaussian, while having had support since the Kepler days, notes that earlier cards didn't have the required memory (which is also a huge problem for consumer cards), making adoption slower because of the added costs.
Thank you for all these pointers, this is really cool.
In particular, I had never heard of DFT ... really interesting, and the first time I've been exposed to what feels like real "hands-on" QM (as opposed to the very stripped-down systems one learns about in introductory QM textbooks).
Full disclosure, I don't work on processors but am in a tangentially related field.
However, I don't feel that you need an analytical solution to the Schrödinger equation. Even in chemistry we don't use analytical solutions; instead we use fancy basis sets which let us compute approximations.
Regardless, I don't think even that is particularly necessary, as quantum effects at that scale basically mean you have some level of leakage where electrons can simply tunnel through the barrier created by the transistor when it's off. So if I had to guess, most of the work is finding ways to rectify this leakage so it doesn't affect calculations, probably similar to a form of error correction.
(This ignores that you may have to do some initial quantum calculations using Density Functional Theory to get a guess at how much leakage to expect based on the materials you are using, though if I had to guess most of that work was done a while ago.)
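For what it's worth, even the textbook rectangular-barrier tunnelling formula shows why that leakage blows up as things get thinner. A rough sketch, where the barrier height and widths are made-up illustrative values rather than a real gate stack:

```python
# Toy estimate of direct tunnelling through a rectangular barrier, just to
# show the exponential sensitivity to barrier width. Not a real device model.
import numpy as np

HBAR = 1.054571817e-34      # J*s
M_E  = 9.1093837015e-31     # electron mass, kg
EV   = 1.602176634e-19      # J per eV

def transmission(E_eV, V0_eV, width_nm):
    """Exact transmission through a rectangular barrier, for E < V0."""
    E, V0, a = E_eV * EV, V0_eV * EV, width_nm * 1e-9
    kappa = np.sqrt(2 * M_E * (V0 - E)) / HBAR
    return 1.0 / (1.0 + (V0**2 * np.sinh(kappa * a)**2) / (4 * E * (V0 - E)))

for w in (3.0, 2.0, 1.0):   # barrier widths in nm (assumed)
    print(f"{w:.0f} nm barrier: T = {transmission(0.1, 3.0, w):.2e}")
```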
Solid-state physics has a bunch of different models to try to explain the behavior of electrons in semiconductors and conductors. Most of them are only valid in specific situations, and they sometimes give erroneous results unless you apply them carefully. That being said, you're absolutely right that the industry uses approximations. They usually use some kind of physics simulation package similar to the ones electrical engineers use when designing circuits.
Hacker News comment sections on articles about semiconductors always seem to be full of people talking authoritatively about things they clearly don't understand. And this is visible even to someone who only majored in computer engineering, got basic training in transistors and logic design, and whose day job is now in software. I can only imagine the people who actually know transistors cringing in here (and thus avoiding commenting).
I worked at an aerospace startup for seven years, and while not an expert, I really know my way around some spacey stuff. People speaking authoritatively about that on HN also make me cringe. I rarely intervene, because they almost always engage in a discussion about how they are right even when I send references to papers and book chapters. On Reddit it's even worse.
With law and politics I feel there's a similar attitude going around, but those are of course more up-for-debate topics.
I agree with you (I have been following silicon fabrication news for 20 years already). I cringe when I see people commenting on, say, Intel replacing its CEO, or claiming RISC-V will replace ARM at design houses in the next few years, when it's clear the commenters haven't taken the time to do even cursory research.
I don't even want to think of the comments I've seen for my main subject (cybersecurity, exploits, and vulnerabilities). I will never be able to correct all of them.
I think some humility would be nice for all of us. We know what we know, and we should be aware of what we don't know. I don't know much about web programming, and I freely admit it. I certainly don't pretend to have anything insightful to say about it.
I've done semiconductor digital physical design for the last 23 years. But almost everything I do is at the standard cell (AND/OR/flip-flop) level; I generally don't do anything at the transistor level. I have seen some good comments here in the past.
"Briefly stated, the Gell-Mann Amnesia effect is as follows. You open the newspaper to an article on some subject you know well. In Murray’s case, physics. In mine, show business. You read the article and see the journalist has absolutely no understanding of either the facts or the issues. Often, the article is so wrong it actually presents the story backward—reversing cause and effect. I call these the “wet streets cause rain” stories. Paper’s full of them.
In any case, you read with exasperation or amusement the multiple errors in a story, and then turn the page to national or international affairs, and read as if the rest of the newspaper was somehow more accurate about Palestine than the baloney you just read. You turn the page, and forget what you know.”
It's not just computer engineering. I'm in natural language processing and was in academia, and I see a lot of mistakes and misunderstandings in the comments on those topics. Probably the only thing reliable in HN comments is website design.
I tend to disagree; while some comments seem to be clearly wrong, mistakes tend to be pointed out by other commenters, and you can always check bios. It's a bit riskier on stories that don't stay on the front page, I'll grant you that.
And there can be a bit of Dunning-Kruger if you work in a related field too.
It's been a while since node names were actually physical measurements though. For example, at the 3nm node the fin width is 5nm (as noted in the article).
Besides being able to pack more transistors into a given area, a smaller transistor has less gate capacitance (to first order). This means it can switch faster (smaller RC time constant) and less energy is expended in switching. Thus, going from generation to generation, the overall energy expenditure of a chip can be kept within a reasonable range despite adding many more transistors. You also may have heard of a "die shrink", where an existing design gets shrunk to the next technology node, using less power and clocking faster.
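To put rough numbers on the capacitance point, here's an invented example (none of these values correspond to a real node):

```python
# Back-of-the-envelope scaling of switching power with gate capacitance.
def dynamic_power(alpha, C, V, f):
    """Average switching power: activity factor * capacitance * V^2 * frequency."""
    return alpha * C * V * V * f

C_old, C_new = 1.0e-15, 0.7e-15    # gate capacitance in F (assumed 0.7x shrink)
V, f, alpha  = 0.8, 3e9, 0.1       # supply (V), clock (Hz), activity factor (assumed)

print(f"old: {dynamic_power(alpha, C_old, V, f) * 1e6:.2f} uW per gate")
print(f"new: {dynamic_power(alpha, C_new, V, f) * 1e6:.2f} uW per gate")
# The RC delay scales the same way: with R roughly constant, 0.7x C gives ~0.7x delay.
```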
Shrinking isn't always a walk in the park though. A few nodes ago, subthreshold leakage became a big problem until better electrostatic control of the channel (eventually FinFETs) reined it back in.
Smaller distances -> lower resistance -> less heat -> higher clocks / more work per clock within the same thermal budget -> higher performance.
Smaller dimensions mean you can use a shorter wire.
As you can see in the diagram in this article, there is a large push towards increasing the height of the transistors. That has been going on for more than a decade.
As for the width, a finer process means you can keep the width of the most critical transistors the same, but you can also trade it off for less width (and performance) where it is less important.
So, overall, smaller dimensions lead to lower resistance. You can trade some of the gain for density, but you'll always get some reduction in resistance.
None of these reasons are as important as transistor count per mm^2. If you can shrink the die size in half, you can effectively cut its cost roughly in half as well. Wafers go through around 400 processing steps in the fab, and fab capacity is limited by how many useful chips you can build on a given 300mm wafer.
Exactly, the smaller your chip is the more you can fit on a silicon wafer.
If your chip is too large, it can even become practically impossible to manufacture at scale, because the chance of a defect landing on any given die grows with its area.
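A back-of-the-envelope sketch of why, using the usual dies-per-wafer approximation and a simple Poisson yield model (the defect density and die areas are made up):

```python
# Rough dies-per-wafer and yield estimate to show why die area matters so much.
import math

def dies_per_wafer(die_area_mm2, wafer_diam_mm=300):
    """Common approximation: gross dies on a round wafer minus edge loss."""
    r = wafer_diam_mm / 2
    return math.pi * r * r / die_area_mm2 - math.pi * wafer_diam_mm / math.sqrt(2 * die_area_mm2)

def poisson_yield(die_area_mm2, defects_per_mm2=0.001):
    """Poisson yield model; 0.001 defects/mm^2 is an assumed, illustrative value."""
    return math.exp(-defects_per_mm2 * die_area_mm2)

for area in (100, 400, 800):   # die area in mm^2 (assumed)
    n = dies_per_wafer(area)
    y = poisson_yield(area)
    print(f"{area:4d} mm^2: ~{n:5.0f} dies, yield ~{y:.0%}, good dies ~{n * y:.0f}")
```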
It's crazy to think about if you've never thought about this, but the speed of light is a bottleneck for processors. When we get smaller devices, there's literally less distance that needs to be traversed, so more can be done!
An electrical signal travels at close to the speed of light (the electrons themselves do not), even though of course no light travels inside the copper.
If something oscillates at 1 GHz, the phase is opposite 15 cm down the wire. To me it's perfectly correct to say that the speed of light affects the design a lot, and in many places it probably is a bottleneck.
And the "electric signal" is an electromagnetic wave- also known as light. Nowhere did they imply the speed of light in a vacuum, the speed of light in copper is an equally valid interpretation.
More charitably, charge carriers do move much more slowly than the electric field they transmit[0], and while you’re correct that the time-varying electric field in a processor is not light (nor even a radio wave), if the chips were much larger or much higher frequency the chips and buses would risk becoming antennas and having all the problems that would bring — 5 Ghz ~= 6cm wavelength [1] ~= 3cm half-wave dipole.
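Quick sanity check of those free-space numbers:

```python
# Free-space wavelength at 5 GHz and the corresponding half-wave dipole length.
c = 3.0e8            # speed of light, m/s
f = 5e9              # 5 GHz
wavelength = c / f   # ~0.06 m
print(f"wavelength ~{wavelength * 100:.0f} cm, half-wave dipole ~{wavelength / 2 * 100:.0f} cm")
```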
The numbers roughly correspond to the width of wires in the circuits, but the number of circuits you can fit per unit area depends on the square of that number, so going from 7nm to 5nm roughly doubles density. The first microprocessor[1] was around 10,000nm, so we're approaching 2,000x thinner wires, or roughly 4 million times more circuits per unit area (the latest Apple M1[2], at 5nm, has about 8,000,000x as many transistors as the 4004, helped also by a much larger die).
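The arithmetic behind that, taking the node names at face value (which, as discussed elsewhere in the thread, they are not):

```python
# Feature-size scaling arithmetic from the comment above.
first_cpu_nm, modern_nm = 10_000, 5
linear = first_cpu_nm / modern_nm          # ~2,000x smaller features
areal  = linear ** 2                       # ~4,000,000x more per unit area
print(f"{linear:,.0f}x linear, {areal:,.0f}x areal")
print(f"7nm -> 5nm density gain: {(7/5)**2:.2f}x")
```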
Smaller distance to travel so signals get from one gate to another quicker which will enable a higher clock speed.
Smaller devices use less power so less heat and longer battery life.
Smaller devices mean a smaller chip, which is cheaper (although mask costs will be more expensive), or you can use the extra area for more features like more cache or another processor core.
With a fixed yield, an exact 100% increase in transistor density translates to a 50% smaller die size.
On a wafer, that equates to double the number of dies. All of a sudden your profits increase dramatically.
5nm also has a better power curve, so at the same clock speed you get lower energy usage. Hence you can push for higher performance if needed.
The first point, about unit economics, is important for the industry. If you have high enough volume, say hundreds of millions of chips per year, then it makes sense to move to the next node for cost savings. If you have a small-volume or low-margin chip, then the design cost, which is the most expensive part of chip making, would not work in your favour.
It also depends on wafer price: if 5nm is double the price of 7nm, then in the above example your unit cost would be exactly the same.
The second point is important for CPUs and other things that are increasingly computationally expensive, like WiFi 6 and 5G modems. You want your smartphone to last longer on battery, so these work better on an energy-efficient node.
So basically it is a cost/performance trade-off.
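To make that trade-off concrete (all numbers hypothetical):

```python
# Unit-cost comparison: 2x density halves die area, so dies per wafer double;
# if the wafer price also doubles, cost per die comes out identical.
def cost_per_die(wafer_price, dies_per_wafer):
    return wafer_price / dies_per_wafer

dies_7nm = 600                          # assumed dies per wafer at 7nm
dies_5nm = dies_7nm * 2                 # 2x density -> half the die size -> 2x dies
price_7nm, price_5nm = 9_000, 18_000    # assumed wafer prices in USD

print("7nm:", cost_per_die(price_7nm, dies_7nm), "USD/die")
print("5nm:", cost_per_die(price_5nm, dies_5nm), "USD/die")
```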
Smaller means closer together. Closer together means less time for a signal to move from one to another. Less time means higher clock speeds.
If your CPU is 100mm across, the speed of light limits it to 3GHz, because that's how many times per second a signal can cross the CPU travelling at c. At 10mm you get 30GHz.
I don't really know enough to refute it, but this seems deeply and bizarrely wrong. It doesn't account for transistor count or density, just the size of the entire chip. With pipelining, I don't think a signal has to travel across the entire chip every cycle. It also doesn't really address the question above, since single-core CPU speeds haven't increased in 15 years even though transistors have kept getting smaller and closer together.
Seems like an intriguing napkin math limit/simplification though, I'd be interested if anyone could elaborate on if there's any substance to it.
The speed of an EM signal in copper is roughly 60% of the speed of light. You also have to account for timing jitter and wait until you are sure that everybody has the signal, to prevent going out of sync.
This means that reliable distance from a single clock is just a fraction of what the speed of signal theoretically allows.
Clock distribution networks use local clocks to buffer and amplify the global clock, but they take up a significant amount of chip area and make the chip larger. Clock distribution circuitry also draws a significant amount of power; it can be 30-40% of the power usage. You want to use it as little as possible.
It is not wrong; it is rather correct. The speed of propagation in semiconductor materials is at most about a third of the speed of light in vacuum, so the distance a signal can travel per cycle is rather limited. A signal might also have to traverse a few transistors or gates, so frequencies in the 3GHz range really do limit processor sizes to the order of millimeters. You already said how to get around it: pipelining, which limits the area a signal has to propagate across. One also has to make sure signals arrive early enough along the longest possible path, and distribute the clock so it arrives at aligned times everywhere, so you need a clock distribution network with known delays, etc. Chip timing is a black art.
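A rough feel for the distances involved, assuming signals propagate at about c/3 as mentioned above (gate delays, skew and margins would eat into this further):

```python
# How far a signal can get in one clock cycle at an assumed on-chip speed of c/3.
c = 3.0e8                      # speed of light, m/s
v = c / 3                      # assumed on-chip propagation speed
for f_ghz in (1, 3, 5):
    reach_mm = v / (f_ghz * 1e9) * 1e3
    print(f"{f_ghz} GHz: ~{reach_mm:.0f} mm per cycle (before gate delays and margins)")
```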
When designing chips or doing layout for FPGA designs, we do something called timing analysis to check whether signals get where they need to be within a clock cycle, so that the design is stable ("meets timing").
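In its simplest form that check is just a budget per path; a toy version with invented numbers:

```python
# Toy static-timing-style check for a single path (all numbers invented).
def slack_ps(clock_period_ps, path_delay_ps, setup_ps, clock_skew_ps):
    """Positive slack means the data arrives before the capturing edge needs it."""
    return clock_period_ps - path_delay_ps - setup_ps - clock_skew_ps

s = slack_ps(clock_period_ps=333, path_delay_ps=290, setup_ps=25, clock_skew_ps=10)
print("slack:", s, "ps ->", "meets timing" if s >= 0 else "violates timing")
```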
There is a lot more to it than just distance. The transistors have speeds, to start with.
That, and even though this size gives a bound on how quickly you can do things, the transistor count is also increasing, so the actual clock doesn't increase all that much.
> If your CPU is 100mm across, the speed of light limits it to 3GHz because that's how many times you can cross the cpu travelling at c. At 10mm you get 30GHz.
100mm across is 10cm, 0.1m, 4 inches. That's a palm-sized CPU, far from any modern silicon.
Rather than the change in transistor design, I think the bigger news is the switch from silicon with dopants to silicon with germanium and dopants. The drop in threshold voltage from ~0.7V to ~0.3V might be one of the last levers left for extracting even more performance, at the cost of making semiconductor production and equipment even more hazardous.
What do you mean "silicon with germanium and dopants"? Implanting germanium as a dopant is already done at much larger geometries than 2/3nm. It's also not any more hazardous than implanting any other ion.
I think the parent might have confused germanium with cadmium? I am no chemist. It could also require other, more toxic substances to control reactions or act as a carrier. The whole area around Sunnyvale is littered with toxic waste dumps from semiconductor manufacturing. [1] From [2], it says, "Some reactive intermediate compounds of germanium are poisonous", which then references [3], but I can't find the specific citation. I think germanium is getting lumped in with other toxic chemicals used in semiconductor manufacturing like gallium arsenide, cadmium, etc.
Per transistor, maybe, but the leakage goes down by less than the increase in the number of transistors you can pack per unit area, so in effect, per die/chip, your leakage increases. Heat as well, but that's the other side of the same coin.
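In made-up numbers, just to show the direction of the effect:

```python
# Hypothetical scaling figures, only to illustrate the per-die argument above.
leakage_per_transistor = 0.8   # assume each transistor leaks 20% less
transistors_per_mm2    = 1.8   # assume 1.8x more transistors per mm^2
change = leakage_per_transistor * transistors_per_mm2
print(f"leakage per mm^2 changes by {change:.2f}x")  # ~1.44x: up, not down
```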
From what I can gather looking at the images and such, the device is one gate with 3 or more isolated channels, each with separate source/drains. Are these processes constrained such that all the sources/drains have to be linked together later? Or can they be used independently, allowing the designer to construct 3 or more transistors with a shared gate?
I think what the world needs right now is more foundries, not smaller manufacturing processes. The global chip shortage totally sucks, and it's because there are so few players and all the innovation is focused on stuff like this instead of figuring out ways to produce ICs faster and cheaper.
Note that "MBCFET" is Samsung's name for their "nanosheet" FET.
And the Anandtech article it comes from: https://www.anandtech.com/show/16041/where-are-my-gaafets-ts...