FROM
THE EDITOR
Let’s face it, we’ve all been living a lie. We pretend that FPGA stands for “Field Programmable Gate Array” but we all know that FPGAs are not really arrays of gates. If anything, they’re arrays of look-up tables (LUTs). More recently, however, we’ve been seeing arrays of many other things also showing up in our FPGAs. We’ve got multipliers, memories, transceivers, processors and more. We really should be calling these things Field Programmable Arrays of Stuff (FPAS). This week, we take a look at a different type of FPAS, NVIDIA’s new Tesla T10P GPU. It is an array of 240 processor cores that, combined with the CUDA programming environment, puts the GP into GPGPU. Confused? Our latest feature article explains.
Thanks for reading! If there's anything we can do to make our
publications more useful to you, please let us know at:
comments@fpgajournal.com. If you'd rather sound off in public, please post your comments or questions in our new Journal Forums.
Kevin Morris – Editor in Chief
Techfocus Media, Inc.
|
EVENTS & ANNOUNCEMENTS
Download New Design Software for 40nm FPGAs & ASICs
New Quartus® II software v8.0 is the #1 software for
performance and productivity. Design for the new 40nm
Stratix® IV FPGAs and HardCopy® IV ASICs and achieve
the fastest compile times, highest performance, and
highest logic utilization.
Download Quartus II v8.0 software today!
|
Take our new
SUPER QUICK, JUST A COUPLE OF QUESTIONS,
WON'T TAKE MUCH TIME AT ALL (WE PROMISE)
2008 Journal Reader Survey.
WEIGH IN NOW!!
|
|
|

|
A Passel of Processors
NVIDIA’s Tesla T10P Blurs Some Lines
(Kevin Morris)
Picture this architecture – a high speed application processor doing control coupled to an accelerator comprised of a mass of processing elements ready to power-parallelize compute-intensive components of a complex problem. Sound familiar? Supercomputers have taken advantage of acceleration using schemes like this for a while. People using FPGAs for co-processors do it all the time.
Now, picture a new chip with 1.4 billion transistors, an array of 240 cores, and a processing throughput equivalent to about 1 TeraFLOPS. Many readers of this publication would probably guess a new FPGA, right?
With the new Tesla T10P GPU, NVIDIA is making a lot of us editors re-work our glossaries. The T10P is a GPU that’s aimed directly at the high-performance computing community, not just accidentally clipping it with a bank shot while going after the real target market of graphics acceleration. The T10P represents NVIDIA’s second generation of CUDA (Compute Unified Device Architecture) GPUs (with the Tesla 8 being the first). CUDA is a C dialect with specific constructs for parallelism, and it allows direct access to the low-level hardware capabilities of the processor cores of the GPU. Why would we want that? To do non-graphics applications, of course.
You see – unless your performance-critical application happens to involve a lot of shading and texture mapping, GPUs have traditionally been a locked treasure chest, not ready to share all that parallel-processing goodness with those who aren’t trying to blast billions of bits onto a screen. Many people have always known that processing power was in there, though, and an access mechanism like CUDA is the key that lets them get in to put all those processors to work - doing a lot of interesting tasks that are most certainly NOT graphics acceleration. [more]
|
|