INTRODUCING A COMPLETE INFO ABOUT "INTEL NEXT GENERATION"
Full steam a head
The chip giant seems to have gotten carried away with its strategy: because the Athlon processes faster than the Pentium 4 at the same clock speed, the clock speed had to increase as quickly as possible. Today, Pentium 4 processors work at speeds of up to 3.06 GHz and with 533 MHz FSB. But that's still not enough: dual-channel storage controllers and even faster system clock speeds (up to 200 MHz quad-pumped) are next. In addition, Intel is continuing to develop HyperThreading technology, which provides better allocation of the processor's capabilities and enhances utilization.
Going in another direction, AMD is concentrating on the 64 bit Athlon so that it can continue to be a player in the future. For several weeks, however, Athlon's top model (Athlon XP 2800+) has been getting less attention for its performance and more for its unavailability. Additionally, Athlon XP can only keep apace with the fastest Pentium 4 CPUs if it is combined with a high-caliber platform. We are referring to thenForce2 chipset by nVIDIA , which can work with dual-channel DDR333 or DDR400.
Load Balancing With HyperThreading
HyperThreading could be considered both a blessing and a curse. It's a curse for AMD in some benchmarks, which lose in performance. But it's a blessing for anyone who can afford a 3 GHz P4 and can therefore see the hidden advantages of multiprocessing, i.e. more parallelism through the use of the existing potential that has been misused until now. In the high-end range, however, HyperThreading offers real added value. And whoever doesn't want it can simply turn this function off in the computer's BIOS.
AGP 2/4X Graphics Cards Supported - 1.5V Only
The AGP connector installed on many Pentium® 4 processor-based Desktop Boards is keyed for 1.5V AGP cards only.
The AGP connector is not mechanically compatible with 3.3V AGP cards. To some the 1.5V AGP connector may appear backwards due to the connector key being exactly the opposite of the 3.3V AGP connector.
Intel's smallest chip. Built with the world's smallest transistors¹
"This is our smallest processor built with the world's smallest transistors. The Intel® Atom™ processor is based on an entirely new design, built for low power and designed specifically for a new wave of Mobile Internet Devices and simple, low-cost PC's. This small wonder is a fundamental new shift in design, small yet powerful enough to enable a big Internet experience on these new devices. We believe it will unleash new innovation across the industry."
– Created by Usama Shakeel
As Intel's smallest and lowest power processor², the Intel® Atom™ processor enables the latest Mobile Internet Devices (MIDs), and another new category of devices for the internet called netbooks and nettops.
Intel's digital heath division created by intel
Intel's digital heath division created this prototype portable doctor's assistant. The round knob in the top right hand corner is a wireless detachable Bluetooth stethoscope, while the three modules at the base customise the unit for different tasks
Intel Selections
A selection of concept handtop PCs, which combine full PC functionality with a PDA-like form factor. Now you too can run XP in a box small enough to lose down the back of the sofa
A DVB-H adaptor
A DVB-H adaptor for digital television on the move. UK company Crown Castle is one of the big names behind this new standard, which promises 16 channels delivered wirelessly to PDA or laptop
An amazing outlook
The 82559 Fast Ethernet controller with an integrated 10/100 Mbps physical layer device is Intel’s leading solution for PCI board LAN designs. It is designed for use in Network Interface Cards (NICs), PC LAN On Motherboard (LOM) designs, embedded systems and networking system products. The 82559 combines a low power and small package design which is ideal for power and space constrained environments. The 82559 continues Intel's platform LAN technology leadership supporting: Advanced Configuration and Power Interface (ACPI) 1.20A based power management, wake on Magic Packet*, wake on interesting packet, advanced System Management Bus (SMB) based manageability, Wired for Management (WfM) 2.0 compliance, IP checksum assist, PCI 2.2 compliance, and PC 98, PC 99, Server 99, and PC 2000 compliance. Designs based on the 82559 for desktop systems, laptops, PC cards and servers will set a new industry standard for energy conservation. |
Optimized Integration for Low Cost Solution |
Integrated IEEE 802.3 10BASE-T and 100BASE-TX compatible PHY | |
Glueless 32-bit PCI master interface | |
Modem interface for LAN/modem combination solutions | |
128 Kbyte Flash interface | |
Integrated power management functions | |
Thin BGA 15x15mm package | |
Wired for Management (WfM) Support |
WfM 2.0 compliance | |
System Management Bus (SMB) support for advanced management support | |
Power management capabilities | |
Advanced Configuration and Power Interface (ACPI) 1.20A, and PCI Power Management specifications compliance | |
Magic Packet* support | |
Wake on interesting packets and link status change support | |
Remote power-up support | |
High-performance Networking Functions |
Highly efficient chained memory structure similar to the 82558, 82557, and 82596 enabling backwards compatible software | |
Dynamic transmit chaining with multiple priorities transmit queues | |
Full duplex support at both 10 and 100 Mbps operation | |
IEEE 802.3u Auto-Negotiation support | |
3 Kbyte Transmit FIFO and 3 Kbyte Receive FIFO | |
Back-to-back transmission support with minimum interframe spacing | |
IEEE 802.3x 100BASE-TX Flow Control support | |
Enhanced Adaptive Technology capabilities | |
TCP/UDP checksum off-load capabilities | |
Low Power Features |
Low power 3.3 V device | |
Glueless 32-bit PCI bus master interface | |
Efficient dynamic standby mode | |
Deep power-down support | |
Clockrun protocol support |
Seemingly official-looking pictures have emerged of Asus's upcoming all-in-one PC and display, the Eee Monitor.
The images, which are believed to have been leaked from within Asus, are available on the Pakistani site "www.onlyusama.blogspot.com" It is hard to tell at this point what components are inside the machine, but it is likely that they will not be dissimilar to those inside the Eee Box, which is set to launch in August.
The photographs appear to show at least six USB ports on the machine, alongside standard ports like Ethernet, line-in and line-out. A webcam is integrated, as are an SDHC slot and LEDs to show Wi-Fi and disk activity. Interestingly, the PC appears to have integrated Denon speakers.
The Core i7 die and major components. Source: Intel.
What you're seeing, incidentally, is a pretty good-sized chip—an estimated 731 million transistors arranged into a 263 mm² area via the same 45nm, high-k fabrication process used to produce "Penryn" Core 2 chips. Penryn has roughly 410 million transistors and a die area of 107 mm², but of course, it takes two Penryn dies to make one quad-core product. Meanwhile, AMD's native quad-core Phenom chips have 463 million transistors but occupy a larger die area of 283 mm² because they're made on a 65nm process and have a higher ratio of (less dense) logic to (denser) cache transistors. Then again, size is to some degree relative; the GeForce GTX 280 GPU is over twive a size of a Core i7 or Phenom.
Nehalem's four cores are readily apparent across the center of the chip in the image above, as are the other components (Intel calls these, collectively, the "uncore") around the periphery. The uncore occupies a substantial portion of the die area, most of which goes to the large, shared L3 cache.
This L3 cache is the last level of a fundamentally reworked cache hierarchy. Although not clearly marked in the image above, inside of each core is a 32 kB L1 instruction cache, a 32 kB L1 data cache (it's 8-way set associative), and a dedicated 256 kB L2 cache (also 8-way set associative). Outside of the cores is the L3, which is much larger at 8 MB and smarter (16-way associative) than the L2s. This basic arrangement may be familiar from AMD's native quad-core Phenom processors, and as with the Phenom, the Core i7's L3 cache serves as the primary means of passing data between its four cores. The Core i7's cache setup differs from the Phenom's in key respects, though, including the fact that it's inclusive—that is, it replicates the contents of the higher level caches—and runs at higher clock frequencies. As a result of these and other design differences, including a revamped TLB hierarchy, the Core i7's cache latencies are much lower than the Phenom's, even though its L3 cache is four times the size.
One mechanism Intel uses to make its caches more effective is prefetching, in which the hardware examines memory access patterns and attempts to fill the caches speculatively with data that's likely to be requested soon. Intel claims the Core i7's prefetching algorithm is both more efficient than Penryn's—some server admins wound up disabling hardware prefetch in Xeons because it harmed performance with certain workloads, a measure Intel says should no longer be needed—and more aggressive, as well.
The Core i7 can get to main memory very quickly, too, thanks to its integrated memory controller, which eliminates the chip-to-chip "hop" required when going over a front-side bus to an external north bridge. Again, this is a familiar page from AMD's template, but Intel has raised the stakes by incorporating support for three channels of DDR3 memory. Officially, the maximum memory speed supported by the first Core i7 processors is 1066 MHz, which is a little conservative for DDR3, but frequencies of 1333, 1600, and 2000 MHz are possible with the most expensive Core i7, the 965 Extreme Edition. In fact, we tested it with 1600 MHz memory, since this is a more likely configuration for a thousand-dollar processor.
For a CPU, the bandwidth numbers involved here are considerable. Three channels of memory at 1066 MHz can achieve an aggregate of 25.6 GB/s of bandwidth. At 1333 MHz, you're looking at 32 GB/s. At 1600 MHz, the peak would be 38.4 GB/s, and at 2000 MHz, 48 GB/s. By contrast, the peak effective memory bandwidth on a Core 2 system would be 12.8 GB/s, limited by the throughput of a 1600MHz front-side bus. With dual channels of DDR2 memory at 1066MHz, the Phenom's peak would be 17.1 GB/s. The Core i7 is simply in another league. In fact, our Core i7-965 Extreme test rig with 1600MHz memory has the same total bus width (192 bits) and theoretical memory bandwidth as a GeForce 9600 GSO graphics card
With the memory controller onboard and the front-side bus gone, the Core i7 communicates with the rest of the system via the QuickPath interconnect, or QPI. QuickPath is Intel's answer to HyperTransport, a high-speed, narrow, packet-based, point-to-point interconnect between the processor and the I/O chip (or other CPUs in multi-socket systems.) The QPI link on the Core i7-965 Extreme operates at 6.4 GT/s. At 16 bits per transfer, that adds up to 12.8 GB/s, and since QPI links involve dedicated bidirectional pairs, the total bandwidth is 25.6 GB/s. Lower-end Core i7 processors have 4.8 GT/s QPI links with up to 19.2 GB/s of bandwidth. Obviously, these are both just starting points, and Intel will likely ramp up QPI speeds from here in successive product generations. Still, both are somewhat faster than the HyperTransport 3 interconnects in today's Phenoms, which peak at either 16 or 14.4 GB/s, depending on the chip.
A block diagram of the Core i7 system architecture. Source: Intel.
This first, high-end desktop implementation of Nehalem is code-named Bloomfield, and it's essentially the same silicon that should go into two-socket servers eventually. As a result, Bloomfield chips come with two QPI links onboard, as the die shot above indicates. However, the second QPI link is unused. In 2P servers based on this architecture, that second interconnect will link the two sockets, and over it, the CPUs will share cache coherency messages (using a new protocol) and data (since the memory subsystem will be NUMA)—again, very similar to the Opteron.
In order to take advantage of this radically modified system architecture, the design team tweaked Nehalem's processor cores in a range of ways big and small. Although the Core 2's basic four-issue-wide design and execution resources remain more or less unchanged, almost everything around the execution units has been altered to keep them more fully occupied. The instruction decoder can fuse more types of x86 instructions together and, unlike Core 2, it can do so when running in 64-bit mode. The branch predictor's accuracy has been enhanced, too. Many of the changes involve the memory subsystem—not just the caches and memory controller, which we've already discussed, but inside the core itself. The load and store buffers have been increased in size, for instance.
These modifications make sense in light of the Core i7's much higher system-level throughput, but they also help make another new mechanism in the chip work better: the resurrected Hyper-Threading, or simultaneous multithreading (SMT). Each core in Nehalem can track two independent hardware threads, much like some other Intel processors, including later versions of the Pentium 4 and, more recently, the Atom. SMT takes advantage of the explicit parallelism built into multithreaded software to keep the CPU's execution units more fully occupied, and done well, it can be a clear win, delivering solid performance gains at very little cost in terms of additional die area or power use. Intel architect Rohank Singhal outlined how Nehalem's implementation of Hyper-Threading works at this past Fall IDF. Some hardware, such as the registers, must be duplicated for each thread, but much of it can be shared. Nehalem's load, store, and reorder buffers are statically partitioned between the two threads, for example, while the reservation station and caches are shared dynamically based on demand. The execution units themselves don't need to be altered at all.
Intel Core i7 Processors: Nehalem and X58 Have Arrived
There are only a select few events in the PC hardware world that get hardcore enthusiasts truly excited. For example, when popular trade shows like Computex, IDF, and CES take place, there is a fair amount of buzz. Also, anytime the major players in graphics release next-generation GPUs, things definitely heat up; or likewise when a hot new game hits. Finally, when either of the processor big guns, Intel or AMD unleash new CPU micro-architectures on the world, you can almost bet on the community to come alive with enthusiasm. We're sure we've missed a few other momentous occasions as well, but you get the gist. It takes something new and exciting to get a PC Enthusiast's pulse racing.
Thankfully, today is one of those times. Although Intel won't be officially launching their Core i7 processors, formerly codenamed Nehalem, and the X58 Express chipset until sometime later in the month, we've had them in house for a while now and can finally show you all the goods. We've tested every Core i7 speed grade that will be available at launch, along with at trio of X58 Express based motherboards. We've even thrown in some high-resolution multi-GPU SLI and CrossFireX testing for good measure as well.
There's a lot to cover, so we'll keep the introduction short and dive right in. Below are some Core i7 features and specifications to whet your appetite--the main course is available on the pages ahead. Also, our video spotlight of all this new Intel technology