The new Hexagon 690 now plays an even bigger role

The new Hexagon 690 is both an iteration on past generation units, as well as bringing a totally new aspect to the IP block. The new DSP block doubles up on the number of HVX vector pipelines as compared to the Hexagon 680 and 685 (as a reminder, the 680/685 contained four scalar units along with two 1024b vector processing units). On the scalar pipeline performance, the new unit sees a 20% increase in performance, which might be due to microarchitectural changes or simply a higher clock speed. On the vector side, he new Hexagon now contains four 1024b vector pipelines, doubling the processing power for those units.

What is entirely new for the new Hexagon IP is the introduction of a new dedicated “tensor accelerator” unit that is seemingly tightly coupled within the larger block. This would most likely be a generally more simplistic MAC array resembling that of other neural network inferencing engines from other vendors. Effectively, this thus represents Qualcomm’s first generation dedicated ML inferencing engine, although Qualcomm does market it as a 4th generation “AI engine” (Previously this nomenclature referred to ML inferencing on CPU, GPU and DSP).

One of the interesting limitations that Qualcomm has wrangled with in previous architectures was that DSP couldn’t process traditional DSP workloads alongside ML inferencing work. However for the S855, Qualcomm has confirmed that the new Tensor units are able to work concurrently and independently alongside the vector units. This means that traditional image processing tasks can now be done in parallel to inferencing tasks, vastly increasing the real-world usability of inferencing on the new Snapdragon platform.

Alongside the vector and tensor units is a new Voice Assistant AI module, designed to accelerate processing for common AI assistants – Qualcomm specifically states Baidu and Google in this case, although it’s unlikely to be exclusive to these two.

New Spectra 380 ISP brings a new Computer Vision accelerator

Alongside the AI Engine, Qualcomm is pairing it with an upgraded Spectra 380 image signalling processor. It also gets a boost to its processing power with a tri-core Computer Vision (CV) ISP. This might seem a little odd – Computer Vision is a family of algorithms that we used to have before the latest advent of machine learning acceleration, so why is it here?

Computer Vision IP is still very useful for various tasks, and in comparison to ML, it's computationally a lot lighter. As such, Qualcomm has deepened its ISP acceleration. With the CV-ISP, Qualcomm believes it can save 4x power (75%?) for image object classification, object segmentation, depth sensing (at 60 FPS), augmented reality body tracking, and image stabilisation.

It’s to be noted that Qualcomm’s claim about being the first computer vision ISP is a bit confusing – it’s definitely not the first SoC with a computer vision IP block, as at least Samsung had this two generations ago. Maybe Qualcomm is referencing some better integration within the ISP pipeline?

With the new accelerator, Qualcomm states that rather than using the AI Engine for everything, the CV-ISP is preferred for these tasks due to the power savings. The company expects many of the features we normally take for granted in photography will now be available for video as well, such as 4K HDR at 60 FPS with both portrait mode enhancements and depth sensing for live bokeh. Power consumption for 4K HDR capture will be 30% lower than the Snapdragon 845.

Also in the cards is support for new generations of image formats. Currently JPEG rules the roost in smartphone photography, however the new S855 will support ‘high efficiency image files’, or HEIF, which will offer a large number of new features over JPEG:

  • Regular Images Up to 50% Smaller
  • HDR Support for multiple color spaces
  • Computer Vision data support
  • RAW data
  • Depth Map support
  • Alpha Masking data
  • Burst Photography data
  • Animated Photography data
  • HEVC Video data

The idea here is to enable new use cases for the photos that users take every day. Within the same HEIF, it can store a photo from every camera: a telephoto, a wide angle photo, and a super-wide angle photo for example. This allows of post processing such as bokeh adjustment, object segmentation, quality adjustments, and to a certain extent even basic 3D modelling.

Snapdragon 855 Connectivity

With day one being about Qualcomm’s approach to 5G, connectivity is going to be at the heart of the S855. Built into the chip is the Snapdragon X24 modem, which offers LTE Category 20 class download speeds good for up to 2 Gbps on supported devices and networks. The increased speeds are achieved by using up to 7x carrier aggregation with 4x4 MIMO and 256QAM. On the upload side, the new SoC modem now also for the first time supports 256QAM, able to double the peak upload rate to up to 300Mbps with 3xCA.

Meanwhile when it comes to 5G, for the S855 platform it’s not the actual SoC which delivers this, but rather the external Snapdragon X50 modem. When combined with the S855, the platform will support both mmWave (>24 GHz) and sub-6 GHz 5G NR technologies. Qualcomm made mention of its entrance in the RFFE (RF front-end) field, which the company now includes in its QTM052 RFFE/antenna mini modules.

Not to be left behind, Qualcomm has also improved their local wireless networking capabilities as well. Snapdragon 855 support of the new WiFi 6 standard (802.11ax) as well as the new 60GHz “Wi-Gig” 802.11ay standard, enabled by an external WiFi combo chipset.

First impressions

Overall the new Snapdragon 855 promises some great improvements in pretty much every single aspect of the SoC. The biggest generational improvements seem to be on the CPU side, where Qualcomm promises a 45% jump in performance. The figure is pretty much in line with what we’ve seen on the Kirin 980, and that SoC performed excellently. And I’m certain Qualcomm will be able to squeeze out even more performance through software.

One big uncertaintly that I have is in regards to the CPU configuration. Here at first I thought the 1+3 configuration seemed reasonable, however the disclosure that the different cores are running on the same power/voltage plane has given me some doubts about how this will pan out in terms of power efficiency. Power efficiency of the CPU and the SoC in general use-cases seems to be something that Qualcomm hasn’t talked about at all today and this worries me quite a bit.

On the GPU side, as expected the new Adreno 640 is just an iterative design on the Adreno 630 – increasing the number of ALUs versus its predecessor by 50%. The performance gains were quoted at 20%, which are a bit on the conservative side but not something to be particularly worried about. Power efficiency is claimed to still be  leading edge, and at least based on the sustained performance metrics showed it looks like the Snapdragon 855 will continue to lead in this regard.

The Snapdragon 855 is Qualcomm’s first chipset to include a dedicated neural network inference engine / “NPU”. Here Qualcomm claims to have a lead over its nearest competitor, the Kirin 980, however it’s something that will need a thorough investigation in the future, as we’ve seen that both vendors' software stacks aren’t quite mature enough to be able to fully extract the capabilities of the hardware. Still, this is an important step for Qualcomm as it now allows concurrent usage of the DSP for image processing while having a dedicated hardware block for inferencing tasks.

The ISP and the new computer vision hardware seem fantastic, the new features are really raising the bar in what will be possible with computational photography as well as introduce new aspects such as computational videography. Here we’ll have to see what OEM vendors will be able to actually implement on final commercial devices.

Last but not least, the Snapdragon 855 is the first platform that promises to provide a complete solution of next generation connectivity technologies. Here the features aren’t directly featured into the SoC but rather in accompanying chipsets, so I do expect that there’s going to be many devices that will not chose the full set of next-generation solutions, but also we might see some devices which will implement the new X50 5G modem alongside the new WiFi6/.11ax and 11ay WiFi solutions.

Today’s announcement is just the first wave of information on the Snapdragon 855. To really judge the new chipset, we’re eagerly waiting on Qualcomm to allow us to do a performance preview on the reference designs. Most importantly, we’re still a few months away from the first commercial devices – which will represent the definitive evaluation of the new chipset.

New Cortex A76 Derived CPU & Adreno 640 GPU
Comments Locked

47 Comments

View All Comments

  • Samus - Thursday, December 6, 2018 - link

    This seems really hard to optimize for with the caches all over the place like that
  • peevee - Tuesday, December 11, 2018 - link

    True. Even UBLAS will have a fit. But then if you do compute on mobile CPUs these days you do it wrong to begin with.
  • eastcoast_pete - Thursday, December 6, 2018 - link

    @Andrei: thanks for this preview; also @Ian: thanks to you and Andrei for coverage of the QC event.
    @Andrei @Ian: any comments, mention by anybody from QC on why the three lower perf big cores are on the same power domain as the high frequency core, despite the potential to save battery by not doing so? I am probably not the only one whose initial thought was "not again", and remembering the 808 and 810 debacle; I would hope that QC had some really good reasons to leave the 1+3 on the same power domain, and it's not just a case of "we'll fix that next time".
    @Andrei: Question: How aware does a camera/video app have to be to take advantage of the new kit in the 855, especially the 4K HDR video? I am asking due to some (negative) experiences with recent phones by big name phone makers (here: LG) that simply didn't use many of the features in the QC flagship SoC that the phones had.

    Regarding the GPU figures: I hope (think) it might be that QC tries the under promise, then over deliver marketing strategy. They were already king of the Android hill with the 845, so they can afford to lowball their numbers.
  • peevee - Tuesday, December 11, 2018 - link

    "Regarding the GPU figures: I hope (think) it might be that QC tries the under promise, then over deliver marketing strategy. They were already king of the Android hill with the 845, so they can afford to lowball their numbers."

    Or 7nm gave them room to optimize, because in termally and/or poer-constrained environments more ALUs at optimal frequency/voltage is always better than fewer ALUs at higher-than-optimal frequency and voltage (because performance scales AT BEST linearly with frequency while power increases much faster, especially if voltage increase is necessary to maintain the higher frequency) in those "embarassingly parallel" workloads.

    Eventially 100MHz at 0.01V is going to be the best. :)
  • ballsystemlord - Friday, December 7, 2018 - link

    What platform(s) will the AI support? Android? Vanilla Linux? Apple? And which AI tooling will be supported? How about the CV?
    Thanks!
  • peevee - Tuesday, December 11, 2018 - link

    "because the cores aren’t running on separate voltage planes it means the actual benefits here in real-world applications are just going to be quite minor. The net result is that the setup is leaving a lot of power efficiency on the table: the voltage supplied to both core groups is always going to be the greater of whatever is being asked for, even if one of the two groups could operate on (much) less voltage."

    But the whole package is power limited. When cores 2,3,4 are idle, Prime core can operate at its highest frequency and cores 2,3,4 don't use much power despite being on the same voltage plane.
    When cores 2,3,4 are busy, I am pretty sure Core 1 will be downclocked and voltage reduced accordingly, so cores 2,3,4 do not operate on voltage higher than necessary. I will be surprised if at 8-core MT+GPU loads the 4 big cores can even sustain 2.42GHz in real power-and-heat-limited phones.
  • Ritesh Benjwal - Wednesday, February 6, 2019 - link

    Such a in-depth article. Thanx for sharing.....
    This is also similar to Snapdragon 855.
    You can check... https://techforyouths.com/snapdragon-855/

Log in

Don't have an account? Sign up now