According to some BOINC stats there are some Fusion APU samples in the labs, running complex calculations for over a month.
There we have:
Ontario 1 ("AMD64 Family 20 Model 0 Stepping 0")
Ontario 2 ("AMD64 Family 20 Model 0 Stepping 0")
Llano 1 ("AMD64 Family 18 Model 0 Stepping 0")
Llano 2 ("AMD64 Family 18 Model 0 Stepping 0")
The operating systems being used are Win 2k8 x64 and Linux. A model number 0 and stepping number 0 for all of them indicate early samples (likely early A0 silicon). These systems could actually be just two systems, running different OS' at different times. The number of cores (2 for Ontario, 3! for Llano) or the RAM size are the same. One strange number is the cache size of the Ontarios. In one case it is listed as 512 kB which is ok. But in the other case it is 488 kB, 24 kB less than the maximum amount. One explaination could be a power management feature, which dynamically resizes the L2 cache depending on cache usage and power budget. Edit: This seems to be a problem of the BOINC software.
The ratio of integer performance to floating point performance, which I used in the past to detect a turbo mode in Thuban, is 3.1 for Llano 1 and 3.9 (+26%) for Llano 2 (on Linux, so probably with customized processor driver). In case of Thuban the difference was 3.7 to 3.2 (+16%). This could indicate an improved turbo mode, but this is what is expected for Llano. But there obviously is a flaw in the measurement method, since we don't know when a frequency boost happens.
Ontario has a ratio of 2.3 which speaks for the reduced number of integer units in its Bobcat cores. Another factor is the throughput of the floating point unit. If I'm correct, BOINC uses single precision arithmetic in some of its projects. If it uses single precision for the floating point benchmark, the throughput could be four times that of double precision calculations according to a paper, which I see as being related to the Bobcat FPU. I've read it in full, but the linked abstract tells enough:
"The FPM can perform two parallel single-precision multiplies every cycle with a latency of two cycles, one double-precision multiply every two cycles with a latency of four cycles, or one extended-double-precision multiply every three cycles with a latency of five cycles."
According to my table of BOINC results, the cores of the Llano sample(s) are comparable to 1.5 to 1.9 GHz Phenom II cores. I think it's more likely, that the actual clock frequency range is 1.4 to 1.8 GHz due to the changes to the core and the bigger L2. The Ontario core's integer performance is comparable to an 1.3 GHz Phenom II core, while the (single precision) floating point performance even matches that of an 1.6 GHz Phenom II core. The BOINC benchmark only measures single core performance.