Benutzername:   Noch nicht registriert?
Passwort:   Passwort vergessen?
iB Code Einmal klicken um den Tag zu öffnen, nochmal klicken zum Schliessen

Top Smilies
Beitrag

HTML ist on für dieses Forum

IkonCode ist on für dieses Forum

SMILIES LEGENDE ansehen

Beitragsoptionen Möchten Sie Ihre Signatur hinzufügen?
Wollen Sie per Email über Antworten informiert werden?
Wollen Sie Emoticons in Ihrem Beitrag aktivieren?
 

Beitragsrückblick für SiSoftware Sandra 2021 (die neuesten Beiträge zuerst)
ocinside Erstellt: 11:20 am 1. Nov. 2023
SiSoftware Sandra R26 31.137 wurde gerade veröffentlicht und kann nun von unserem Server heruntergeladen werden :thumb:


- Hardware Support
Resolved detection issues with Intels 14th Gen Core RaptorLake-Refresh e.g. 14700KF
Resolved reporting issues with AMDs Gen4 Ryzen e.g. 7950X-3D

- Software Support
Resolved detection issues with tiered Storage Pools e.g. Windows Storage Spaces
Updated Official Ranker API for greater reliability

- Windows Defender Core Isolation features and Sandras Kernel Driver
Sandra is compatible with most Core Isolation” security features and you can have those enabled:
Memory Integrity – Compatible, can enable
Memory Access Protection – Compatible, can enable
Microsoft Vulnerable Driver Blocklist – Kernel driver compatible
Kernel-mode Hardware-enforced Stack Protection – Kernel driver not compatible, but software will still run with reduced functionality

- Kernel-mode Hardware-enforced Stack Protection and Sandras Kernel Driver

When running Sandra on recent Windows 11 versions and hardware you may receive a pop-up stating that Sandra.sys aka the kernel driver cannot be loaded on this device due to a security feature called Hardware-enforced Stack Protection. Microsoft has started to ask Windows users to enable this new security features that may not be compatible with Sandra’s kernel driver Sandra.sys. As this driver must run in kernel space and directly interacts with hardware – as with most other hardware utilities of this type, e.g. game anti-cheats, anti-virus/malware and similar kernel-level software – it cannot be compatible with all features. While we are working on finding a solution, unfortunately there is no easy or immediate fix for it. While most of the features will still work without the kernel device driver – its hardware detection functionality is greatly reduced, e.g. CPU/Chipset/Memory data will not be available e.g. CPU multipliers, memory controller timings, memory modules detection, etc.. You can still run the benchmarks. For full Sandra functionality, you will need to disable this feature and perhaps re-enable it later if all your other software is compatible with it. While we are working on a solution, unfortunately there is no immediate/easy work-around or fix.

ocinside Erstellt: 10:52 am 6. Aug. 2023
SiSoftware Sandra R25 31.133 wurde gerade veröffentlicht und kann nun von unserem Server heruntergeladen werden :thumb:


- CPU Multi-Media Benchmarks
AVX-iFMA(52): New 256-bit code path based on AVX512-iFMA(52) 512-bit for future arch (e.g. Meteor Lake MTL, Arrow Lake ARL). We saw +66% improvement as detailed in our AVX512-iFMA(52) Improvement for IceLake & TigerLake article.
AVX512-FP16: New code path for Xeon processors that support AVX512-FP16. We expect +90% improvement over FP32 if precision loss in acceptable (e.g. zoomed out fractals).
Note: Future FP16 code-paths will also be added to the other CPU benchmarks, however some parts may remain FP32 as the loss of precision would make the results useless. We have explored this in our GP-GPU article dealing with FP16 support: FP16 GP-GPU Image Processing Performance & Quality.

- CPU Cryptogaphy Benchmarks
SHA2-512 HWA: Hardware-accelerated hashing SHA512 code-path – based on current SHA2-256 HWA. We expect ~3x (three times) better performance based on the SHA2 non-accelerated/HWA results.
Future SM3-256 (China) HWA: Hardware-accelerated hashing SM3 code-path (China's version of SHA hashing functions). We expect similar performance improvement to SHA HWA.
Future SM4-128/256 (China) HWA: Hardware-accelerated block crypto SM4 code-path (China's version of AES block crypto functions). We expect similar performance improvement to AES HWA.
Note: We will default to SM4 + SM3 benchmarks – rather than AES + SHA for both CPU & GP-GPU Cryptography benchmarks when China locale is detected as these algorithms are more likely to be used there.

- All CPU Benchmarks – AVX10 Support
AVX10.2+ 256-bit future code paths (FP32, FP64 and FP16) for Hybrid architectures (e.g. Meteor Lake MTL, Arrow Lake ARL). Note both Core (P) and Atom (E) will run the same 256-bit width binary and not different widths
AVX10.1+ 512-bit & AVX512 shared code path (FP32, FP64 and FP16) for Xeon architectures
Possible AVX10.2+ 128-bit future code path for Atom/Other discrete architectures if required

- Hardware Support
Intel 14th gen Hybrid Raptor Lake Refresh RPL-S support
Intel future gen Hybrid Meteor Lake (MTL-S/M/P/N), Arrow Lake (ARL-S/U), Lunar Lake (LNL-M) detection
Intel future gen Xeon Granite Rapids SP/D detection

ocinside Erstellt: 13:59 am 1. März 2023
SiSoftware Sandra 31.117 mit AMD X3D Unterstützung wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


- CPU / Platform Detection
Additional Intel Core 13 (RaptorLake) H/P/U-Series support
Preliminary AMD Ryzen 3D V-Cache/L3 (7950X-3D, 7900X-3D, 7800X-3D etc) support
- GP-GPU Benchmarks
Updated CUDA SDK

ocinside Erstellt: 7:05 am 16. Dez. 2022
SiSoftware Sandra 31.115 wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


- Benchmarks, Hardware Support updates and fixes
GP-GPU – CUDA
Updated to CUDA 11.8
Support for 8.9/9.0 devices
nVidia 4090 series FP16 validation failure (too low precision for image size)
GP-GPU Memory Bandwidth
Corrected DirectX Compute (11/12) bandwidths with internal graphics
Memory Detection
Intel XMP 3.0, AMD EXPO DDR5 detection missing in some instances
Jedec, Intel XMP 2.0 DDR4 write timings if provided by newer standard revision
Corrected timings (tRP, tRAS) for some Intel ADL/RPL systems
Missing memory timings for some mobile Intel ADP-P/U systems
Added support for more chipsets (both AMD and Intel)

ocinside Erstellt: 9:45 am 19. Okt. 2022
SiSoftware Sandra 2021 31.109 wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


- Benchmarks, Hardware Support updates and fixes
Cluster Bandwidth Contribution: All Cache/Memory Benchmarks
It is the sum of the bandwidth of all cores in each cluster (%)
Reveals how much the big/P & LITTLE/E core clusters have contributed compute power wise to each benchmark score in percent: e.g. on Intel 12600K - Memory Bandwidth Benchmark:
60% big/P Cluster - 40% LITTLE/E Atom cluster
Core Bandwidth Ratio: All Cache/Memory Benchmarks
It is the ratio of the big/P Core vs. LITTLE/E core bandwidth (ratio)
Reveals just how much more performant the big/P Cores are vs. the LITTLE/E cores.
2.15x big/P Core - 1.x LITTLE/E Atom core

- Additional Hardware Support
Additional AMD Ryzen 7000 (Zen4) support
Enabled DDR5 SPD (Hub, PMIC, TS) AMD information
Additional Intel Sapphire Rapids (SRP-X/EX) support
Enabled DDR5 SPD (Hub, PMIC, TS) Intel information

- Example of performance contribution/ratio on Intel Core i5 12600K (ADL):
Cluster Bandwidth Contribution (per Cluster type)
Integer Memory Bandwidth : 64% big Cores Cluster – 32% LITTLE cores Cluster
Float Memory Bandwidth : 65% big Cores Cluster – 32% LITTLE cores Cluster
L1D (1st Level) Data Cache Bandwidth : 78% big Cores Cluster – 18% LITTLE cores Cluster
L2 (2nd Level) Unified/Data Cache Bandwidth : 89% big Cores Cluster – 7% LITTLE cores Cluster
L3 (3rd Level) Unified/Data Cache Bandwidth : 90% big Cores Cluster – 7% LITTLE cores Cluster
Note: Here we sum the memory bandwidth performance contribution for all cores in the type of cluster.
Note2: This feature also works on Arm64 big.LITTLE / DynamicQ SoCs, it is not tied to x86 Intel hybrid systems.
Cluster Bandwidth Ratio (per core type)
Integer Memory Bandwidth : 1.3x big-Core – 1.0x LITTLE-core
Float Memory Bandwidth : 1.4x big-Core – 1.0x LITTLE-core
L1D (1st Level) Data Cache Bandwidth : 2.9x big-Core – 1.0x LITTLE-core
L2 (2nd Level) Unified/Data Cache Bandwidth : 8.5x big-Core – 1.0x LITTLE-core
L3 (3rd Level) Unified/Data Cache Bandwidth : 8.6x big-Core – 1.0x LITTLE-core
Note: Here we divide the cluster performance contribution by the number of cores in each cluster – and then work out the big/LITTLE ratio. If SMT is enabled or not, we still only count the number of cores per cluster, not threads.

Weniger Antworten Mehr Antworten
ocinside Erstellt: 7:30 am 7. Sep. 2022
SiSoftware Sandra 2021 31.104 wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


- Benchmarks, Hardware Support updates and fixes
Parallelism: All CPU/Memory Benchmarks
Additional Multi-Threaded big/P Cores Only aka only using threads running on big/P cores (SMT)
Note that as (current) LITTLE/E Atom cores do not support SMT, there is no corresponding option for them yet

Cache & Memory Bandwidth Benchmark
Improved/fixed hybrid bandwidth workload allocation when all cores types are used; e.g. on Intel 12600K:
Memory Bandwidth Improvement: 49GB/s to 56GB/s: +14% improvement (float AVX2/buffered)
L1D Cache Bandwidth – not affected/improvement
L2 Caches Bandwidth fix: 145GB/s to 836GB/s
L3 Cache Bandwidth fix: 56GB/s to 452GB/s

Cluster Performance Contribution: All CPU/Memory Benchmarks
It is the sum of the performance of all cores in each cluster (%)
Reveals how much the big/P & LITTLE/E core clusters have contributed compute power wise to each benchmark score in percent: e.g. on Intel 12600K - CPU Arithmetic Benchmark:
80% big/P Cluster - 20% LITTLE/E Atom cluster

Core Performance Ratio: All CPU/Memory Benchmarks
It is the ratio of the big/P Core vs. LITTLE/E core performance (ratio)
Reveals just how much more performant the big/P Cores are vs. the LITTLE/E cores.
2.15x big/P Core - 1.x LITTLE/E Atom core

- Future Hardware Support
Additional AMD Ryzen 7000 (Zen4) support
Enabled DDR5 SPD (Hub, PMIC, TS) AMD information
Additional Intel Sapphire Rapids (SRP-X/EX) support
Enabled DDR5 SPD (Hub, PMIC, TS) Intel information


ocinside Erstellt: 9:13 am 24. Juli 2022
SiSoftware Sandra 2021 31.98 R15 wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


CUDA GP-GPU Benchmarks
- Updated to CUDA SDK 11.8 with nVidia Ada support GeForce RTX 4090, et.]

Memory Bandwidth Benchmark
- 3-5% bandwidth increase due to L1D block/prefetch optimisations e.g. AVX512 AMD Zen4, Intel ICL-SP, future arch.

Memory Latency Benchmark
- "In-Page Random" memory access latency pattern – additional TLB ranges randomisation in addition to the randomisation within each TLB range. Credit Rob Williams @ TechGage – many thanks!
Benchmark now fails (does not run at all) if TLB information cannot be detected, e.g. CPU does not report it.
This change affects both Data and Code latencies.

- Note that the other tests "Full Random" and "Sequential" memory access patterns – are *not* affected – as the pattern is not affected by TLB data.

- It is always recommended to use "2MB/large" pages rather than "4kB/normal" pages in order to minimise "TLB miss" penalties which is the reason for the “in-page random” test. Please see How to enable large/huge memory pages in Windows.

Cryptography Benchmark
- 3-5% bandwidth increase due to L1D block/prefetch optimisations e.g. AVX512-VAES AMD Zen4, Intel ICL-SP, future arch.

Client (GUI)
- Light/Dark-mode colour optimisations

ocinside Erstellt: 6:44 am 5. Juli 2022
SiSoftware Sandra 2021 31.97 R14c wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


Memory Latency Benchmark
- "In-Page Random" memory access latency pattern – TLB range fix – that resulted in too-low memory score (latency) to be reported on modern Intel systems (e.g. AlderLake with large L3 cache).
- Better random number generator (2^32 vs. 2^15 states) in order to defeat any possible access pattern detection by the CPU.
- Note that the other tests "Full Random" and "Sequential" memory access patterns – arenot- affected - as the pattern is not affected by TLB data.
- It is always recommended to use "2MB/large" pages rather than "4kB/normal" pages in order to minimise TLB miss penalties which is the reason for the "in-page random" test.
- Reverted to testing latencies of all cores thus "Multi-Core" rather than just 1 thread/core "Single-Core" so that on hybrid systems (AlderLake, RaptorLake, etc.) the average/overall latency does not favour just to Big/P cores. This does increase run-time of test proportional with number of cores.

Memory Bandwidth Benchmark
- Increased buffer sizes for modern processors to match L1D cache size

Cryptography Benchmark
- fixed HWA code paths (AES, SHA) not engaging R13x regression

Hardware
- Resolved L2, L3, L4 caches counts detection R13x regression


ocinside Erstellt: 14:06 am 14. Juni 2022
SiSoftware Sandra 2021 31.93 R14 wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


- Hardware
Resolved L2 & L3 cache count detection [R13x regression]
- Client (GUI)
Light/Dark-mode font optimisations

ocinside Erstellt: 7:32 am 19. Mai 2022
SiSoftware Sandra 2021 31.88 R13 wurde online gestellt und kann nun von unserem Server heruntergeladen werden :thumb:


- Hardware
Intel 4th gen Intel Xeon Sapphire Rapids ADL-SP AVX512+ support
Intel 3rd gen Intel Xeon Whitley Ice Lake ICL-SP AVX512+ support
AMD Ryzen 6000 Mobile platform
Future hardware support

- Benchmarks
Memory Bandwidth Benchmark: fixed failure with multiple cores/threads CPUs e.g. ICL-SP
Cache Bandwidth Benchmark: fixed failure with multiple cores/threads CPUs e.g. ICL-SP

- Client GUI
Light/Dark-mode font optimisations
Icon dynamic colour customisation

×