Figure 3From: Fast, parallel implementation of particle filtering on the GPU architecture GPU architecture. Simplified architecture of the GPU. Streaming multiprocessors (SMP) contain cuda cores (P i ), an on-chip (shared), fast memory, scheduling unit and special function unit. Besides, all cuda cores in all SMPs have access to the large and slow off-chip memory (global memory).Back to article page