Quantization

Compute interpretation

Deployment-time adaptation that lowers numerical precision (for example, from 32-bit floats to 8-bit integers) to reduce memory footprint, memory bandwidth, and latency pressure.
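
As an illustration of how lowering precision trades storage for a small rounding error, here is a minimal sketch of symmetric per-tensor int8 quantization in plain Python. The function names (quantize_int8, dequantize_int8), the sample weights, and the choice of a symmetric per-tensor scheme are assumptions made for this example; this page does not prescribe a particular scheme, and production toolchains typically use per-channel scales, calibration data, and packed integer storage.

```python
def quantize_int8(values):
    """Map floats to integer codes in [-127, 127] using one shared scale.

    Hypothetical helper for illustration: symmetric, per-tensor quantization.
    """
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero tensor, where any scale would work.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest code, then clamp to the symmetric int8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate float values from integer codes."""
    return [c * scale for c in codes]


# Made-up sample weights, not taken from any real model.
weights = [0.5, -1.27, 0.003, 0.9]

codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)

# Storage if each value were held as a 4-byte float vs. a 1-byte integer.
fp32_bytes = len(weights) * 4
int8_bytes = len(codes) * 1

# Worst-case reconstruction error; bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))

print("codes:", codes)
print("scale:", scale)
print("bytes fp32 vs int8:", fp32_bytes, int8_bytes)
print("max abs error:", max_error)
```

In this sketch the int8 codes take a quarter of the bytes of 32-bit floats, which is where the memory and bandwidth savings come from. The cost is a per-value rounding error of at most half the scale, so small weights such as 0.003 collapse to zero.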

Supporting reading cards

Obsolete or less central under later compute

Track this only through linked reading cards; do not treat this method page as standalone evidence.