FAST '26 - Accelerating Model Loading in LLM Inference by Programmable Page Cache