Last mile Infrastructure for AI, where the inference runs: a distributed architecture