cross-scale token projection