### Bug description

1. The code at this line does not appear to be bracketed by a compile flag: https://github.com/pytorch/torchtitan/blob/ad06609c4e3ac119112b097b6b317f31c5c760f5/torchtitan/experiments/kernels/moe/indices.py#L16
2. This line calls `cuda` directly instead of going through the accelerator: https://github.com/pytorch/torchtitan/blob/ad06609c4e3ac119112b097b6b317f31c5c760f5/torchtitan/distributed/expert_parallel.py#L197

### Versions

TorchTitan and PyTorch master.
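
For the second item, a device-agnostic lookup could look roughly like the sketch below. This is only an illustration, not the actual TorchTitan code: `resolve_device_type` is a hypothetical helper name, and the sketch assumes a PyTorch build that may or may not expose the `torch.accelerator` namespace, falling back to the CUDA check when it does not.

```python
def resolve_device_type(default: str = "cpu") -> str:
    """Hypothetical helper: return the active accelerator type, or `default`.

    Assumption: prefer torch.accelerator when the installed PyTorch has it,
    so that non-CUDA backends are picked up without hard-coding "cuda".
    """
    try:
        import torch
    except ImportError:
        # PyTorch is not installed; nothing to query.
        return default

    accel = getattr(torch, "accelerator", None)
    if accel is not None and accel.is_available():
        # current_accelerator() returns a torch.device; keep only its type.
        return accel.current_accelerator().type

    # Older PyTorch without torch.accelerator: fall back to the CUDA check.
    if torch.cuda.is_available():
        return "cuda"
    return default


if __name__ == "__main__":
    print(resolve_device_type())
```

Call sites that currently hard-code `cuda` could then build their device from the returned string, for example `torch.device(resolve_device_type())`, which keeps the CUDA path unchanged while leaving room for other backends.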