vllm.model_executor.layers.fused_moe.flashinfer_trtllm_moe ¶
_supports_parallel_config ¶
_supports_parallel_config(
    moe_parallel_config: FusedMoEParallelConfig,
) -> bool
The TRTLLM kernel does not support EPLB.
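As an illustration only, a check of this kind could look like the sketch below. It does not import vLLM: `ParallelConfigSketch` and its `enable_eplb` field are hypothetical stand-ins for `FusedMoEParallelConfig`, and the real function may inspect other fields.

```python
from dataclasses import dataclass


@dataclass
class ParallelConfigSketch:
    """Hypothetical stand-in for FusedMoEParallelConfig (not the vLLM class)."""

    # Assumed field name: whether expert-parallel load balancing is enabled.
    enable_eplb: bool = False


def supports_parallel_config_sketch(cfg: ParallelConfigSketch) -> bool:
    """Return True only when EPLB is disabled, per the note above."""
    return not cfg.enable_eplb


print(supports_parallel_config_sketch(ParallelConfigSketch()))
print(supports_parallel_config_sketch(ParallelConfigSketch(enable_eplb=True)))
```

The first call reports the configuration as supported; the second rejects it because EPLB is turned on.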
is_supported_config_trtllm_bf16 ¶
is_supported_config_trtllm_bf16(
    moe_config: FusedMoEConfig,
    activation_format: FusedMoEActivationFormat,
) -> tuple[bool, str | None]
This function mirrors mk.FusedMoEPermuteExpertsUnpermute.is_supported_config for BF16 unquantized kernels.
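The `tuple[bool, str | None]` return type suggests a "supported flag plus optional reason" pattern. The sketch below shows that pattern in isolation, under stated assumptions: `MoEConfigSketch`, its fields, and the specific checks are hypothetical and are not taken from the vLLM source, and the `activation_format` argument is accepted but not inspected here.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class MoEConfigSketch:
    """Hypothetical stand-in for FusedMoEConfig (not the vLLM class)."""

    dtype: str = "bfloat16"     # assumed field: activation/weight dtype
    quantized: bool = False     # assumed field: whether weights are quantized
    enable_eplb: bool = False   # assumed field: EPLB enabled


def is_supported_config_sketch(
    cfg: MoEConfigSketch,
    activation_format: str,
) -> Tuple[bool, Optional[str]]:
    """Return (True, None) if supported, else (False, reason)."""
    # activation_format is kept only to mirror the documented signature;
    # this sketch does not model which formats the real kernel accepts.
    if cfg.dtype != "bfloat16":
        return False, f"unsupported dtype: {cfg.dtype}"
    if cfg.quantized:
        return False, "quantized weights are not handled by the BF16 path"
    if cfg.enable_eplb:
        return False, "EPLB is not supported"
    return True, None


ok, reason = is_supported_config_sketch(MoEConfigSketch(), "standard")
print(ok, reason)

ok, reason = is_supported_config_sketch(MoEConfigSketch(dtype="float16"), "standard")
print(ok, reason)
```

Returning the reason alongside the flag lets a caller log why a kernel was skipped before falling back to another implementation.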