Sharding¤
Module: generative_models.scaling.sharding
Source: generative_models/scaling/sharding.py
Overview¤
Sharding strategies and parallelism configuration for scalable training.
This module exposes the retained sharding strategies and configuration
objects used by artifex.generative_models.scaling. It focuses on explicit
strategy composition and mesh metadata instead of claiming a generic
parameter-name to PartitionSpec inference layer.
Classes¤
DataParallelStrategy¤
FSDPStrategy¤
MultiDimensionalStrategy¤
ParallelismConfig¤
PipelineParallelStrategy¤
ShardingConfig¤
ShardingStrategy¤
TensorParallelStrategy¤
Functions¤
init¤
init¤
init¤
init¤
init¤
apply_sharding¤
apply_sharding¤
apply_sharding¤
apply_sharding¤
apply_sharding¤
assign_layers_to_stages¤
from_device_count¤
from_sharding_config¤
get_attention_output_spec¤
get_attention_qkv_spec¤
get_backward_communication_pattern¤
get_combined_partition_spec¤
get_forward_communication_pattern¤
get_gradient_partition_spec¤
get_linear_weight_spec¤
get_partition_spec¤
get_partition_spec¤
get_partition_spec¤
get_partition_spec¤
get_partition_spec¤
get_sharding_constraints¤
get_total_device_count¤
is_valid¤
resolve_sharding_conflicts¤
should_shard_weight¤
Module Statistics¤
- Classes: 8
- Functions: 31
- Imports: 8