pub fn good_chunk_size(len: usize) -> usize
Calculates a good chunk size for parallel processing based on the number of available threads. This is used to ensure that the workload is evenly distributed across the threads.