Is there any design documentation available on how MPICH2 is implemented? I realize that the site says that there is no documentation, however any sort of informal documentation regarding design consideration on it would make reading the code easier. I'm particularly interested in knowing how MPI_ALLREDUCE is implemented. Thank you.