PIO reader parallel performance: space and time
The PIO reader currently reads each entire scalar field on rank 0 and then redistributes it across the other ranks. This is slow, and on large datasets it causes a bad malloc, since a single process has to hold the whole field at once. The reader needs to have multiple ranks perform partial reads simultaneously, with each rank reading only its own fraction of the field and distributing that fraction appropriately.
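A rough sketch of the partial-read idea, for discussion only. It assumes the field is stored as a flat, contiguous array of little-endian float64 values starting at a known byte offset, which may not match the real PIO file layout. The names `partition`, `read_partial`, `data_start`, and `item_size` are hypothetical and not part of the existing reader. Each rank computes its own contiguous slice and seeks straight to it, so no rank ever holds the full field.

```python
import struct


def partition(n_items, n_ranks, rank):
    """Return (offset, count) of the contiguous block owned by `rank`.

    Items are split as evenly as possible; the first `n_items % n_ranks`
    ranks each take one extra item.
    """
    base, extra = divmod(n_items, n_ranks)
    count = base + (1 if rank < extra else 0)
    offset = rank * base + min(rank, extra)
    return offset, count


def read_partial(path, data_start, n_items, n_ranks, rank, item_size=8):
    """Read only this rank's slice of a flat float64 array.

    Assumption: the field begins at byte `data_start` and is stored as
    `n_items` little-endian doubles. This is a hypothetical layout, not
    the actual PIO format.
    """
    offset, count = partition(n_items, n_ranks, rank)
    with open(path, "rb") as f:
        # Skip directly to this rank's first value instead of reading
        # the whole field.
        f.seek(data_start + offset * item_size)
        raw = f.read(count * item_size)
    return struct.unpack(f"<{count}d", raw)
```

In an MPI setting, `rank` and `n_ranks` would come from the communicator, and each rank would call `read_partial` independently. Per-rank memory then scales with `n_items / n_ranks` rather than `n_items`, which addresses both the time and the space problem described above.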