New reference-return strategy to reduce communication volume in multiprocessing; /close #322 (closed) after testing on HPC systems