{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T09:14:49Z","timestamp":1696410889887},"reference-count":64,"publisher":"Wiley","issue":"24","license":[{"start":{"date-parts":[[2016,7,19]],"date-time":"2016-07-19T00:00:00Z","timestamp":1468886400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"funder":[{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["NRF\u20102015R1A1A1A05001480"],"award-info":[{"award-number":["NRF\u20102015R1A1A1A05001480"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J Comput Chem"],"published-print":{"date-parts":[[2016,9,15]]},"abstract":"<jats:p>We investigated the performance of heterogeneous computing with graphics processing units (GPUs) and many integrated core (MIC) with 20 CPU cores (20\u00d7CPU). As a practical example toward large scale electronic structure calculations using grid\u2010based methods, we evaluated the Hartree potentials of silver nanoparticles with various sizes (3.1, 3.7, 4.9, 6.1, and 6.9 nm) via a direct integral method supported by the sinc basis set. The so\u2010called work stealing scheduler was used for efficient heterogeneous computing via the balanced dynamic distribution of workloads between all processors on a given architecture without any prior information on their individual performances. 20\u00d7CPU\u2009+\u20091GPU was up to \u223c1.5 and \u223c3.1 times faster than 1GPU and 20\u00d7CPU, respectively. 20\u00d7CPU\u2009+\u20092GPU was \u223c4.3 times faster than 20\u00d7CPU. The performance enhancement by CPU\u2009+\u2009MIC was considerably lower than expected because of the large initialization overhead of MIC, although its theoretical performance is similar with that of CPU\u2009+\u2009GPU. \u00a9 2016 Wiley Periodicals, Inc.<\/jats:p>","DOI":"10.1002\/jcc.24443","type":"journal-article","created":{"date-parts":[[2016,7,19]],"date-time":"2016-07-19T09:20:46Z","timestamp":1468920046000},"page":"2193-2201","source":"Crossref","is-referenced-by-count":8,"title":["Performance of heterogeneous computing with graphics processing unit and many integrated core for hartree potential calculations on a numerical grid"],"prefix":"10.1002","volume":"37","author":[{"given":"Sunghwan","family":"Choi","sequence":"first","affiliation":[{"name":"Department of Chemistry KAIST, 291 Daehak\u2010Ro Yuseong\u2010Gu Daejeon 34141 Republic of Korea"},{"name":"Supercomputing Service Center, Korea Institute of Science and Technology Information 245 Daehak\u2010Ro Yuseong\u2010Gu Daejeon 34141 Republic of Korea"}]},{"given":"Oh\u2010Kyoung","family":"Kwon","sequence":"additional","affiliation":[{"name":"Supercomputing Service Center, Korea Institute of Science and Technology Information 245 Daehak\u2010Ro Yuseong\u2010Gu Daejeon 34141 Republic of Korea"},{"name":"School of Computing KAIST, 291 Daehak\u2010Ro Yuseong\u2010Gu Daejeon 34141 Republic of Korea"}]},{"given":"Jaewook","family":"Kim","sequence":"additional","affiliation":[{"name":"Department of Chemistry KAIST, 291 Daehak\u2010Ro Yuseong\u2010Gu Daejeon 34141 Republic of Korea"}]},{"given":"Woo Youn","family":"Kim","sequence":"additional","affiliation":[{"name":"Department of Chemistry KAIST, 291 Daehak\u2010Ro Yuseong\u2010Gu Daejeon 34141 Republic of Korea"}]}],"member":"311","published-online":{"date-parts":[[2016,7,19]]},"reference":[{"key":"e_1_2_6_1_1","doi-asserted-by":"publisher","DOI":"10.1103\/RevModPhys.72.1041"},{"key":"e_1_2_6_2_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct501128u"},{"key":"e_1_2_6_3_1","doi-asserted-by":"publisher","DOI":"10.1039\/C5CP00352K"},{"key":"e_1_2_6_4_1","doi-asserted-by":"publisher","DOI":"10.1039\/C5CP00351B"},{"key":"e_1_2_6_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2009.11.038"},{"key":"e_1_2_6_6_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.4942925"},{"key":"e_1_2_6_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2012.23"},{"key":"e_1_2_6_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cma.2011.01.013"},{"key":"e_1_2_6_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1816038.1816021"},{"key":"e_1_2_6_10_1","doi-asserted-by":"crossref","unstructured":"J.Alberdi\u2010Rodriguez M. J. T.Oliveira P.Garc\u00eda\u2010Risue\u00f1o F.Nogueira J.Muguerza A.Arruabarrena A.Rubio inRecent Memory and Performance Improvements in Octopus Code;B.Murgante S.Misra A. M. A. C.Rocha C.Torre J. G.Rocha M. I.Falc\u00e3o D.Taniar B. O.Apduhan andO.Gervasi Eds.;Lecture Notes in Computer Science 8582;Springer International Publishing Cham Switzerland 2014 pp607\u2013622.","DOI":"10.1007\/978-3-319-09147-1_44"},{"key":"e_1_2_6_11_1","doi-asserted-by":"crossref","unstructured":"A.Harju T.Siro F. F.Canova S.Hakala T.Rantalaiho inComputational Physics on Graphics Processing Units;P.ManninenandP.\u00d6ster Eds.;Lecture Notes in Computer Science 7782;Spinger Berlin Heidelberg Heidelberg 2013 pp3\u201326.","DOI":"10.1007\/978-3-642-36803-5_1"},{"key":"e_1_2_6_12_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct501189u"},{"key":"e_1_2_6_13_1","unstructured":"E.Apra M.Klemm K.Kowalski In SC14: International Conference for High Performance Computing Networking Storage and Analysis; IEEE: New Orleans LA 2014."},{"key":"e_1_2_6_14_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct300526w"},{"key":"e_1_2_6_15_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct301130u"},{"key":"e_1_2_6_16_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct1007247"},{"key":"e_1_2_6_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-011-0179-2"},{"key":"e_1_2_6_18_1","doi-asserted-by":"publisher","DOI":"10.1002\/jcc.21815"},{"key":"e_1_2_6_19_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct3001798"},{"key":"e_1_2_6_20_1","doi-asserted-by":"publisher","DOI":"10.1021\/ar500229p"},{"key":"e_1_2_6_21_1","doi-asserted-by":"publisher","DOI":"10.1080\/00268976.2013.874599"},{"key":"e_1_2_6_22_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct300754n"},{"key":"e_1_2_6_23_1","doi-asserted-by":"publisher","DOI":"10.1002\/wcms.1101"},{"key":"e_1_2_6_24_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct100701w"},{"key":"e_1_2_6_25_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct100584w"},{"key":"e_1_2_6_26_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct9003004"},{"key":"e_1_2_6_27_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct800526s"},{"key":"e_1_2_6_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2008.148"},{"key":"e_1_2_6_29_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct700268q"},{"key":"e_1_2_6_30_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct3004645"},{"key":"e_1_2_6_31_1","unstructured":"Y.Hasegawa J.\u2010I.Iwata M.Tsuji D.Takahashi A.Oshiyama K.Minami T.Boku F.Shoji A.Uno M.Kurokawa H.Inoue I.Miyoshi M.Yokokawa In 2011 International Conference for High Performance Computing Networking Storage and Analysis (SC); IEEE: Seatle WA 2011."},{"key":"e_1_2_6_32_1","unstructured":"F.Spiga I.Girotto In Proceedings \u2010 20th Euromicro International Conference on Parallel Distributed and Network\u2010Based Processing PDP 2012 IEEE: Garching 2012."},{"key":"e_1_2_6_33_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct300321a"},{"key":"e_1_2_6_34_1","doi-asserted-by":"crossref","unstructured":"A. T.Tzanov M. E.Tuckerman InComputational Techniques for Density Functional Based Molecular Dynamics Calculations in Plane\u2010Wave and Localized Basis Sets;V.Bach L.Delle Site Eds.;Springer International Publishing:Cham 2014 pp.261\u2013283.","DOI":"10.1007\/978-3-319-06379-9_15"},{"key":"e_1_2_6_35_1","doi-asserted-by":"publisher","DOI":"10.1002\/jcc.21576"},{"key":"e_1_2_6_36_1","doi-asserted-by":"publisher","DOI":"10.1002\/qua.24880"},{"key":"e_1_2_6_37_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.4921956"},{"key":"e_1_2_6_38_1","first-page":"63","volume-title":"Lecture Notes in Computer Science 7782","author":"Hakala S.","year":"2013"},{"key":"e_1_2_6_39_1","first-page":"401","volume-title":"Harnessing the Power of Graphic Processing Units","author":"Andrade X.","year":"2012"},{"key":"e_1_2_6_40_1","doi-asserted-by":"publisher","DOI":"10.1002\/qua.24819"},{"key":"e_1_2_6_41_1","doi-asserted-by":"publisher","DOI":"10.1038\/nchem.2099"},{"key":"e_1_2_6_42_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct8001046"},{"key":"e_1_2_6_43_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct400520e"},{"key":"e_1_2_6_44_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.2335442"},{"key":"e_1_2_6_45_1","doi-asserted-by":"publisher","DOI":"10.1002\/jcc.23487"},{"key":"e_1_2_6_46_1","doi-asserted-by":"publisher","DOI":"10.1002\/bkcs.10189"},{"key":"e_1_2_6_47_1","doi-asserted-by":"publisher","DOI":"10.1080\/00268976.2013.810793"},{"key":"e_1_2_6_48_1","doi-asserted-by":"publisher","DOI":"10.1142\/S1793962314410037"},{"key":"e_1_2_6_49_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.1768161"},{"key":"e_1_2_6_50_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.acha.2005.01.003"},{"key":"e_1_2_6_51_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.1791051"},{"key":"e_1_2_6_52_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.2436880"},{"key":"e_1_2_6_53_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.3036423"},{"key":"e_1_2_6_54_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.3291027"},{"key":"e_1_2_6_55_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.1898206"},{"key":"e_1_2_6_56_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.3009264"},{"key":"e_1_2_6_57_1","first-page":"1166","volume":"4","author":"Toivanen E. A.","year":"2015","journal-title":"Phys. Chem. Chem. Phys"},{"key":"e_1_2_6_58_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.4913569"},{"key":"e_1_2_6_59_1","unstructured":"Steven G. Johnson Faddeeva Package http:\/\/ab-initio.mit.edu\/wiki\/index.php\/Faddeeva_Package(accessed June 29 2016)."},{"key":"e_1_2_6_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/324133.324234"},{"key":"e_1_2_6_61_1","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jctc.5b00419"},{"key":"e_1_2_6_62_1","unstructured":"S. Choi HeterogeneousHartree https:\/\/gitlab.com\/sunghwan\/HeterogeneousHartree.git(accessed June 29 2016)."},{"key":"e_1_2_6_63_1","unstructured":"Dal Corso E. Kucukbenli THEOS pseudopotentials http:\/\/theossrv1.epfl.ch\/Main\/Pseudopotentials(accessed June 29 2016)."},{"key":"e_1_2_6_64_1","doi-asserted-by":"publisher","DOI":"10.1021\/ct4010596"}],"container-title":["Journal of Computational Chemistry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fjcc.24443","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/jcc.24443","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T00:59:47Z","timestamp":1696381187000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/jcc.24443"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,19]]},"references-count":64,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2016,9,15]]}},"alternative-id":["10.1002\/jcc.24443"],"URL":"https:\/\/doi.org\/10.1002\/jcc.24443","archive":["Portico"],"relation":{},"ISSN":["0192-8651","1096-987X"],"issn-type":[{"value":"0192-8651","type":"print"},{"value":"1096-987X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,7,19]]}}}