| Name | Faculty | Position |
|---|---|---|
| TATEBE Osamu | | Professor/Chief |
| TAKAHASHI Daisuke | | Professor |
| NUKADA Akira | | Professor |
| TSUJI Miwako | | Professor |
| TADANO Hiroto | | Associate Professor |
| FUJITA Norihisa | | Assistant Professor |
| MAEDA Munenori | | Senior Researcher |
| HANAWA Toshihiro | Information Technology Center, The University of Tokyo / University of Tsukuba | Professor / Visiting Associate Professor |
| SAKURAI Tetsuya | Graduate School of Systems and Information Engineering | Professor (Collaborative Fellow) |
| YAMAGUCHI Yoshiki | Graduate School of Systems and Information Engineering | Professor (Collaborative Fellow) |
| IMAKURA Akira | Graduate School of Systems and Information Engineering | Associate Professor (Collaborative Fellow) |

| Name (Japanese) | 建部 修見 |
|---|---|
| Name | TATEBE Osamu |
| Faculty | |
| Section | |
| Position | Professor/Chief |
| Theme | High Performance Computing, Grid Computing, Distributed File System |
| Related Links | |

tatebe cs.tsukuba.ac.jp

Research Interests
Academic & Professional Experience
- Apr 2015-Present: University of Tsukuba, Professor
- Apr 2006-Mar 2015: University of Tsukuba, Associate Professor
- Oct 2005-Mar 2006: National Institute of Advanced Industrial Science and Technology (AIST), Senior Researcher
- Apr 2001-Sep 2005: National Institute of Advanced Industrial Science and Technology (AIST), Researcher
- Apr 1997-Mar 2001: Electrotechnical Laboratory, Researcher

Published Papers
- Performance Evaluation of the Shared Memory System on the AMD MI300A APU. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-203 (2) Mar 2026
- LocustaRPC: A Scalable RPC Infrastructure for Next-Generation Leadership Machines. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-203 (9) Mar 2026
- FS3.0: A Study on the HPCI Operational System Development Plan with a View to the Fugaku NEXT Era. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-203 (33) Mar 2026
- Tracing the RPC Lifecycle for Performance Analysis in Margo-Based HPC Data Services (unrefereed). IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-203 (10) Mar 2026
- SCA/HPCAsiaWS '26: Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region Workshops Jan 2026 [Refereed]
- Performance Evaluation of Large Language Model KV Caches Using Third-Generation Optane Memory. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-202 (3) Dec 2025
- Proposal of a Virtual File System for Network Transfer Benchmarks. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-202 (21) Dec 2025
- Block-Diagonal K-FAC: A Trade-off Between Curvature Information and Resource Efficiency. Proceedings of 17th International OPT Workshop on Optimization for Machine Learning (OPT) Dec 2025
- Proceedings of 4th Workshop on Re-envisioning Extreme-Scale I/O for Emerging Hybrid HPC Workloads (REX-IO) Sep 2025
- A Study of High-Performance RPC Schemes on Shared Memory Architectures. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-200 (32) Aug 2025
- Pluvio: Design of a Zero-Copy I/O Asynchronous Runtime for Ad Hoc File Systems. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-200 (33) Aug 2025
- BBView: Design of a View-Aware Burst Buffer Supporting MPI-IO. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-200 (34) Aug 2025
- Design of an Asynchronous RPC Infrastructure Toward Large-Scale High-Speed Storage Architectures. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-200 (31) Aug 2025
- AD-KFAC: Asynchronous Decentralized Distributed K-FAC with Dynamic Load Balancing and Fault Tolerance. Proceedings of 2025 10th International Conference on Machine Learning Technologies (ICMLT) May 2025 [Refereed]
- Asynchronous Decentralized Distributed K-FAC: Enhancing Training Efficiency and Load Balancing in Heterogeneous Environments (unrefereed). IPSJ SIG Notes 2024-HPC-197 (5) Dec 2024
- Performance Evaluation of Asynchronous I/O Integrating Asynchronous Scheduling of Communication and I/O Events in Distributed File Systems. IPSJ SIG Notes 2024-HPC-197 (16) Dec 2024
- FINCHFS: Design of Ad-Hoc File System for I/O Heavy HPC Workloads. Proceedings of IEEE International Conference on Cluster Computing (CLUSTER) Sep 2024 [Refereed]
- Distributed K-FAC Over Unstable Networks (unrefereed). IPSJ SIG Notes 2024-HPC-195 (13) Aug 2024
- Proceedings of 30th International European Conference on Parallel and Distributed Computing (Euro-Par) Aug 2024 [Refereed]
- Preliminary Performance Evaluation of GH200. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2024-HPC-195 (4) Aug 2024

Awards & Honors
- Oct 2024: Information Processing Society of Japan (IPSJ), FY2024 Computer Science Area Achievement Award. Citation: He has conducted outstanding research in the field of parallel and distributed system software. In particular, his research and development of the Gfarm file system has won many awards at conferences in high-performance computing, including the High Performance Bandwidth Challenge at the international conference SC 2003, and is highly regarded internationally. He has also contributed greatly to developing talent in the CS area. Beyond these research and development achievements, he has served on the program committees of many international conferences and has made notable contributions to the advancement of the CS area.
- Nov 2018: Workshop Best Paper
- Oct 2018: IEEE Espana Best Paper Award
- Nov 2006: IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC06), SC 2006 HPC Storage Challenge, Winner – Large Systems

Books etc
- Advanced Software Technologies for Post-Peta Scale Computing (Role: Contributor, System Software for Data-Intensive Science)
- ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XIX (Role: Contributor, Impact of Gfarm, a Wide-area Distributed File System, upon Astronomical Data Analysis and Virtual Observatory)
- File Sharing and Grid Technology (Role: Sole author)
- Introduction to Grid Technology (Role: Sole author)
- MGCG METHOD: A ROBUST AND HIGHLY PARALLEL ITERATIVE METHOD (Role: Sole author)

Research Grants & Projects
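- Research on Next-Generation Storage Architecture. Grant-in-Aid for Scientific Research (A). Research period: 2022 - 2026
- EBD: Fundamental Technologies for Extreme Big Data Toward Next-Generation Yottabyte-per-Year Processing. CREST (Strategic Basic Research Programs) (Co-investigator). Research period: Sep 2013 - Mar 2019
- Research on machine learning system toward prediction of extreme weather. Grants-in-Aid for Scientific Research. Research period: 2017 - 2019
- System Software for Post Petascale Data Intensive Science. Research period: Apr 2011 - Mar 2017
- Research on a Scalable Wide-Area File System. Grant-in-Aid for Scientific Research on Priority Areas. Research period: 2009 - 2010
- Research on a Scalable Wide-Area Distributed File System Supporting the Information Explosion Era. Grant-in-Aid for Scientific Research on Priority Areas. Research period: 2007 - 2008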

| Name (Japanese) | 高橋 大介 |
|---|---|
| Name | TAKAHASHI Daisuke |
| Faculty | |
| Section | |
| Position | Professor |
| Theme | High-performance computing: High-performance numerical algorithms on parallel computers and performance evaluation |
| Related Links | |

daisuke cs.tsukuba.ac.jp

Research Interests
Academic & Professional Experience
- Apr 2016-Present: University of Tsukuba, Center for Computational Sciences, Professor
- May 2012-Mar 2016: University of Tsukuba, Faculty of Engineering, Information and Systems, Professor
- Oct 2011-May 2012: University of Tsukuba, Faculty of Engineering, Information and Systems, Associate Professor
- Apr 2007-Sep 2011: University of Tsukuba, Graduate School of Systems and Information Engineering, Associate Professor
- Jul 2006-Mar 2007: University of Tsukuba, Graduate School of Systems and Information Engineering, Associate Professor
- Jun 2006-Mar 2007: Toyohashi University of Technology, Faculty of Engineering, Lecturer
- Apr 2004-Jul 2006: University of Tsukuba, Graduate School of Systems and Information Engineering, Assistant Professor
- Oct 2001-Mar 2004: University of Tsukuba, Institute of Information Sciences and Electronics, Assistant Professor
- Feb 2000-Sep 2001: Saitama University, Graduate School of Science and Engineering, Research Associate
- Apr 2000-Mar 2001: Nagoya University, Graduate School of Engineering, Lecturer
- Apr 1999-Jan 2000: The University of Tokyo, Information Technology Center, Research Associate
- Apr 1997-Mar 1999: The University of Tokyo, Computer Centre, Research Associate

Published Papers
GH200の予備性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2024-HPC-195 (4) Aug 2024CLUSTER Workshops 2024IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING 11 (1) Jan 2023[Refereed]ANNALS OF COMBINATORICS 26 (2) Jun 2022[Refereed]RAMANUJAN JOURNAL Epub Jul 2021[Refereed]CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 32 (7) Apr 2020[Refereed]RAMANUJAN JOURNAL 51 (1) Jan 2020[Refereed]Contemporary High Performance Computing May 2019Proceedings of the IEEE 106 (11) Nov 2018[Refereed]Parallel Computing 75 Jul 2018[Refereed]数学定数の特定の桁を計算するBBP型公式の⾼速計算法⽇本応用数理学会2017年度年会講演予稿集 Sep 2017Xeon Phiプロセッサにおける並列⼀次元実数FFTの実現と評価日本応用数理学会2017年度年会講演予稿集 Sep 2017Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017 Jun 2017[Refereed]Knights Landingクラスタにおける並列FFTの⾃動チューニング2017年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2017論文集 Jun 2017Xeon Phiクラスタ上の並列FFTにおける通信隠蔽の⾃動チューニング計算⼯学講演会論⽂集 22 May 2017Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10404 2017[Refereed]2016 IEEE 10TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC) 201623RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015) 2015Learning Weights of Training Data by Game ResultsIPSJ Journal 55 (11) Nov 2014[Refereed]INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 28 (3) Aug 2014[Refereed]JOURNAL OF COMPUTATIONAL CHEMISTRY 35 (18) Jul 2014[Refereed]GPU/MICクラスタにおける疎行列ベクトル積の性能評価IPSJ SIG Notes 2014 (4) May 2014PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I 8384 2014[Refereed]COMPUTER PHYSICS COMMUNICATIONS 184 (9) Sep 2013[Refereed]GPUにおける4倍精度浮動小数点演算を用いたクリロフ部分空間法の高速化IPSJ SIG Notes 2013 (35) Jul 2013GPUクラスタにおける幅優先探索の高速化IPSJ SIG Notes 2013 (12) May 2013GPUにおける高速なCRS形式疎行列ベクトル積の実装IPSJ SIG Notes 2013-HPC-138 (5) Feb 2013Sustained Simulation Performance 2012 - Proceedings of the Joint Workshop on High Performance Computing 
on Vector Systems, and Workshop on Sustained Simulation Performance 2013[Refereed]Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 7975 (5) 2013[Refereed]Implementation and Evaluation of Triple and Quadruple Precision Floating-point Operations on GPUs情報処理学会論文誌. コンピューティングシステム 6 (1) Jan 2013[Refereed]2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013) 2013Highly scalable implementation of an N-body code on a GPU clusterComputer Physics Communications 184 2013[Refereed]GPUにおける4倍精度演算を用いた疎行列反復解法の実装と評価IPSJ SIG Notes 2012 (37) Dec 2012GPUにおける4倍精度演算を用いた疎行列反復解法の実装と評価IPSJ SIG Notes 2012 (37) Dec 2012大規模GPUクラスタにおけるN体計算コードの演算性能とスケーラビリティの評価IPSJ SIG Notes 2012 (1) Sep 2012並列言語XcalableMPのアクセラレータ向け言語拡張のOpenCL実装IPSJ SIG Notes 2012 (9) Mar 2012Multi-block/multi-core SSOR preconditioner for the QCD quark solver for K computerProceedings of Science 130497 2012PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012 9 2012[Refereed]Implementation and Evaluation of Quadruple Precision BLAS Functions on GPUsAPPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I 7133 (7133) 2012[Refereed]Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 7440 (2) 2012[Refereed]2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW) 2012[Refereed]2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW) 2012[Refereed]2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS) 2012[Refereed]15TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2012) / 10TH IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2012) 
2012[Refereed]GPUによる3倍精度浮動小数点演算の検討IPSJ SIG Notes 2011 (23) Nov 2011GPUによる3倍精度浮動小数点演算の検討IPSJ SIG Notes 2011 (23) Nov 2011GPU上における多倍長精度浮動小数点演算の実装IPSJ SIG Notes 2011 (25) Nov 2011GPU上における多倍長精度浮動小数点演算の実装IPSJ SIG Notes 2011 (25) Nov 2011演算加速装置に基づく超並列クラスタHA-PACSによる大規模計算科学IPSJ SIG Notes 2011 (21) Jul 2011Proceedings of 2011 SC - International Conference for High Performance Computing, Networking, Storage and Analysis 2011[Refereed]Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPUCOMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2011, PT II 6783 (6783) 2011[Refereed]Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Systems 2011[Refereed]Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPUIPSJ SIG Notes 2010 (19) Dec 2010Bulletin of the Japan Society for Industrial and applied Mathematics 20 (4) Dec 2010Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPUIPSJ SIG Notes 2010 (19) Dec 2010The Realization Probability Search Based on Search ResultsTransactions of Information Processing Society of Japan 51 (11) Nov 2010[Refereed]PARALLEL COMPUTING 36 (8) Aug 2010[Refereed]A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCHICGA JOURNAL 33 (2) Jun 2010[Refereed]A massively-parallel electronic-structure calculations based on real-space density functional theoryJOURNAL OF COMPUTATIONAL PHYSICS 229 (6) Mar 2010[Refereed]An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core ProcessorsPARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I 6067 (6067) 2010[Refereed]Break the World Record of π : The Road to 2,576,980,370,000 Decimal DigitsJournal of Information Processing Society of Japan 50 (12) Dec 2009A Shogi Program Based on Monte-Carlo Tree SearchTransactions of Information Processing Society of Japan 50 (11) Nov 2009[Refereed]Implementation and Evaluation of Quadruple 
Precision BLAS on GPUIPSJ SIG Notes 2009 (13) Nov 2009Implementation and Evaluation of Quadruple Precision BLAS on GPUIPSJ SIG Notes 2009 (13) Nov 2009Application and Performance Evaluation of the Volumetric Parallel 3D-FFT to 3D-RISM on Massively Parallel ClusterIPSJ SIG Notes 2009 (3) Oct 200926aQL-3 Collaboration with Computer ScienceMeeting abstracts of the Physical Society of Japan 64 (2) Aug 200926aQL-3 Collaboration with Computer ScienceMeeting abstracts of the Physical Society of Japan 64 (2) Aug 200926aQL-3 Collaboration with Computer ScienceMeeting abstracts of the Physical Society of Japan 64 (2) Aug 2009Implementation of an Othello Program Based on Monte-Carlo Tree Search by Using a Multi-Core Processor and SIMD Instructions情報処理学会研究報告. GI, [ゲーム情報学] 2009 (7) Jun 2009Implementation and Evaluation of Volumetric Parallel 3-D FFT on Massively Parallel Cluster of Multi-Core ProcessorsIPSJ SIG Notes 2009 (14) Feb 2009Implementation and Evaluation of Volumetric Parallel 3-D FFT on Massively Parallel Cluster of Multi-Core ProcessorsIPSJ SIG Notes 2009 (14) Feb 2009Design and Power Performance Evaluation of On-Chip Memory Processor with Arithmetic AcceleratorsProc. 2008 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA 2008) 2 (1) Jan 2009[Refereed]Performance Evaluation of Linpack on T2K-Tsukuba SystemIPSJ SIG Notes 2008 (74) Jul 2008FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE 24 (6) Jun 2008[Refereed]Efficient Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix Multiplication情報処理学会論文誌. 
コンピューティングシステム 1 (1) Jun 2008[Refereed]2U-4 A Shogi program using Monte-Carlo method全国大会講演論文集 70 (2) Mar 2008Empirical study for optimization of power-performance with on-chip memoryHIGH-PERFORMANCE COMPUTING 4759 (4759) 2008[Refereed]A parallel algorithm for multiple-precision division by a single-precision integerLARGE-SCALE SCIENTIFIC COMPUTING 4818 (4818) 2008[Refereed]Proceedings of the Innovative Architecture for Future Generation High-Performance Processors and Systems 2008[Refereed]Power performance evaluation of on-chip memory processor with arithmetic acceleratorsIPSJ SIG Notes 2007 (79) Aug 2007Increasing Neighbour Communication Performance Techniques for the PACS-CS SystemIPSJ SIG Notes 2007 (80) Aug 2007RI2N/UDP : High Bandwidth and Fault-tolerant Network for PC-cluster Based on Multi-link Ethernet(Network)情報処理学会論文誌. コンピューティングシステム 48 (8) May 2007[Refereed]Power-performance Evaluation on Ultra-Low Power High-performance Cluster System: MegaProto/EProc. IEEE Symposium on Low-Power and High-Speed Chips (COOL Chips X) Apr 2007[Refereed]A study on arithmetic accelerators for on-chip memory processor情報処理学会研究報告 2007 (17) Mar 2007A study on arithmetic accelerators for on-chip memory processorIPSJ SIG Notes 2007 (17) Mar 2007A study on arithmetic accelerators for on-chip memory processorIPSJ SIG Notes 2007 (17) Mar 2007Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM 2007[Refereed]High performance FFT on SGI Altix 3700HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS 4782 (4782) 2007[Refereed]An implementation of parallel 1-D FFT using SSE3 instructions on dual-core processorsApplied Parallel Computing 4699 (4699) 2007[Refereed]Implementation and evaluation of parallel FFT using SIMD instructions on multi-core processorsINNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS 2007[Refereed]Power Performance Evaluation and Power Performance Optimization on 
MegaProto/EIPSJ SIG Notes 2006 (106) Oct 2006Dividing program into regions for controlling DVFSIPSJ SIG Notes 2006 (106) Oct 2006High-bandwidth and Fault-tolerant Network for PC Clusters based on Tagged-VLAN and Multi-link Ethernet Technologies(Session 3:Cluster/Grid)IPSJ SIG Notes 2006 (106) Oct 2006Profile-based Optimization of Power Performance by Using Dynamic Voltage Scaling on a PC Cluster(Cluster Systems)IPSJ Transactions on Advanced Computing Systems 47 (12) Sep 2006[Refereed]Reducing Energy of Parallel Programs with Load Imbalance by Using DVS(Cluster Systems)IPSJ Transactions on Advanced Computing Systems 47 (12) Sep 2006[Refereed]VFREC-Net: Multi-path Network for PC Clusters Based on Tagged-VLAN Technology with Driver ControlIPSJ Transactions on Advanced Computing Systems 47 (SIG 12(ACS 15)) Sep 2006[Refereed]Power Performance Optimization using Total Power Profile on a PC clusterIPSJ SIG Notes 2006 (88) Jul 2006Implementation and Performance Evaluation of the Large Scale Cluster PACS-CS for Scientific ComputationIPSJ SIG Notes 2006 (87) Jul 2006A Design of High Performance Communication Library for the PACS-CS SystemIPSJ SIG Notes 2006 (87) Jul 2006Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix MultiplicationIPSJ SIG Notes 2006 (63) Jun 2006EthernetマルチリンクによるPCクラスタ向け耐故障ネットワークRI2N/UDP情報処理学会シンポジウム論文集 2006 (5) May 2006Design and Implementation of Grid RPC System Integrating Computing Resources on Multiple Grid-enabled Job Execution Systems(Grid System)情報処理学会論文誌. 
コンピューティングシステム 47 (7) May 2006[Refereed]Report on SC|05計算工学 = Journal of The Japan Society for Computational Engineering and Science (JSCES) 11 (2) Apr 2006Reducing energy of parallel programs with load imbalance by using DVSIPSJ SIG Notes 2006 (20) Feb 2006Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC clusterIPSJ SIG Notes 2006 (20) Feb 2006Reducing energy of parallel programs with load imbalance by using DVSIPSJ SIG Notes 2006 (20) Feb 2006RI2N/UDP: Fault-tolerant network for PC-clusters based on multi-link EthernetIPSJ SIG Notes 2006 (20) Feb 2006RI2N/UDP: Fault-tolerant network for PC-clusters based on multi-link EthernetIPSJ SIG Notes 2006 (20) Feb 2006PACS-CS: A large-scale bandwidth-aware PC cluster for scientific computationsSIXTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID 2006[Refereed]Formation of dwarf galaxies in reionized universe with heterogeneous multicomputer systemINTERNATIONAL JOURNAL FOR MULTISCALE COMPUTATIONAL ENGINEERING 4 (2) 2006Computation of high-precision mathematical constants in a combined cluster and grid environmentLARGE-SCALE SCIENTIFIC COMPUTING 3743 (3743) 2006[Refereed]A parallel method for large sparse generalized eigenvalue problems by OmniRPC in a grid environmentAPPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING 3732 (3732) 2006[Refereed]An implementation of parallel 3-D FFT using short vector SIMD instructions on clusters of PCsAPPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING 3732 (3732) 2006[Refereed]20th International Parallel and Distributed Processing Symposium, IPDPS 2006 2006 (20) 200620th International Parallel and Distributed Processing Symposium, IPDPS 2006 2006 2006[Refereed]20th International Parallel and Distributed Processing Symposium, IPDPS 2006 2006 2006[Refereed]Performance improvement by data management layer in a grid RPC systemADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS 3947 (3947) 2006[Refereed]A 
hybrid MPI/OpenMP implementation of a parallel 3-D FFT on SMP clustersPARALLEL PROCESSING AND APPLIED MATHEMATICS 3911 (3911) 2006[Refereed]Emprical study on reducing energy of parallel programs using slack reclamation by DVFS in a power-scalable high performance cluster2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2 2006[Refereed]Performance Improvement by Initial Data Management on Grid RPC System OmniRPCIPSJ SIG Notes 2005 (97) Oct 2005High-bandwidth Tree Network for PC Clusters based on Tagged-VLAN TechnologyIPSJ SIG Notes 2005 (97) Oct 2005Optimization and Evaluation of Power Performance by Using On-Chip RAMIPSJ SIG Notes 2005 (80) Aug 2005Optimization of Power-Performance by controlling DVS on a PC clusterIPSJ SIG Notes 2005 (80) Aug 2005MegaProto : A Low-power and Compact Cluster for High-performance Computing(HPC Hardware)情報処理学会論文誌. コンピューティングシステム 46 (12) Aug 2005[Refereed]Design and Implementation of a Grid RPC System on Multiple Grid MiddlewaresIPSJ SIG Notes 2005 (81) Aug 2005"FIRST"-a hybrid cluster system for the elucidation on the origin of FIRST generation objects in the universeIPSJ SIG Notes 2005 (81) Aug 2005OmniRPC Grid Parallel Programming Environment for a Large Scale Numerical ComputationProc. 17th IMACS World Congress Scientific Computation, Applied Mathematics and Simulation Jul 2005APPLIED MATHEMATICS AND COMPUTATION 166 (2) Jul 2005[Refereed]Design of Software Distributed Shared Memory System Using MPI Communication Layer(Software DSM)情報処理学会論文誌. コンピューティングシステム 46 (7) May 2005[Refereed]A Master-worker Type Parallel Method for Large-scale Eigenvalue ProblemsIPSJ Transactions on Advanced Computing Systems 46 (SIG 7(ACS 10)) May 2005[Refereed]MegaProto: A Low-Power and Compact Cluster for High-Performance ComputingProc. 
19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05), Workshop on High Performance, Power-Aware Computing (HPPAC) 162 Apr 2005[Refereed]MegaProto : A Low-Power and Compact Cluster for High-Performance ComputingIPSJ SIG Notes 2005 (19) Mar 2005MegaProto : A Low-Power and Compact Cluster for High-Performance Computing情報処理学会研究報告. ARC,計算機アーキテクチャ研究会報告 162 Mar 2005Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN 2005 (7) 2005[Refereed]Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN 2005 2005[Refereed]MegaProto: A Low-Level and Compact Cluster for High-Performance ComputingProc. of HP-PAC05 (in IPDPS2005), Denver Jan 2005[Refereed]Proceedings - 19th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2005 2005 2005[Refereed]Proceedings - 19th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2005 2005 2005[Refereed]Design of a software distributed shared memory system using an MPI communication layer8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings 46 (7) 2005[Refereed]Proceedings of the ACM/IEEE 2005 Supercomputing Conference, SC'05 2005 2005[Refereed]Design of a software distributed shared memory system using an MPI communication layer8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings 2005[Refereed]Computing Environment Independent Interface for Matrix Computation LibraryIPSJ SIG Notes 2004 (128) Dec 2004OpenMPI --- OpenMP like tool for easy programming in MPIProc. 6th European Workshop on OpenMP (EWOMP 2004) Nov 2004[Refereed]Implementation and Evaluation of Parallel FFT Using Short Vector SIMD Instructions(Performance Optimization)情報処理学会論文誌. 
コンピューティングシステム 45 (11) Oct 2004[Refereed]Measurement of Microprocessor's Power Consumption and Prototyping Low Power Cluster with Low Power Processors(Power Conservation)情報処理学会論文誌. コンピューティングシステム 45 (SIG11(ACS7)) Oct 2004[Refereed]Design of Grid RFC System OmniRPC on XtremWeb P2P GridIPSJ SIG Notes 2004 (81) Jul 2004Implementation and Performance Evaluation of CONFLEX-G: Grid-enabled Molecular Conformational Space Search Program with OmniRPCProc. 18th International Conference on Supercomputing (ICS'04) Jun 2004[Refereed]Implementation and Performance Evaluation of CONFLEX-G : A Grid Enabled Conformational Space Search Program by OmniRPC(Grid Applications)情報処理学会論文誌. コンピューティングシステム 45 (6) May 2004[Refereed]Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous Clusters(Numerical Computation)情報処理学会論文誌. コンピューティングシステム 45 (SIG06(ACS6)) May 2004OmniRPCによるグリッド環境での大規模固有値問題の並列解法 (数値解析と新しい情報技術)RIMS Kokyuroku 1362 Apr 2004Parallel Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous ClustersProc. 18th International Parallel and Distributed Processing Symposium (IPDPS'04), The 13th Heterogeneous Computing Workshop (HCW 2004) Apr 2004[Refereed]Software Distributed Shared Memory System on MPIIPSJ SIG Notes 2004 (38) Apr 2004Measurement and Characterization for Power Consumption of Microprocessors for Power-aware ClusterProc. 
An International Symposium on Low-Power and High-Speed Chips (COOL Chips VII) Apr 2004[Refereed]Heterogeneous remote computing system for computational astrophysics with OmniRPC2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS 2004[Refereed]Formation of dwarf galaxies in reionized universe with heterogeneous multi-computer systemCOMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS 3039 2004[Refereed]Performance evaluation of OmniRPC in a grid environment2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS 2004[Refereed]ACM International Conference Proceeding Series 68 2004[Refereed]Formation of dwarf galaxies in reionized universe with heterogeneous multi-computer systemCOMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS 3039 (2) 2004[Refereed]Implementation of First Touch page allocation on Omni/SCASHIPSJ SIG Notes 2003 (84) Aug 2003Low Power Cluster using Low Power CPUIPSJ SIG Notes 2003 (84) Aug 2003Performance evalusation of RI2N-Interconnection network system for clusters with wide-bandwidth and fault-tolerancyIPSJ SIG Notes 2003 (83) Aug 2003Perfomance Evaluation of Grid Applications by OmniRPC in Wide Area NetworkIPSJ SIG Notes 2003 (83) Aug 2003HMCS-G : Grid-enabled Hybrid Computing System for Computational Astrophysics (Grid Applications)IPSJ Transactions on Computing Systems 44 (11) Aug 2003PARALLEL COMPUTING 29 (6) Jun 2003[Refereed]Performance evaluation of the Hitachi SR8000 using SPEC OMP2001 benchmarksINTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING 31 (3) Jun 2003[Refereed]Remote accessing environment of GRAPE-6 gravity engineIPSJ SIG Notes 2003 (62) Jun 2003高バンド幅/耐故障性を持つクラスタ向け結合ネットワークRI2N情報処理学会シンポジウム論文集 2003 (8) May 2003SMP Configuration and Performance Evaluation of SCIMA --- On-chip Memory Processor Architecture for HPCIPSJ Transactions on Advanced Computing Systems 44 (SIG 6(ACS 1)) May 2003[Refereed]COMPUTER PHYSICS COMMUNICATIONS 152 (2) May 2003[Refereed]SMP Configuration and Performance 
Evaluation of SCIMA On-chip Memory Processor Architecture for HPC情報処理学会論文誌. コンピューティングシステム 44 (6) May 2003RI2N - Interconnection network system for clusters with wide-band width and fault-tolerancy based on multiple・linksIPSJ SIG Notes 2003 (29) Mar 2003Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous ClustersIPSJ SIG Notes 2003 (29) Mar 2003HMCS-G: Grid-enabled hybrid computing system for computational astrophysicsCCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS 2003[Refereed]OmniRPC: A grid RPC system for parallel programming in cluster and grid environmentCCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS 2003[Refereed]An OpenMP implementation of parallel FFT and its performance on IA-64 processorsOPENMP SHARED MEMORY PARALLEL PROGRAMMING 2716 (2716) 2003[Refereed]A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS 2 2003[Refereed]Proceedings - CCGrid 2003: 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid 44 (SIG 11(ACS 3)) 2003[Refereed]Proceedings - CCGrid 2003: 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid 44 (SIG 11(ACS 3)) 2003[Refereed]OmniRPC: A grid RPC system for parallel programming in cluster and grid environmentCCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS 44 (SIG 11(ACS 3)) 2003[Refereed]RI2N - Interconnection network system for clusters with wide-bandwidth and fault-tolerancy based on-multiple linksHIGH PERFORMANCE COMPUTING 2858 (2858) 2003[Refereed]A Feasibility Study on an Itanium-based ClusterIPSJ SIG Notes 2002 (99) Oct 2002Parallel Forward Deduction System for General-Purpose Entailment Calculus on Clusters of PCsProc. 
IASTED International Conference on Parallel and Distributed Computing, Applications and Technologies (NPDPA 2002) Oct 2002[Refereed]OmniRPC : a Grid RPC System for Parallel Programming in Grid EnvironmentIPSJ SIG Notes 2002 (99) Oct 2002A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCsIPSJ Transactions on High Performance Computing Systems 43 (SIG 6(HPS 5)) Sep 2002[Refereed]Hybrid Parallelization for SPAM Particle Simulation on SMP-PC Clusters情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002Hybrid Parallelization for SPAM Particle Simulation on SMP-PC ClustersIPSJ Transactions on High Performance Computing Systems 43 (SIG6(HPS 5)) Sep 2002[Refereed]Improving Performance of Automated Forward Deduction System EnCal on Shared-Memory Parallel ComputersProc. Third International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT 2002) Sep 2002[Refereed]A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCs情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002SMP configuration and performance evaluation of SCIMA : on-chip memory processor architecture for HPCIPSJ SIG Notes 2002 (81) Aug 2002Performance Evaluation of Omni/SCASH Software Distributed Shared Memory System on Ethernet-based ClusterIPSJ SIG Notes 2002 (80) Aug 2002A Blocking Algorithm for Parallel FFT on Shared-memory Parallel ComputersIPSJ Journal 43 (4) Apr 2002[Refereed]A Blocking Algorithm for Parallel FFT on Shared-memory Parallel Computers(Parallel Processing) IPSJ Journal 43 (4) Apr 2002Performance Evaluation of the Hitachi SR8000 Under OpenMP BenchmarksIPSJ SIG Notes 2002 (22) Mar 2002Performance Evaluation of the Hitachi SR8000 Under OpenMP BenchmarksIPSJ SIG Notes 2002 (22) Mar 2002Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2327 (2327) 2002[Refereed]A blocking algorithm for parallel 1-D FFT on shared-memory parallel computersAPPLIED PARALLEL COMPUTING 2367 (2367) 
2002[Refereed]A blocking algorithm for parallel 1-D FFT on clusters of PCsEURO-PAR 2002 PARALLEL PROCESSING, PROCEEDINGS 2400 (2400) 2002[Refereed]Parallel Forward Deduction Algorithms of General-Purpose Entailment Calculus on Shared-Memory Parallel ComputersProc. 2nd International Conference on Software Engineering, Artificial Intelligence, Networking & Parallel/Distributed Computing (SNPD'01) Aug 2001[Refereed]A Blocking Algorithm for Parallel FFT on SMP ClustersIPSJ SIG Notes 2001 (77) Jul 2001An extended split-radix FFT algorithmIEEE SIGNAL PROCESSING LETTERS 8 (5) May 2001[Refereed]A Mixed-Radix Parallel Three-Dimensional FFT Algorithm on Clusters of Vector SMPsProc. Tenth SIAM Conference on Parallel Processing for Scientific Computing (PP01) Mar 2001[Refereed]A parallel 3-D FFT algorithm on clusters of vector SMPsAPPLIED PARALLEL COMPUTING, PROCEEDINGS 1947 (1947) 2001[Refereed]A performance study on a single processing node of the HITACHI SR8000NUMERICAL ANALYSIS AND ITS APPLICATIONS 1988 (1988) 2001[Refereed]A blocking algorithm for FFT on cache-based processorsHIGH-PERFORMANCE COMPUTING AND NETWORKING 2110 (2110) 2001[Refereed]A fast algorithm for computing large Fibonacci numbersINFORMATION PROCESSING LETTERS 75 (6) Nov 2000[Refereed]An Extended Split-Radix FFT AlgorithmIPSJ SIG Notes 2000 (93) Oct 2000Efficient Implementation of CG & CR Methods for Linear Systems on a Single Processing Node of HITACHI SR8000Proc. 
2000 International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC2000) Jul 2000[Refereed]A New Radix-8 FFT Kernel Suitable for Multiply-Add InstructionIPSJ Journal 41 (7) Jul 2000[Refereed]A Fast Algorithm for Computing Fibonacci NumbersIPSJ Journal 41 (6) Jun 2000A Fast Algorithm for Computing Fibonacci NumbersIPSJ Journal 41 (6) Jun 2000[Refereed]A Divide and Rationalize Method which Improves the Multiple-Precision Function Computation with Series ExpansionIPSJ Journal 41 (6) Jun 2000[Refereed]High-performance radix-2, 3 and 5 parallel 1-D complex FFT algorithms for distributed-memory parallel computersJOURNAL OF SUPERCOMPUTING 15 (2) Feb 2000[Refereed]Proceedings - 4th International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region, HPC-Asia 2000 1 2000[Refereed]A new radix-6 FFT algorithm suitable for multiply-add instruction2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI 6 2000[Refereed]Implementation of multiple-precision parallel division and square root on distributed-memory parallel computers2000 INTERNATIONAL WORKSHOPS ON PARALLEL PROCESSING, PROCEEDINGS 2000[Refereed]Fast High-Precision Arithmetic on Distributed Memory Parallel MachinesProc. 
Ninth SIAM Conference on Parallel Processing for Scientific Computing Mar 1999 [Refereed]
Calculation of pi to 51.5 Billion Decimal Digits on Distributed Memory Parallel Processors. Transactions of Information Processing Society of Japan 39 (7) Jul 1998 [Refereed]
Implementation and Evaluation of Radix-2, 3 and 5 1-D FFT on Distributed Memory Parallel Computers. Transactions of Information Processing Society of Japan 39 (3) Mar 1998 [Refereed]
Improvement of the Algorithms for pi Calculation: The Gauss-Legendre Algorithm and the Borwein's Quartically Convergent Algorithm. Transactions of Information Processing Society of Japan 38 (11) Nov 1997 [Refereed]
An Implementation of Factorization on Massively Parallel SIMD Computers. Transactions of Information Processing Society of Japan 36 (11) Nov 1995 [Refereed]
Awards & Honors
Feb 2024 IEEE Computer Society, 2023 Class of IEEE Computer Society Distinguished Contributors
Sep 2021 The IEEE International Conference on Cluster Computing (Cluster 2021), Outstanding Service Award
Jul 2016 16th International Conference on Computational Science and Its Applications (ICCSA 2016), NVIDIA Best Paper Award
Nov 2011 Association for Computing Machinery, ACM Gordon Bell Prize
Apr 2010 Ministry of Education, Culture, Sports, Science and Technology, The Commendation for Science and Technology by the Minister of Education, Culture, Sports, Science and Technology
Nov 2009 Information Processing Society of Japan, Outstanding Paper Award, 14th Game Programming Workshop
Nov 2008 Information Processing Society of Japan, Outstanding Paper Award, 13th Game Programming Workshop
May 2004 Information Processing Society of Japan, IPSJ Best Paper Award
Jan 2003 Information Processing Society of Japan, Best Paper Award, 2003 Symposium on High Performance Computing and Computational Science (HPCS2003)
May 1999 Information Processing Society of Japan, IPSJ Best Paper Award
Oct 1998 Information Processing Society of Japan, IPSJ Yamashita SIG Research Award
Books etc
計算科学のためのHPC技術2 [HPC Techniques for Computational Science 2] (Role: Contributor, 計算科学のためのHPC技術2)
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II (Role: Contributor, Implementation of Multiple-Precision Floating-Point Arithmetic on Intel Xeon Phi Coprocessors)
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II (Role: Contributor, Parallel Sparse Matrix-Vector Multiplication Using Accelerators)
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2013, PT V (Role: Contributor, Optimization of Sparse Matrix-Vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs)
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012 (Role: Contributor, A Fast Implementation and Performance Analysis of Collisionless N-body Code Based on GPGPU)
Software Automatic Tuning: From Concepts to State-of-the-Art Results (Role: Sole author)
PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I (Role: Contributor, An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core Processors)
IT Text HPCプログラミング [IT Text: HPC Programming] (Role: Sole author)
LARGE-SCALE SCIENTIFIC COMPUTING (Role: Contributor, A parallel algorithm for multiple-precision division by a single-precision integer)
HIGH-PERFORMANCE COMPUTING (Role: Contributor, Empirical study for optimization of power-performance with on-chip memory)
INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS (Role: Contributor, Implementation and evaluation of parallel FFT using SIMD instructions on multi-core processors)
Applied Parallel Computing - STATE OF THE ART IN SCIENTIFIC COMPUTING (Role: Contributor, An implementation of parallel 1-D FFT using SSE3 instructions on dual-core processors)
Sixth IEEE International Symposium on Cluster Computing and the Grid - SPANNING THE WORLD AND BEYOND (Role: Contributor, PACS-CS: A large-scale bandwidth-aware PC cluster for scientific computations)
PARALLEL PROCESSING AND APPLIED MATHEMATICS (Role: Contributor, A hybrid MPI/OpenMP implementation of a parallel 3-D FFT on SMP clusters)
ADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS (Role: Contributor, Performance improvement by data management layer in a grid RPC system)
APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING (Role: Contributor, An implementation of parallel 3-D FFT using short vector SIMD instructions on clusters of PCs)
LARGE-SCALE SCIENTIFIC COMPUTING (Role: Contributor, Computation of high-precision mathematical constants in a combined cluster and grid environment)
2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2 (Role: Contributor, Empirical study on reducing energy of parallel programs using slack reclamation by DVFS in a power-scalable high performance cluster)
8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings (Role: Contributor, Design of a software distributed shared memory system using an MPI communication layer)
Parallel and Distributed Scientific and Engineering Computing: Practice and Experience (Role: Sole author)
2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS (Role: Contributor, Performance evaluation of OmniRPC in a grid environment)
COMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS (Role: Contributor, Formation of dwarf galaxies in reionized universe with heterogeneous multi-computer system)
HIGH PERFORMANCE COMPUTING (Role: Contributor, RI2N - Interconnection network system for clusters with wide-bandwidth and fault-tolerancy based on multiple links)
EURO-PAR 2002 PARALLEL PROCESSING, PROCEEDINGS (Role: Contributor, A blocking algorithm for parallel 1-D FFT on clusters of PCs)
Research Grants & Projects
Research on Multiple-Precision Arithmetic on Many-Core Massively Parallel Clusters. Grants-in-Aid for Scientific Research. Research period: Apr 2022 - Mar 2025
Research on algorithm of fast Fourier transform in exascale system. Grants-in-Aid for Scientific Research. Research period: Apr 2019 - Mar 2023
Research on Rational Number Arithmetic Library in Many-Core Massively Parallel Cluster. Grants-in-Aid for Scientific Research. Research period: 2016 - 2018
Real-time Analytics Frameworks for Big Heterogeneous Data in Composite Parallel Computing Environments. Grants-in-Aid for Scientific Research. Research period: Apr 2014 - Mar 2017
Materials Design through Computics: Complex Correlation and Non-Equilibrium Dynamics. Grants-in-Aid for Scientific Research. Research period: Apr 2010 - Mar 2016
Research on FFT Algorithms for Exa-Scale Computing Environment. Grants-in-Aid for Scientific Research. Research period: 2012 - 2014
Numerical Computation Algorithms for Large-scale Parallel Environment. Grants-in-Aid for Scientific Research. Research period: 2010 - 2014
Interdisciplinary algorithms and computer simulations. Grants-in-Aid for Scientific Research. Research period: 2008 - 2012
Research on FFT Algorithms for Peta-Scale Computing Environment. Grants-in-Aid for Scientific Research. Research period: 2010 - 2011
Adaptive Auto-tuning Technology Aiming Complex Multicore and Multiprocessor Environments. Grants-in-Aid for Scientific Research. Research period: 2009 - 2011
Research on FFT Algorithms for Many-Core Massively Parallel Clusters. Grants-in-Aid for Scientific Research. Research period: 2008 - 2009
Research on a Scalable Wide-Area Distributed File System Supporting the Information Explosion Era. Grants-in-Aid for Scientific Research. Research period: 2007 - 2008
Study on large-scale scalable P2P grid infrastructure for large-capacity distributed computing. Grants-in-Aid for Scientific Research. Research period: 2005 - 2007
Elucidation on the Origin of First Generation Objects by HMCS-E (Heterogeneous MultiComputer System-Embedded). Grants-in-Aid for Scientific Research. Research period: 2004 - 2007
A research on interconnection network for large scale clusters based on commodity network. Grants-in-Aid for Scientific Research. Research period: 2005 - 2006
Research on Parallel Fast Fourier Transform Algorithms in Heterogeneous Environments. Grant-in-Aid for Young Scientists (A). Research period: 2004 - 2006
Development of a Parallel FFT Library for Large-Scale Clusters. Commissioned research funded by investment. Research period: Aug 2004 - Jan 2005
Fast solvers of PDEs on a sphere. Grants-in-Aid for Scientific Research. Research period: 2002 - 2004
Study on advanced programming environment using OpenMP for a next generation high performance cluster system. Grants-in-Aid for Scientific Research. Research period: 2002 - 2004
Research on GRID Applications and Parallel Programming Systems in Computational Physics. Grants-in-Aid for Scientific Research. Research period: 2001 - 2004
Research on GRID Applications and Parallel Programming Systems in Computational Physics. Grants-in-Aid for Scientific Research. Research period: 2003 - 2003
A research on high performance file server with PC clusters based on parallel I/O system. Grants-in-Aid for Scientific Research. Research period: 2002 - 2003
Research on Parallel Fast Fourier Transform Algorithms on PC Clusters. Grant-in-Aid for Young Scientists (B). Research period: 2002 - 2003
Research on GRID Applications and Parallel Programming Systems in Computational Physics. Grants-in-Aid for Scientific Research. Research period: 2002 - 2002
Research on Fast Fourier Transform Algorithms on Parallel Computers. Grant-in-Aid for Encouragement of Young Scientists (A). Research period: 2000 - 2001
Research on Fast Computation Methods for High-Precision Mathematical Constants on Parallel Computers. Grant-in-Aid for Encouragement of Young Scientists (A). Research period: Apr 1999 - Mar 2000
Study on High Performance Computing
|
Name (Japanese) | 額田 彰 |
| Name | NUKADA Akira | |
| Faculty | ||
| Section | ||
| Position | Professor | |
| Theme | High Performance Computing, Performance Optimization, GPU Computing | |
| Related Links | ||
nukada ccs.tsukuba.ac.jp |
Research Interests
Academic & Professional Experience
Apr 2020-Present University of Tsukuba Center for Computational Sciences
Apr 2018-Mar 2020 Tokyo Institute of Technology Global Scientific Information and Computing Center
Apr 2013-Mar 2018 Tokyo Institute of Technology Global Scientific Information and Computing Center
Nov 2007-Mar 2013 Tokyo Institute of Technology Global Scientific Information and Computing Center
Apr 2004-Oct 2007 Japan Science and Technology Agency
Published Papers
Journal of Information Processing 34 Feb 2026SCA/HPCAsiaWS '26: Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region Workshops Jan 2026[Refereed]次世代HPC・AI研究開発支援センターと国内におけるGPUプログラム開発支援研究報告ハイパフォーマンスコンピューティング(HPC) 2025-HPC-202 (9) Dec 2025不揮発性メモリを用いたVlasovシミュレーションの大規模化情報処理学会研究報告 2025-HPC-201 (7) Sep 2025Internatonal Conference HPC Asia 2025 Jun 2025[Refereed]GPU演算加速による一般相対論的輻射磁気流体シミュレーションコードの性能評価情報処理学会研究報告 2025-HPC-198 (60) Mar 20252024 IEEE International Conference on Cluster Computing (CLUSTER) Sep 2024[Refereed]GH200の予備性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2024-HPC-195 (4) Aug 2024Pegasusビッグメモリスーパコンピュータの性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2023-HPC-190 (7) Jul 2023PARALLEL COMPUTING 116 (103018) Jul 2023[Refereed]DO CONCURRENT 構文による OpenSWPC の GPU 化研究報告ハイパフォーマンスコンピューティング(HPC) 2023-HPC-188 (19) Mar 2023The International Journal of High Performance Computing Applications 36 (3) May 2022[Refereed]GPGPU '22: Proceedings of the 14th Workshop on General Purpose Processing Using GPU (2) Apr 2022[Refereed]遊休GPUを利用したホスト・デバイス間通信の高速化研究報告ハイパフォーマンスコンピューティング(HPC) 2022-HPC-183 (4) Mar 20222021 IEEE International Conference on Big Data (Big Data) Dec 2021[Refereed]TSUBAME3.0におけるストレージ利用効率化のためのファイルシステムベンチマーク情報処理学会研究報告 2019-HPC-170 (24) Jul 201919th Annual IEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing (CCGrid 2019) May 2019[Refereed]小疎行列積計算のGPU最適化情報処理学会研究報告 2019-HPC-168 (19) Mar 2019GraphCNN向けの疎行列積計算Batch最適化情報処理学会研究報告 2018-HPC-167 (7) Dec 2018Parallel Computing 77 Sep 2018[Refereed]PASC 2018: Platform for Advanced Scientific Computing Conference Jul 2018[Refereed]18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2018) May 2018[Refereed]32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2018) May 2018[Refereed]Overview of TSUBAME3.0, Green Cloud Supercomputer for Convergence of HPC, AI and Big-DataTsubame ESJ. 
: e-science journal 16 Nov 2017Proceedings of the International Conference on Parallel Processing Sep 2017[Refereed]High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPUProceedings of the International Conference on Parallel Processing Sep 2017[Refereed]Procedia Computer Science 80 2016[Refereed]疎行列ベクトル積計算を対象としたGPU向けメモリアクセス削減手法情報処理学会研究報告 2015-HPC-151 (8) Sep 2015EURO-PAR 2015: PARALLEL PROCESSING 9233 2015[Refereed]ACM International Conference Proceeding Series 09-12- Sep 2014[Refereed]超省エネスーパーコンピューター TSUBAMEペトロテック 37 (8) Aug 2014GPU間マイグレーションによる効率的な並列実行情報処理学会研究報告 2014-HPC-145 (42) Jul 2014TSUBAME-KFC : the Greenest Supercomputer in the World With Liquid Submersion CoolingTsubame ESJ. : e-science journal 11 Jun 2014GPUのキャッシュを考慮した疎行列ベクトル積計算手法の性能評価情報処理学会研究報告 2014-HPC-144 (5) May 2014Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS 2015- 2014[Refereed]Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS 2015- 2014[Refereed]TSUBAME-KFC: 液浸冷却を用いたウルトラグリーンスパコン研究設備情報処理学会研究報告 2013-ARC-199/HPC-142 Dec 2013APU上の混合精度AMG法IPSJ SIG Notes 2013 (13) Sep 2013ウルトラグリーンスパコンTSUBAME2.5/TSUBAME-KFC大学ICT推進協議会年次大会論文集 2013SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis Nov 2012[Refereed]GPU スパコンTSUBAME 2.0 によるフェーズフィールド法を用いた2 petaflops樹枝状凝固成長計算第17回計算工学講演会論文集 17 May 2012ACM International Conference Proceeding Series 2012[Refereed]Operation of TSUBAME 2.0 Green Supercomputer dealing with Power Crisis研究報告ハイパフォーマンスコンピューティング(HPC) 2011 (12) Nov 2011Achievement of Linpack Performance of over 1PFlops on TSUBAME 2.0 Supercomputer情報処理学会論文誌コンピューティングシステム(ACS) 4 (4) Oct 2011[Refereed]Achievement of Linpack Performance of over 1PFlops on TSUBAME 2.0 Supercomputer先進的計算基盤システムシンポジウム論文集 (2011) May 2011[Refereed]Fast fourier transform using GPUTsubame ESJ. 
3 Feb 2011Proceedings of 2011 SC - International Conference for High Performance Computing, Networking, Storage and Analysis 2011[Refereed]PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011 2011[Refereed]IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum 2011[Refereed]Efficient PageRank on GPU Clusters情報処理学会研究報告 2010-HPC-128 (21) Dec 2010Performance Evaluation of TSUBAME 2.0 Heterogeneous Supercomputer with Linpack Benchmark情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2010-HPC-128 (5) Dec 2010Optimization of electric power efficiecy based on model in GPU情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2010-HPC-128 (5) Dec 2010CUDAによる高速フーリエ変換応用数理 20 (2) Jun 2010異種アクセラレータを持つTSUBAMEスーパーコンピュータのLinpack評価応用数理 20 (2) Jun 2010Power-Aware Task Scheduling on GPU Accelerated Clusters情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 124 Feb 2010Bulletin of the Japan Society for Industrial and Applied Mathematics 20 (2) 2010Bulletin of the Japan Society for Industrial and Applied Mathematics 20 (2) 201017th International Conference on High Performance Computing, HiPC 2010 2010[Refereed]2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2010 2010[Refereed]2010 International Conference on Green Computing, Green Comp 2010 2010[Refereed]Computer Science - Research and Development 25 (1-2) 2010[Refereed]Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010 2010[Refereed]Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010 2010[Refereed]CG on GPU-enhanced Clusters情報処理学会研究報告 2009-HPC-123 Dec 2009Software Framework for GPU Memory Errors情報処理学会研究報告. 計算機アーキテクチャ研究会報告 186 Nov 2009SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis Nov 2009[Refereed]Linpack Tuning Method on a Heterogeneous Supercomputer with Hybrid AcceleratorsProc. 
Summer United Workshops on Parallel, Distributed and Cooperative Processing, SWoPP2009, Sendai, Aug. 2009-HPC-121 (3) Oct 2009CUDA GPU向けの自動最適化FFTライブラリ情報処理学会論文誌コンピューティングシステム(ACS) 2 (3) Sep 2009[Refereed]GPUにおける性能と消費電力の相関性の解析情報処理学会研究報告 2009-HPC-121 Jul 2009GPUにおける耐故障性を考慮した数値計算の電力性能情報処理学会研究報告 2009-HPC-121 Jul 2009Acceleration of Himeno Benchmark on Multi-node GPU System by Overlapping Communication with Calculation : Over 700 GFLOPS of Sustained Performance is Achieved with 32 GPUs情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 120 (3) Jun 2009An Efficient Conjugate Gradient Solver on Double Precision Multi-GPU Systems先進的計算基盤システムシンポジウムSACSIS2009論文集 May 2009[Refereed]CUDA GPU向けの自動最適化FFTライブラリ先進的計算基盤システムシンポジウムSACSIS2009論文集 May 2009[Refereed]Linpack Tuning on a Heterogeneous Supercomputer with Four Types of ProcessorsIPSJ SIG Notes 182 (14) Feb 2009Performance Evaluation of Software-Based ECC for GPUsIPSJ SIG Notes 2009 2009COMPUTATIONAL SCIENCE - ICCS 2009, PART I 5544 2009[Refereed]SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing Nov 2008[Refereed]High Performance 3-D FFT in CUDA Environment情報処理学会論文誌コンピューティングシステム(ACS) 1 (2) Aug 2008[Refereed]ソフトウェアECCによるGPUメモリの耐故障性の実現と評価IEICE technical report 108 (181) Aug 2008Lecture Notes in Computer Science 4967(PPAM2007) May 2008[Refereed]High Performance FFT on SGI Altix 3700Proc. 
3rd International Conference on High Performance Computing and Communications (HPCC 2007), Lecture Notes in Computer Science 4782 (4782) Sep 2007[Refereed]Proceedings of the Second international Workshop on Automatic Performance Tuning (iWAPT 2007) Sep 2007[Refereed]Distributed SILC: An easy-to-use interface for MPI-based parallel matrix computation librariesLecture Notes in Computer Science 4699 (PARA06) 4699 Jan 2007[Refereed]PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS 4742 2007[Refereed]A Performance Evaluation Model for the SILC Matrix Computation FrameworkProceeding of the IFIP International Conference on Network and Parallel Computing (NPC2006) Oct 2006[Refereed]SILC: A Flexible and Environment Independent Interface to Matrix Computation LibrariesLecture Notes in Computer Science 3911 (PPAM2005) 3911 Sep 2006[Refereed]Implementation of the Matrix Computation Library Interface SILC in Distributed Parallel EnvironmentsIPSJ SIG Notes 2006 (87) Jul 20062006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings III May 2006[Refereed]分散型 SILC の設計: MPI ベースの行列計算ライブラリを使いやすくするインタフェース2006年ハイパフォーマンスコンピューティングと計算科学シンポジウム HPCS2006 ポスター論文集 Jan 2006LAPACK in SILC: Use of a Flexible Application Framework for Matrix Computation LibrariesProceedings on the Eighth International Conference on High-Performance Computing in Asia-Pacific Region (HPC Asia 2005) Dec 2005[Refereed]共有メモリ並列環境における SILC の実現と利用第 34 回数値解析シンポジウム講演予稿集 Jun 2005SILC: 行列計算ライブラリの利用を簡単化するフレームワーク第10回計算工学講演会講演論文集 10 (2) Jun 2005行列計算ライブラリに対する計算環境に依存しないインタフェースの開発2005年ハイパフォーマンスコンピューティングと計算科学シンポジウム HPCS2005 ポスター論文集 Jan 2005Computing Environment Independent Interface for Matrix Computation LibraryIPSJ SIG Notes 2004 (128) Dec 2004Parallel Implementation of FFT Algorithms on Distributed Shared Memory Architecture and Its Optimization情報処理学会論文誌コンピューティングシステム(ACS) 44 (6) May 2003[Refereed]Performance Evaluation of Commodity Distributed Shared Memory IBM x440IPSJ SIG Notes 93 
Mar 2003
Fine Grain Parallel Implementation of Sparse Matrix Algorithms and its Optimization. IPSJ SIG Notes 91 (80) Aug 2002
Awards & Honors
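Jul 2012 Japan Society for Computational Engineering and Science, Best Paper Award, 17th Computational Engineering Conference
Nov 2011 ACM, ACM Gordon Bell Prize: Special Achievements in Scalability and Time-to-Solution
Dec 2010 IEEE Computer Society Japan Chapter, IEEE Computer Society Japan Chapter Young Author Award 2010
Mar 2010 Information Processing Society of Japan, FY2009 Yamashita SIG Research Award
May 2009 Information Processing Society of Japan, Best Paper Award, 7th Symposium on Advanced Computing Systems and Infrastructures (SACSIS2009)
Jun 2008 IEEE Computer Society Japan Chapter, Outstanding Young Researcher Award, 6th Symposium on Advanced Computing Systems and Infrastructures (SACSIS2008)
Books etc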
はじめてのCUDAプログラミング [Introduction to CUDA Programming] (Role: Sole author)
Research Grants & Projects
Establishing System-Level Checkpointing Technology for GPU Applications. Grants-in-Aid for Scientific Research (Early-Career Scientists). Research period: Apr 2020 - Mar 2024
3-D Shape Measurement of Perfectly Specular Objects Using Coded Ambient Light. Grants-in-Aid for Scientific Research. Research period: Apr 2013 - Mar 2015
Development of novel GPU programming techniques. Grants-in-Aid for Scientific Research. Research period: Apr 2011 - Mar 2013
Acceleration of algebraic multigrid solver on APU. Grants-in-Aid for Scientific Research. Research period: 2013 - 2013
Auto-tuning FFT using GPU. Grants-in-Aid for Scientific Research. Research period: Apr 2010 - Mar 2012
|
Name (Japanese) | 辻 美和子 |
| Name | TSUJI Miwako | |
| Faculty | ||
| Section | ||
| Position | Professor | |
| Theme | Quantum-HPC Hybrid, Performance Model, Programming Model | |
| Related Links | ||
tsuji ccs.tsukuba.ac.jp |
Research Interests
Academic & Professional Experience
Published Papers
The Role of Quantum Computing in Advancing Scientific High-Performance Computing: A perspective from the ADAC InstituteFuture Generation Computer Systems Mar 2026[Refereed]AMD MI300A APUにおける共有メモリシステムの性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2025-HPC-203 (2) Mar 2026SCA/HPCAsiaWS '26: Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region Workshops Jan 2026[Refereed]量子HPC連携プラットフォームに向けたソフトウェアとプログラミング環境第255回システム・アーキテクチャ・第202回ハイパフォーマンスコンピューティング合同研究発表会 2025-HPC-202 (14) Dec 2025Proceedings of the SC '25 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis Nov 2025[Refereed]Massively Parallel CMA-ES With Increasing PopulationConcurrency and Computation: Practice and Experience Nov 2025[Refereed]INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS Oct 2025[Refereed]2025 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops) Sep 2025[Refereed]HPCQCMark: a new modular HPC-QC benchmarking framework2025 IEEE International Conference on Quantum Computing and Engineering (QCE) Sep 2025[Refereed]Visualizing the Effects of Quantum Circuits in a Hybrid Classical-Quantum Machine Learning Algorithm第15回量子ソフトウェア研究発表会 2025-QS-15 (6) Jun 2025NVIDIA GH200におけるSystem-Allocated Memoryの性能評価情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2025-HPC-199 (7) May 2025Intel Spin-qubit Quantum Simulator Performance Evaluation on Supercomputer Fugaku第198回ハイパフォーマンスコンピューティング・第14回量子ソフトウェア合同研究発表会 hpc198qs14 Mar 2025量子HPC連携プラットフォームに向けた環境構築と課題第198回ハイパフォーマンスコンピューティング・第14回量子ソフトウェア合同研究発表会/hpc198qs14 Mar 2025JOURNAL OF SUPERCOMPUTING 80 (14) Sep 2024[Refereed]Computational Science -- ICCS 2024 Jun 2024[Refereed]Journal of Parallel and Distributed Computing 191 May 2024[Refereed]Physical Review A 109 May 2024[Refereed]Advancements in Traffic Simulations with multiMATSim’s Distributed Framewok16th International Conference on Agents and Artificial Intelligence (ICAART) Feb 
2024[Refereed]HPCAsia '24 Workshops: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops Jan 2024[Refereed]OpenACC単一記述によるGPU+FPGA複合デバイス処理システム情報処理学会論文誌コンピューティングシステム(ACS) 16 (2) Nov 2023[Refereed]2023 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops) Oct 2023[Refereed]ISC High Performance 2023: High Performance Computing 13999 Aug 2023[Refereed]OpenACC Execution Models for Manycore Processor with ARMInternational Workshop on Arm-based HPC: Practice and Experience (IWAHPCE-2023)/International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia2023) Feb 2023[Refereed]COMPUTER PHYSICS COMMUNICATIONS 282 Jan 2023[Refereed]Design and Performance Evaluation of UCX for Tofu-D Interconnect with OpenSHMEM-UCX on Fugaku2022 IEEE/ACM Parallel Applications Workshop: Alternatives To MPI+X (PAW-ATM) Nov 2022[Refereed]2022 IEEE International Conference on Cluster Computing (CLUSTER) Workshop on EAHPC-2022 - Embracing Arm: a journey of porting and optimization to the latest Arm-based processors Sep 2022[Refereed]2020 IEEE International Conference on Cluster Computing (CLUSTER) Workshop on EAHPC-2020 - Embracing Arm: a journey of porting and optimization to the latest Arm-based processors Sep 2022[Refereed]Computational Science – ICCS 2022 Jun 2022[Refereed]OpenACCによる宇宙物理シミュレーションのGPU+FPGA協調計算の実装研究報告ハイパフォーマンスコンピューティング(HPC) 2022-HPC-183 (11) Mar 2022IEEE Micro 42 (42) Mar 2022[Refereed]HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops Jan 2022[Refereed]International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia2022) Jan 2022[Refereed]A64FXに向けたNeK CFD solverにおけるaxhelmカーネルの最適化と評価情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2021-HPC-182 (4) Nov 2021通信ライブラリUCXのTofu-D対応の検討情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2021-HPC-181 (8) Sep 2021EEE International Conference on Cluster Computing (CLUSTER) EAHPC-2021 - 
Embracing ARM: a journey of porting and optimization to the latest ARM-based processors Sep 2021[Refereed]2021 IEEE International Conference on Cluster Computing (CLUSTER) EAHPC-2021 - Embracing ARM: a journey of porting and optimization to the latest ARM-based processors Sep 2021[Refereed]The Journal of Supercomputing Jul 2021[Refereed]Platform for Advanced Scientific Computing Conference (PASC) Jul 2021IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS) Apr 2021[Refereed]次世代HPCシステムのためのプロセッサアーキテクチャ評価環境と電力性能予測情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2021-HPC-178 (10) Mar 2021A64FXプロセッサにおけるFiberミニアプリスイートの性能評価情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2021-HPC-178 (5) Mar 2021JOURNAL OF COMPUTATIONAL SCIENCE 49 Feb 2021[Refereed]低レイテンシuTofuインターフェースを用いた格子QCD計算における通信の高速化情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-177 (22) Dec 20202020 SC20: International Conference for High Performance Computing, Networking, Storage, and Analysis (SC20) Nov 2020[Refereed]HPCベンチマークプログラムによるA64FXプロセッサ試作機の性能評価情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-173 (6) Mar 2020"将来システムのコデザインのためのCPUシミュレータによるMPIリプレイ環境および性能推定手法の検討情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-173 (5) Mar 2020ThunderX2 ArmプロセッサにおけるFiberミニアプリスイートの性能評価情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-171 (4) Sep 2019MYX:マルチSPMDプログラミングモデルにおける実行時正当性チェック情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-168 (6) Feb 2019Distributed and Parallel Programming Paradigms on the K computer and a ClusterInternational Conference on High Performance Computing in Asia-Pacific Region (HPCAsia2019) Jan 2019[Refereed]International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia2019) Jan 2019[Refereed]Fourth International IEEE Workshop on Extreme Scale Programming Models and Middleware (ESPM2) Nov 2018[Refereed]Preliminary Performance Evaluation of Application Kernels Using ARM SVE with Multiple Vector LengthsIEEE International Conference on Cluster Computing (CLUSTER) Workshop Re-Emergence of Vector Architectures Workshop (Rev-A) Sep 
2017[Refereed]IEEE International Conference on Cluster Computing (CLUSTER) Workshop on Representative Applications (WRAp) Sep 2017[Refereed]First International Workshop on Extreme Scale Programming Models and Middlewar Nov 2015[Refereed]マルチSPMDプログラミング開発実行環境における耐故 障性実現に向けたワークフロースケジューリングの検討第148回ハイパフォーマンスコンピューティング研究発表会 2015-HPC-148 (23) Mar 2015K-scale applications on the K computer and co-design effort for the design and development of post-KInternational Conference on Parallel Computing (ParCo) 2015 Advances in Parallel Computing, Parallel Computing: On the Road to Exascale 27 2015[Refereed]マルチSPMD環境における耐故障性実現に向けた OmniRPC-MPI の拡張情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2014-HPC-146 (8) Oct 2014INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 28 (3:::SI) Aug 2014[Refereed]International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC) Oct 2013[Refereed]Multi level programming Paradigm for Extreme ComputingJoint International Conference on Supercomputing in Nuclear Applications + Monte Carlo 2013[Refereed]WCCI 2012 IEEE World Congress on Computational Intelligence Jun 2012[Refereed]International Conference for High Performance Computing, Networking, Storage, and Analysis (SC11) Nov 2011[Refereed]Journal of Algorithms and Computational Technology 5 (2) Jun 2011[Refereed]大規模SMPクラスタにおけるOpenMP/MPIハイブリッドプログラムの性能評価SACSIS2009 - 先進的計算基盤システムシンポジウム Sep 2009[Refereed]大規模SMPクラスタにおけるOpenMP/MPIハイブリッドNPB,RSDFTの評価情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2009-HPC-119 Feb 2009Performance evaluation of OpenMP and MPI hybrid programs on a large scale multi-core multi-socket cluster T2K Open Supercomputer",International Conference on Parallel Processing Workshops 2009[Refereed]Genetic and Evolutionary Computation (GECCO 2008) 2008[Refereed]リンケージ同定とコンテクスト依存交叉を用いた遺伝的アルゴリズムの並列化IPSJ SIG Technical Reports 2007 (128(MPS-67 BIO-11)) Dec 2007複雑なビルディングブロック重複を持つ問題に対する交叉手法の提案情報処理学会論文誌数理モデル化と応用(TOM) 48 (15) Oct 2007[Refereed]A Network Design Problem by a GA with Linkage 
Identification and Recombination for Overlapping Building BlocksIEEE Congress on Evolutionary Computation (CEC2007) 2007[Refereed]EVOLUTIONARY COMPUTATION 4 (14) 2006[Refereed]Theoretical and Empirical Investigations on Difficulty in Structure Learning by Estimation of Distribution AlgorithmsIEEE Conference on Systems Man and Cybernetics (SMC 2006) 2006Genetic and Evolutionary Computation (GECCO 2006) 2006[Refereed]ビルディングブロック重複のある問題に対するD^5-GAの適用IPSJ SIG Technical Reports Sep 2005IEEE Congress on Evolutionary Computation (CEC2005) 2005[Refereed]The eighth Foundations of Genetic Algorithms Conference (FOGA 2005) 3469 2005[Refereed]Linkage Identification for Problems with Hierarchical Structure情報処理学会論文誌数理モデル化と応用(TOM) 45 (10) 2004[Refereed]Genetic and Evolutionary Computation (GECCO 2004) 2004[Refereed]適応度差分により分類された個体の分布に基づくGAの遺伝子座依存関係モデルの構築IPSJ SIG Technical Reports Dec 2003Metropolitan Area Network Design Using GA Based on Hierarchical Linkage IdentificationGenetic and Evolutionary Computation Conference (GECCO 2003) 2724 2003[Refereed]階層型のリンケージを考慮した遺伝的アルゴリズムによる都市圏ネットワーク設計IPSJ SIG Technical Reports Nov 2002リンケージ同定を導入した遺伝的アルゴリズム による都市圏ネットワークの設計情報処理学会論文誌数理モデル化と応用(TOM) 43 (6) 2002[Refereed]Metropolitan Area Network Design Using GA Based on Linkage Identification with Epistasis Measures4th Asia-Pacific Conference on Simulated Evolution and Learning 2002[Refereed]Awards & Honors
Nov 2011 Association for Computing Machinery (ACM), Gordon Bell Prize: First-principles calculations of electron states of a silicon nanowire with 100,000 atoms on the K computer
Nov 2011 Association for Computing Machinery (ACM), ACM Gordon Bell Prize
Books etc
XcalableMP PGAS Programming Language (Multi-SPMD Programming Model with YML and XcalableMP)
Software for Exascale Computing SPPEXA 2016-2019 (MYX: Runtime Correctness Analysis for Multi-Level Parallel Programming Paradigms)
Linkage in Evolutionary Computation (A Network Design Problem by a GA with Linkage Identification and Recombination for Overlapping Building Blocks)
Computational Intelligence Paradigms - Innovative Applications (Linkage Analysis in Genetic Algorithms)
Research Grants & Projects
|
Name (Japanese) | 多田野 寛人 |
| Name | TADANO Hiroto | |
| Faculty | ||
| Section | ||
| Position | Associate Professor | |
| Theme | Numerical analysis: Numerical algorithms for large scale linear systems. Parallel computing for eigenvalue problems. | |
| Related Links | ||
tadano cs.tsukuba.ac.jp |
Research Interests
Academic & Professional Experience
Dec 2024-Present University of Tsukuba Center for Computational Sciences Associate Professor
Apr 2016-Nov 2024 University of Tsukuba Center for Computational Sciences Assistant Professor
Oct 2011-Mar 2016 University of Tsukuba Faculty of Engineering, Information and Systems Assistant Professor
Mar 2008-Sep 2011 Graduate School of Systems and Information Engineering University of Tsukuba Assistant Professor
Apr 2007-Feb 2008 Kyoto University Graduate School of Informatics JSPS Research Associate
Apr 2006-Mar 2007 Japan Science and Technology Agency Researcher
Published Papers
2024 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops) Sep 2024
Performance evaluation of a hierarchical parallel solver with iterative refinement for saddle point problems in a parallel computing environment. Proc. of The 43rd JSST Annual International Conference on Simulation Technology (JSST2024) Sep 2024 [Refereed]
Preliminary Performance Evaluation of GH200. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2024-HPC-195 (4) Aug 2024
Journal of Advanced Simulation in Science and Engineering 11 (1) Mar 2024 [Refereed]
Transaction of the Japan Society for Simulation Technology 15 (2) Sep 2023 [Refereed]
Parallel calculations of the extremely large number of MPI processes in Fugaku. Proc. of The 42nd JSST Annual International Conference on Simulation Technology (JSST2023) Aug 2023 [Refereed]
Development and performance evaluation of the Block GPBiCGrQ method with variable grouping strategy. Proc. of The 42nd JSST Annual International Conference on Simulation Technology (JSST2023) Aug 2023 [Refereed]
Implementation and performance evaluation of a hierarchical parallel solver for saddle point problems on a GPU cluster. Journal of Advanced Simulation in Science and Engineering 10 (1) Apr 2023 [Refereed]
JSIAM Letters 14 Aug 2022 [Refereed]
Implementation of a hierarchical parallel solver for saddle point problems on a GPU cluster. Proc. of The 41st JSST Annual International Conference on Simulation Technology (JSST2022) Aug 2022 [Refereed]
JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN 91 (7) Jul 2022 [Refereed]
Proc. of IEEE 19th Biennial Conference on Electromagnetic Field Computation (CEFC 2020) Jun 2021 [Refereed]
Journal of Advanced Simulation in Science and Engineering 8 (1) Apr 2021 [Refereed]
A Numerical Study on the Acceleration of Solution of Saddle Point Problems by Using Block Krylov Subspace Methods. Proc. of 19th Biennial IEEE Conference on Electromagnetic Field Computation Nov 2020 [Refereed]
Parallelized GPU Code of City-Level Large Eddy Simulation. Proceedings of Int. Symposium on Parallel and Distributed Computing (ISPDC) 2020 Jul 2020 [Refereed]
Transactions of the Japan Society for Industrial and Applied Mathematics 30 (4) 2020
Optimization and Performance Evaluation of a Parallel GPU Implementation of the Urban Meteorology Code City-LES. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2019-HPC-170 (5) Jul 2019
JAPAN JOURNAL OF INDUSTRIAL AND APPLIED MATHEMATICS 36 (2) Jul 2019 [Refereed]
Journal of Advanced Simulation in Science and Engineering 6 (1) Mar 2019 [Refereed]
Lecture Notes in Computational Science and Engineering 117 2017 [Refereed]
Speeding up Large Eddy Simulation by Multigrid preconditioned Krylov subspace methods with mixed precision. JSST 2016 The 35th JSST Annual Conference International Conference on Simulation Technology Oct 2016
Transactions of the Japan Society for Industrial and Applied Mathematics 26 (3) 2016 [Refereed]
Improving the convergence behaviour of BiCGSTAB by applying D-norm minimization. JSIAM Letters 7 May 2015 [Refereed]
Journal of Computational Chemistry 35 (18) Jul 2014 [Refereed]
JSIAM Letters 6 2014 [Refereed]
Journal of Algorithms and Computational Technology 7 (3) Sep 2013 [Refereed]
Transactions of the Japan Society for Industrial and Applied Mathematics 23 (3) 2013
JSIAM Letters 4 Aug 2012 [Refereed]
COMPUTER PHYSICS COMMUNICATIONS 183 (1) Jan 2012 [Refereed]
Oyo Suri (Bulletin of the Japan Society for Industrial and Applied Mathematics) 21 (4) Dec 2011
Evaluation of Preconditioners for Linear Systems Arising in Local Weather Simulation. Proceedings of the JSIAM 2011 Annual Meeting Sep 2011
Large-Scale Computational Science on HA-PACS, a Massively Parallel Cluster Based on Accelerators. IPSJ SIG Notes 2011 (21) Jul 2011
JSIAM Letters 3 2011
A convergence improvement of the BSAIC preconditioner by deflation. JSIAM Letters 3 Jan 2011 [Refereed]
JOURNAL OF COMPUTATIONAL CHEMISTRY 31 (13) Oct 2010 [Refereed]
A Stochastic Estimation Method for Eigenvalue Distribution. Proceedings of the JSIAM Annual Meeting 2010 Sep 2010
A Stochastic Estimation Method for the Eigenvalue Distribution of Matrices Using Independent Parallel Computation. IPSJ SIG Notes 2010 (35) Jul 2010
JAPAN JOURNAL OF INDUSTRIAL AND APPLIED MATHEMATICS 27 (1) Jun 2010 [Refereed]
A PARALLEL EIGENSOLVER USING CONTOUR INTEGRATION FOR GENERALIZED EIGENVALUE PROBLEMS IN MOLECULAR SIMULATION. TAIWANESE JOURNAL OF MATHEMATICS 14 (3A) Jun 2010 [Refereed]
COMPUTER PHYSICS COMMUNICATIONS 181 (5) May 2010 [Refereed]
JSIAM Letters 2 2010
JSIAM Letters 2 2010
Parallel Eigensolver for Large Scale Non-linear Systems. NUMERICAL ANALYSIS AND APPLIED MATHEMATICS, VOLS I-III 1281 2010 [Refereed]
Parallel stochastic estimation method of eigenvalue distribution. JSIAM Letters 2 Jan 2010 [Refereed]
A quadrature-based eigensolver with a Krylov subspace method for shifted linear systems for Hermitian eigenproblems in lattice QCD. JSIAM Letters 2 Jan 2010 [Refereed]
A block sparse approximate inverse with cutoff preconditioner for semi-sparse linear systems derived from Molecular Orbital calculations. JSIAM Letters 2 Jan 2010 [Refereed]
Performance of a Contour Integral Based Eigensolver with a Complete Sparse Factorization Preconditioner on Multi-Core Clusters. Lecture Notes in Computer Science Jan 2010 [Refereed]
COMPUTER PHYSICS COMMUNICATIONS 181 (1) Jan 2010 [Refereed]
Error analysis for a matrix pencil of Hankel matrices with perturbed complex moments. JSIAM Letters 1 Dec 2009 [Refereed]
Application and Performance Evaluation of the Volumetric Parallel 3D-FFT to 3D-RISM on Massively Parallel Cluster. IPSJ SIG Notes 2009 (3) Oct 2009
High-Performance Parallel Algorithm for Electronic Structure Calculations Using Band Localization. Proceedings of the JSIAM Annual Meeting 2009 Sep 2009
A numerical method for nonlinear eigenvalue problems using contour integrals. JSIAM Letters 1 Aug 2009 [Refereed]
A Block Krylov Subspace Method for the Contour Integral Method and Its Application to Molecular Orbital Computations. IPSJ Transactions on Advanced Computing Systems 2 (2) Jul 2009 [Refereed]
A method for nonlinear eigenvalue problems based on contour integration. RIMS Kokyuroku 1638 Apr 2009
JSIAM Letters 1 2009
JSIAM Letters 1 2009
JSIAM Letters 1 2009
A Method for Finding Zeros of Polynomial Equations using a Contour Integral Based Eigensolver. SNC'09: PROCEEDINGS OF THE 2009 INTERNATIONAL WORKSHOP ON SYMBOLIC-NUMERIC COMPUTATION 2009 [Refereed]
Block BiCGGR: a new Block Krylov subspace method for computing high accuracy solutions. JSIAM Letters 1 Jan 2009 [Refereed]
A performance evaluation of the preconditioning using double Cutoff. Transactions of the Japan Society for Industrial and Applied Mathematics 18 (4) Dec 2008 [Refereed]
On a Method for Transforming Systems of Algebraic Equations into Nonlinear Eigenvalue Problems without Gröbner Bases and a Solution Method for Nonlinear Eigenvalue Problems. Bulletin of the Japan Society for Symbolic and Algebraic Computation 15 (2) Dec 2008
Implementation and Performance Evaluation of Sparse Matrix Vector Multiplication for Mixed Precision Krylov Method on the Cell BE. IPSJ Transactions on Advanced Computing Systems 1 (1) Jun 2008 [Refereed]
On single precision preconditioners for Krylov subspace iterative methods. LARGE-SCALE SCIENTIFIC COMPUTING 4818 (4818) 2008 [Refereed]
A method for estimating a distribution of eigenvalues using the AMLS method. Transactions of the Japan Society for Industrial and Applied Mathematics 17 (4) Dec 2007 [Refereed]
Modified Multiple Explicitly Restarted Arnoldi Method with Hybrid GridRPC/MPI Implementation. IPSJ Transactions on Advanced Computing Systems 48 (8) May 2007 [Refereed]
Hokkaido Mathematical Journal 36 (4) 2007 [Refereed]
A master-worker type eigensolver for molecular orbital computations. APPLIED PARALLEL COMPUTING 4699 (4699) 2007 [Refereed]
On an evaluation method of preconditioners for complex symmetric systems of linear equations. Transactions of the Japan Society for Industrial and Applied Mathematics 16 (4) Dec 2006 [Refereed]
A parallel method for large scale eigenvalue problems in a Grid environment. The Computational Mechanics Conference 2005 (18) Nov 2005
A Stabilization of the CGS Method by Avoiding Near-Breakdown. Proceedings of International Conference of Numerical Analysis and Applied Mathematics 2005 (ICNAAM 2005) Sep 2005 [Refereed]
Transactions of the Japan Society for Industrial and Applied Mathematics 15 (2) Jun 2005 [Refereed]
Transactions of the Japan Society for Industrial and Applied Mathematics 14 (3) Sep 2004 [Refereed]
A method for avoiding breakdown in product-type iterative methods and its behavior for Toeplitz linear systems. ICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004 2 (2) 2004 [Refereed]
A moment-based method for large scale eigenvalue problems. ICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004 1 (3) 2004 [Refereed]
Awards & Honors
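Dec 2023 The 42nd JSST Annual International Conference on Simulation Technology (JSST2023) Outstanding Presentation Award
Apr 2011 Japan Society for Industrial and Applied Mathematics (JSIAM) 7th Outstanding Young Presenter Award
Jan 2008 Information Processing Society of Japan (IPSJ) Best Paper Award, 2008 High Performance Computing and Computational Science Symposium (HPCS2008)
Books etc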
Mathematics of Numerical Linear Algebra and HPC (Role: Contributor, 1.2 Iterative Methods, 1.2.1 Stationary Iterative Methods, 1.2.2 Krylov Subspace Iterative Methods)
SNC'09: PROCEEDINGS OF THE 2009 INTERNATIONAL WORKSHOP ON SYMBOLIC-NUMERIC COMPUTATION (Role: Contributor, A Method for Finding Zeros of Polynomial Equations using a Contour Integral Based Eigensolver)
ICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004 (Role: Contributor, A moment-based method for large scale eigenvalue problems)
ICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004 (Role: Contributor, A method for avoiding breakdown in product-type iterative methods and its behavior for Toeplitz linear systems)
Research Grants & Projects
Development of a fast and high-accuracy hierarchical parallel solver for saddle point problems. Grant-in-Aid for Scientific Research (C). Research period: Apr 2023 - Mar 2026
Development of high accuracy and high performance algorithms for linear systems with multiple right-hand sides. Grants-in-Aid for Scientific Research. Research period: Apr 2015 - Mar 2017
Development of an advanced method for nonlinear eigenvalue problems and its applications. Grants-in-Aid for Scientific Research. Research period: Apr 2013 - Mar 2017
Numerical Computation Algorithms for Large-scale Parallel Environment. Grants-in-Aid for Scientific Research. Research period: Apr 2010 - Mar 2015
Development of fast and accurate methods for solving linear systems with multiple right-hand sides and their application to scientific computations. Grants-in-Aid for Scientific Research. Research period: 2010 - 2012
Development of Complete Meshless Scheme for Finite Node Method and Boundary Node Method and Technological Application. Grants-in-Aid for Scientific Research. Research period: 2010 - 2012
Interdisciplinary algorithms and computer simulations. Grants-in-Aid for Scientific Research. Research period: 2008 - 2012
Development and Application of a Method for Generalized Eigenvalue. Grants-in-Aid for Scientific Research. Research period: 2009 - 2011
A development of fast solvers for large-scale linear systems and its application to the parallel eigensolver. Grants-in-Aid for Scientific Research. Research period: 2008 - 2009
|
Name (Japanese) | 藤田 典久 |
| Name | FUJITA Norihisa | |
| Faculty | ||
| Section | ||
| Position | Assistant Professor | |
| Theme | Parallel processing, Interconnection network and Parallel application optimization using accelerators | |
| Related Links | ||
fujita hpcs.cs.tsukuba.ac.jp |
Research Interests
Academic & Professional Experience
Published Papers
Performance Evaluation of the Shared Memory System on the AMD MI300A APU. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-203 (2) Mar 2026
FS3.0: A Study on the HPCI Operational System Development Plan with a View to the Fugaku NEXT Era. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-203 (33) Mar 2026
SCA/HPCAsiaWS '26: Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region Workshops Jan 2026 [Refereed]
The Next-Generation HPC and AI Research and Development Support Center and Support for GPU Program Development in Japan. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-202 (9) Dec 2025
Scaling Up Vlasov Simulation Using Non-Volatile Memory. IPSJ SIG Technical Reports 2025-HPC-201 (7) Sep 2025
2025 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops) Sep 2025 [Refereed]
Accelerating Deep Learning Inference with a Parallel FPGA System. Proc. of The International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies 2025 (HEART 2025) May 2025 [Refereed]
Performance Comparison of an Urban Meteorology Code with Dry-Mist Effects on GH200 vs. Xeon+H100. IPSJ SIG Notes 2025-HPC-199 (7) May 2025
Performance Evaluation of System-Allocated Memory on NVIDIA GH200. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2025-HPC-199 (7) May 2025
2024 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops) Sep 2024
Chisel Implementation and Evaluation of an Adaptive Data Compression Hardware Platform. RECONF2024-50 124 (188) Sep 2024
Evaluation of a Multi-Task Mini-Benchmark in Diverse Environments and Performance Portability. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2024-HPC-195 (3) Aug 2024
Preliminary Performance Evaluation of GH200. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2024-HPC-195 (4) Aug 2024
A Proposal for FPGA-Based Regular Path Queries Focusing on Label Occurrence Frequency. The 16th Forum on Data Engineering and Information Management (DEIM2024) Feb 2024
e-Science 2024
CLUSTER Workshops 2024
CLUSTER Workshops 2024
Proceedings of International Workshop IXPUG 2024 (in International Conference HPC Asia 2024) 2023-HPC-192 (16) Jan 2024 [Refereed]
A GPU+FPGA Multi-Device Processing System with a Single OpenACC Description. IPSJ Transactions on Advanced Computing Systems (ACS) 16 (2) Nov 2023 [Refereed]
SC-W '23: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis Nov 2023 [Refereed]
ISC High Performance 2023: High Performance Computing 13999 Aug 2023 [Refereed]
A Proposal for a SYCL-Based Programming Method for Handling Multiple Accelerators in a Unified Manner. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2023-HPC-190 (1) Jul 2023
Evaluation of Training Accuracy and Execution Performance of Graph Neural Networks on the NVIDIA H100 GPU. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2023-HPC-190 (17) Jul 2023
Analysis of the Efficiency and Effectiveness of Scratchpad Memory on FPGAs and GPUs for Radiative Transfer Simulation. IEICE-RECONF2023-6 123 (71) Jun 2023
Euro-Par 2022: Parallel Processing Workshops 13835 May 2023 [Refereed]
Implementation of Kyokko, an Inter-FPGA Serial Communication Controller, on Intel FPGAs for HPC Use. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2023-HPC-189 (4) May 2023
PDCAT 2022: Parallel and Distributed Computing, Applications and Technologies 13798 Apr 2023 [Refereed]
A Proposal for Graph Breadth-First Search on Multiple FPGAs Using the Inter-FPGA Communication Framework CIRCUS. The 15th Forum on Data Engineering and Information Management (DEIM 2023) Mar 2023
A Study on Describing Spatial Parallelism to Improve Computational Performance in FPGA High-Level Synthesis. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2023-HPC-188 (22) Mar 2023
HPC Asia '23 Workshops: Proceedings of the HPC Asia 2023 Workshops Feb 2023 [Refereed]
HPC Asia '23: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Feb 2023 [Refereed]
CLUSTER Workshops 2023
ICPP Workshops '22: Workshop Proceedings of the 51st International Conference on Parallel Processing (8) Jan 2023 [Refereed]
2022 IEEE International Conference on Big Data (Big Data) Dec 2022 [Refereed]
Implementation and Performance Evaluation of Collective Communication Using the CIRCUS Communication System in a Parallel FPGA Environment. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2022-HPC-187 (7) Nov 2022
Journal of Information Processing 30 Oct 2022 [Refereed]
Multi-Node Parallelization of the Astrophysical Radiative Transfer Code ARGOT on a GPU-FPGA Hybrid Accelerated Cluster. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2022-HPC-185 (1) Jul 2022
An Investigation of HBM Behavior Caused by Data-Space Partitioning under Parallelization and the Resulting Changes in Access Patterns. IEICE-CPSY2022-15 IEICE-122 (133) Jul 2022
The Proceedings of the 12th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART 2022) Jun 2022 [Refereed]
Implementation and Evaluation of an Astrophysics Simulation with Cross-Node GPU-FPGA Hybrid Acceleration. IPSJ SIG Technical Reports: High Performance Computing (HPC) 2022-HPC-184 (6) May 2022
HPCAsia2022: International Conference on High Performance Computing in Asia-Pacific Region Jan 2022 [Refereed]
2021 International Conference on Field-Programmable Technology (ICFPT) Dec 2021 [Refereed]
Proceedings of 2021 IEEE International Conference on Cluster Computing (CLUSTER) Oct 2021 [Refereed]
Proceedings of the 11th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART '21). (10) Jun 2021 [Refereed]
HPC Asia 2021: The International Conference on High Performance Computing in Asia-Pacific Region Jan 2021 [Refereed]
2020 IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC) Dec 2020 [Refereed]
Performance Evaluation of Parallel FPGA System for OpenCL Programming. IPSJ Transactions on Advanced Computing Systems (ACS) 13 (3) Nov 2020 [Refereed]
OpenACC unified programming environment for GPU and FPGA multi-hybrid acceleration. 2020 Int. Conference on High Level Parallel Programming (HLPP 2020) Jul 2020 [Refereed]
Multi-Hybrid Accelerated Simulation by GPU and FPGA on Radiative Transfer Simulation in Astrophysics. Journal of Information Processing 28 2020 [Refereed]
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION WORKSHOPS (HPC ASIA 2020 WORKSHOPS) 2020 [Refereed]
2020 IEEE International Parallel and Distributed Processing Symposium Workshops 2020 [Refereed]
31st IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP) 2020 [Refereed]
Optimization on Astrophysical Radiative Transfer Code for FPGAs with OpenCL. IPSJ Transactions on Advanced Computing Systems (Web) 12 (3) Jul 2019 [Refereed]
IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2019, Rio de Janeiro, Brazil, May 20-24, 2019 2019 [Refereed]
IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2019, Rio de Janeiro, Brazil, May 20-24, 2019 2019 [Refereed]
Proceedings of the 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2019, Nagasaki, Japan, June 6-7, 2019. 2019 [Refereed]
IJHPCA 33 (5) 2019 [Refereed]
ACM International Conference Proceeding Series Jan 2018 [Refereed]
Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2018, Toronto, ON, Canada, June 20-22, 2018 2018 [Refereed]
Applying FPGAs to High-Performance Computing via High-Level Synthesis. Proceedings of the High Performance Computing and Computational Science Symposium May 2017 [Refereed]
Proceedings - 2016 International Conference on Computational Science and Computational Intelligence, CSCI 2016 Mar 2017 [Refereed]
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10150 2017 [Refereed]
Implementation and Evaluation of GPU-Enabled GASNet Using the Tightly Coupled Accelerators (TCA) Architecture. Proceedings of the 2016 High Performance Computing and Computational Science Symposium (HPCS2016), 2016 Jun 2016 [Refereed]
Applying TCA Architecture to QUDA QCD Library for GPUs. IPSJ Transactions on Advanced Computing Systems (Web) 8 (2) Jun 2015 [Refereed]
2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015 2015 [Refereed]
2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015 2015 [Refereed]
PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW) 2014 [Refereed]
EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT I 8805 2014 [Refereed]
2013 19TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2013) 2013 [Refereed]
IEEE TRANSACTIONS ON MAGNETICS 48 (2) Feb 2012 [Refereed]
IEEE TRANSACTIONS ON MAGNETICS 47 (5) May 2011 [Refereed]
Plasma and Fusion Research 6 (1) 2011 [Refereed]
Web Application for Evaluating Performance of Linear System Solver Using GPU. NUMERICAL ANALYSIS AND APPLIED MATHEMATICS, VOLS I-III 1281 2010 [Refereed]
Awards & Honors
Books etc
Research Grants & Projects
|
Name (Japanese) | 前田 宗則 |
| Name | MAEDA Munenori | |
| Faculty | ||
| Section | ||
| Position | Senior Researcher | |
| Theme | Parallel and Distributed Systems, Storage | |
| Related Links | ||
munem ccs.tsukuba.ac.jp |
Academic & Professional Experience
Apr 2021 Fujitsu Limited Fujitsu Research Principal Researcher
Sep 2017-Mar 2021 Fujitsu Laboratories Ltd. IT Systems Laboratories Senior Researcher
Nov 1998-Aug 2017 Fujitsu Laboratories Ltd. Computer Systems Laboratories Researcher
Nov 1992-Oct 1998 Real World Computing Partnership (RWCP) Parallel and Distributed System Software TRC Laboratory Researcher
Apr 1989-Oct 1992 Fujitsu Limited International Institute for Advanced Study of Social Information Science Researcher
Published Papers
IPSJ Special Interest Group on Programming 15 (2) May 2022
IEICE Technical Report 116 (117) Aug 2016
FUJITSU SCIENTIFIC & TECHNICAL JOURNAL (FSTJ) 50 (1) Jan 2014
Awards & Honors
Jun 2014 Interop Tokyo 2014 Best of Show Award
Sep 1995 1995 International Workshop on Memory Management (IWMM'95) Best Presentation Award
|
Name (Japanese) | 塙 敏博 |
| Name | HANAWA Toshihiro | |
| Faculty | Information Technology Center, The University of Tokyo / University of Tsukuba | |
| Section | ||
| Position | Professor / Visiting Associate Professor | |
| Theme | High-performance Interconnect, Accelerated Computing, Large-scale Parallel Processing | |
| Related Links | ||
|
Name (Japanese) | 櫻井 鉄也 |
| Name | SAKURAI Tetsuya | |
| Faculty | Graduate School of Systems and Information Engineering | |
| Section | ||
| Position | Professor (Collaborative Fellow) | |
| Theme | Numerical algorithms and simulation, Mathematical software for GRID computing | |
| Related Links | ||
|
Name (Japanese) | 山口 佳樹 |
| Name | YAMAGUCHI Yoshiki | |
| Faculty | Graduate School of Systems and Information Engineering | |
| Section | ||
| Position | Professor (Collaborative Fellow) | |
| Theme | Reconfigurable System, Energy-efficient computer system and architecture, Dependable computer system | |
| Related Links | ||
|
Name (Japanese) | 今倉 暁 |
| Name | IMAKURA Akira | |
| Faculty | Graduate School of Systems and Information Engineering | |
| Section | ||
| Position | Associate Professor (Collaborative Fellow) | |
| Theme | Numerical linear algebra, Algorithms for solving linear systems (Krylov subspace methods and preconditioning techniques) | |
| Related Links | ||

cs.tsukuba.ac.jp