Name | Faculty | Position |
---|---|---|
Boku Taisuke | Professor/Director of CCS/Chief | |
Takahashi Daisuke | Professor | |
Tatebe Osamu | Professor | |
Nukada Akira | Professor | |
Tadano Hiroto | Assistant Professor | |
Kobayashi Ryohei | Assistant Professor | |
Fujita Norihisa | Assistant Professor | |
Hanawa Toshihiro | Information Technology Center, The University of Tokyo / University of Tsukuba | Associate Professor / Visiting Associate Professor |
Yasunaga Moritoshi | Graduate School of Systems and Information Engineering | Professor (Collaborative Fellow) |
Wada Koichi | Graduate School of Systems and Information Engineering | Professor (Collaborative Fellow) |
Sakurai Tetsuya | Graduate School of Systems and Information Engineering | Professor (Collaborative Fellow) |
Yamaguchi Yoshiki | Graduate School of Systems and Information Engineering | Associate Professor (Collaborative Fellow) |
Imakura Akira | Graduate School of Systems and Information Engineering | Associate Professor (Collaborative Fellow) |
![]() |
氏名 | 朴 泰祐 |
Name | Boku Taisuke | |
Faculty | ||
Section | ||
Position | Professor/Director of CCS/Chief | |
Theme | Large scale parallel processing, high performance interconnection, cluster computing, hybrid parallel processing system | |
Related Links | ||
taisuke![]() |
Research Interests
Academic & Professional Experience
Mar 2005-PresentUniversity of Tsukuba Graduate School of Systems and Information Engineering ProfessorMar 2004-Feb 2005University of Tsukuba Graduate School of Systems and Information Engineering Associate ProfessorAug 1995-Mar 2004University of Tsukuba Institute of Information Systems and Engineering Associate ProfessorFeb 1992-Jul 1995University of Tsukuba Institute of Information Systems and Engineering LecturerApr 1988-Feb 1992Keio University Department of Physics, Faculty of Science and Technology Assistant ProfessorPublished Papers
GPU・FPGA複合演算加速による宇宙輻射輸送コードARGOTの性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-173 (8) Mar 2020スーパーコンピュータCygnus上におけるFPGA間パイプライン通信の性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-173 (24) Mar 2020Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops Jan 2020[Refereed]OpenCL対応GPU・FPGAデバイス間連携機構による宇宙輻射輸送コードの演算加速研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-172 (8) Dec 2019GPU-FPGA協調プログラミングを実現するコンパイラの開発研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-172 (11) Dec 2019再構成可能なハードウェアを用いた演算と通信を融合する手法の提案と性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-171 (6) Sep 20192019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Jul 2019[Refereed]2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Jul 2019[Refereed]OpenCL対応FPGA間通信機能によるGPU・FPGA複合型演算加速研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-170 (5) Jul 2019GPU・FPGA複合演算加速による輻射流体シミュレーションコードARGOTの実装研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-170 (22) Jul 2019Optimization on Astrophysical Radiative Transfer Code for FPGAs with OpenCLIPSJ Transactions on Advanced Computing System 12 (3) Jul 2019[Refereed]GPU-FPGA協調計算を記述するためのプログラミング環境に関する研究研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-169 (10) May 2019Proceedings of XXI International Conference on Ultrafast Phenomena 2018 205 (04023) Apr 2019高位設計と低位設計の違いとFPGA演算性能の関係について情報処理学会第81回全国大会講演論文集 Mar 2019Computer Physics Communications 235 Feb 2019[Refereed]GPU・FPGA混載ノードにおけるヘテロ演算加速プログラム環境に関する研究研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-168 (10) Feb 2019INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 33 (1) Jan 2019[Refereed]異デバイス間でのPCIe通信を実現するOpenCL対応FPGAモジュールの提案と検証IEICE-RECONF2018-63 IEICE-118 (432) Jan 2019Scalable Communication Performance Prediction Using Auto-Generated Pseudo MPI Event Trace.Proc. of HPC Asia 2019 Jan 2019[Refereed]Proceedings of the HPC Asia 2019 Workshops Jan 2019[Refereed]OpenCLによるFPGA上の演算と通信を融合した並列処理システムの実装及び性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-167 (9) Dec 2018OpenCLとVerilog HDLの混合記述によるGPU-FPGAデバイス間連携研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-167 (11) Dec 2018FPGAによる宇宙輻射輸送シミュレーションの演算加速IEICE-RECONF2018-25 118 (215) Sep 2018並列FPGAシステムにおけるOpenCLを用いた宇宙輻射輸送コードの演算加速研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-165 (27) Jul 2018GPU-FPGA複合システムにおけるデバイス間連携機構研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-165 (26) Jul 2018Performance Optimization and Evaluation of Scalable Optoelectronics Application on Large Scale KNL ClusterProc. of Int. Symposium on Supercomputing (ISC) 2018 10876/2018 Jun 2018[Refereed]HEART 2018 Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies Article No. 6 Jun 2018[Refereed]Performance Optimization and Evaluation of Scalable Optoelectronics Application on Large Scale KNL Cluster"Proc. of International Symposium Supercomputing 2018 Jun 2018[Refereed]Accelerating Space Radiative Transfer on FPGA using OpenCLProc. of HEART2018 Jun 2018[Refereed]複数のFPGAによる分散ソーティングの実現に向けた予備評価Technical report of IEICE. EA 118 (63) May 2018電子動力学シミュレーションコードのメニーコアプロセッサ とGPUにおける性能比較ハイパフォーマンス・コンピューティング研究会報告 Mar 2018宇宙輻射輸送計算におけるHDL設計とOpenCL設計の比較情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-163 (24) Feb 2018HPC Asia 2018 Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Jan 2018[Refereed]Proc. of Int. Conference on High Performance Computing in Asia-Pacific Region Jan 2018[Refereed]Meeting Abstracts of the Physical Society of Japan 73 2018OpenCLを用いたFPGAによる宇宙輻射輸送シミュレーションの演算加速情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-161 (12) Sep 2017PCIe Gen3データ転送におけるFPGA性能の徹底調査電子情報通信学会技術研究報告 117 (221) Sep 2017電子動力学シミュレーションARTEDのKNLシステムOakforest-PACSでの全系性能評価ハイパフォーマンス・コンピューティング研究会報告 Jul 2017フロー解析によるマルチGPU対応OpenACCコンパイラ情報処理学会ハイパフォーマンスコンピューティング研究会研究報告 Jul 2017OpenCLとVerilog HDLの混合記述によるFPGA間Ethernet接続情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) Jul 2017高位合成によるFPGAの高性能計算へ適用ハイパフォーマンスコンピューティングと計算科学シンポジウム論文集 May 2017[Refereed]アクセラレータクラスタ向けPGAS言語XcalableACCの片側通信機能の実装と評価情報処理学会第158回HPC研究会報告2017-HPC-158 Mar 2017KNLメニーコア・プロセッサにおけるPGAS言語XcalableMPアプリケーションの性能評価情報処理学会第158回HPC研究会報告2017-HPC-158 Mar 2017Meeting Abstracts of the Physical Society of Japan 72 2017電子動力学コード ARTED による Knights Landing プロセッサの性能評価情報処理学会第157回HPC研究会報告2016-HPC-157 Dec 2016Design and Preliminary Evaluation of Omni OpenACC Compiler for Massive MIMD Processor PEZY-SCLNCS 9903: OpenMP: Memory, Devices, and Tasks Oct 2016[Refereed]Design and Preliminary Evaluation of Omni OpenACC Compiler for Massive MIMD Processor PEZY-SCLNCS 9903: OpenMP: Memory, Devices, and Tasks Oct 2016[Refereed]密結合並列演算加速機構TCAにおける複数DMACの活用によるGPU対応GASNetの性能改善情報処理学会第156回HPC研究会報告2016-HPC-156 Sep 2016GPUクラスタにおけるGPUセルフMPIシステムGMPIの予備性能評価情報処理学会第155回HPC研究会報告2016-HPC-155 Aug 2016密結合並列演算加速機構TCAによるGPU対応GASNetの実装と評価2016年ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2016) 論文集, 2016 Jun 2016[Refereed]電子動力学シミュレーションのステンシル計算に対するメニーコアプロセッサ向け最適化2016年ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2016) 論文集 Jun 2016[Refereed]Electron Dynamics Simulation with Time-Dependent Density Functional Theory on Large Scale Symmetric Mode Xeon Phi ClusterProc. of PDSEC2016 (in IPDPS2016) May 2016[Refereed]電子動力学シミュレーションのステンシル計算最適化とメニーコアプロセッサへの実装情報処理学会論文誌コンピューティングシステム(ACS) 9 (4) Apr 2016[Refereed]PEZY-SC向けOmni OpenACCコンパイラの設計・試作情報処理学会第154回HPC研究会報告2016-HPC-154 Apr 2016Performance evaluation of Stratix V DE5-Net FPGA board for high performance computing2016 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS, AND ITS APPLICATIONS (IC3INA) - RECENT PROGRESS IN COMPUTER, CONTROL, AND INFORMATICS FOR DATA SCIENCE 20162016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI) 2016[Refereed]Extreme SIMDアーキテクチャのプログラミングモデル拡張Cによる性能評価IPSJ SIG Notes 2015 (24) Feb 2015PGAS言語XcalableMPを用いたHPC Challengeベンチマークの実装と評価IPSJ SIG Notes 2015 (21) Feb 2015実時間実空間密度汎関数理論による電子動力学シミュレーションのXeon Phiクラスタ向け最適化IPSJ SIG Notes 2015 (19) Feb 2015GPUクラスタにおけるGPU間セルフ通信機構に関する提案IPSJ SIG Notes 2015 (17) Feb 2015GPU向けFFTコードのTCAアーキテクチャによる実装と性能評価IPSJ SIG Notes 2015 (12) Feb 2015Performance Benchmark of FMO Calculation with GPU-Accelerated Fock Matrix Preparation RoutineJournal of Chemical Software 13 (6) 2015[Refereed]GPU-accelerated FMO Calculation with OpenFMO: Four-Center Inter-Fragment Coulomb InteractionJournal of Chemical Software 14 (3) 2015[Refereed]XcalableACC:OpenACCを用いたアクセラレータクラスタのためのPGAS言語XcalableMPの拡張IPSJ SIG Notes 2014 (7) Sep 2014GPU向けQCDライブラリQUDAのTCAアーキテクチャ実装の性能評価IPSJ SIG Notes 2014 (43) Jul 2014[Refereed]A preminarily evaluation on primitive data transfer performance of PEACH3IEICE technical report. Computer systems 114 (155) Jul 2014JOURNAL OF COMPUTATIONAL PHYSICS 265 May 2014[Refereed]Accelerating breadth-first search using Tightly Coupled AcceleratorIEICE technical report. Computer systems 114 (21) Apr 2014GPU向けQCDライブラリQUDAのTCAアーキテクチャによる実装IPSJ SIG Notes 2014 (35) Feb 2014Tightly Coupled Acceleratorsアーキテクチャに向けたXcalableMP拡張IPSJ SIG Notes 2014 (34) Feb 2014HA-PACS/TCAシステムにおけるマルチノードGPU間通信性能評価IPSJ SIG Notes 2014 (20) Jan 2014A FPGA/GPU cooperation in nodes communication using PEACH2IEICE technical report. Computer systems 113 (417) Jan 2014Fock Matrix Preparation in Fragment Molecular Orbital Method with GPGPU情報処理学会論文誌. コンピューティングシステム 6 (4) Oct 2013[Refereed]Implementation and Performance Evaluation of Astrophysical Tree-code for GPU Clusters情報処理学会論文誌. コンピューティングシステム 6 (3) Sep 2013[Refereed]並列言語XMP-devにおけるGPU/CPU動的負荷分散機能IPSJ SIG Notes 2013 (40) Jul 2013GPUクラスタHA-PACSにおける核融合シミュレーションコードの性能評価IPSJ SIG Notes 2013 (39) Jul 2013大規模SIMD型アクセラレータの検討IPSJ SIG Notes 2013 (38) Jul 2013TCAアーキテクチャによる並列GPUアプリケーションの性能評価IPSJ SIG Notes 2013 (37) Jul 2013各種アプリケーションにおけるGPGPU対Many Core Processorの性能比較IPSJ SIG Notes 2013 (21) May 2013京速コンピュータ「京」における核融合シミュレーションコードGTC-Pの評価IPSJ SIG Notes 2013 (2) May 2013GPUクラスタ向け並列言語XMP-devにおけるGPU/CPU協調計算IPSJ SIG Notes 2013-HPC-138 (25) Feb 2013GPUクラスタにおける核融合シミュレーションコードの実装IPSJ SIG Notes 2013-HPC-138 (21) Feb 2013分子軌道計算のGPGPU化に向けた行列加算手法の提案IPSJ SIG Notes 2013-HPC-138 (19) Feb 2013計算宇宙物理のための GPUクラスタ向け並列Tree Codeの開発と性能評価情報処理学会論文誌コンピューティングシステム 6(3) (3) 2013[Refereed]Tightly Coupled Acceleratorsアーキテクチャ向け通信機構の予備評価IPSJ SIG Notes 2012 (13) Dec 2012Tightly Coupled Acceleratorsアーキテクチャのための通信機構IPSJ SIG Notes 2012 (26) Jul 2012PCI ExpressネットワークPEARLにおける耐故障機構IPSJ SIG Notes 2012 (3) Jul 2012都市街区を対象にした並列都市LES気象モデルの開発大会講演予講集 101 Apr 2012並列言語XcalableMPのアクセラレータ向け言語拡張のOpenCL実装IPSJ SIG Notes 2012 (9) Mar 2012スクリプト言語Xcryptによる格子QCDシミュレーションのパラメータサーチ自動化2012年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2012論文集 Memory Card Jan 2012[Refereed]PEACH: A MULTICORE COMMUNICATION SYSTEM ON CHIP WITH PCI EXPRESSIEEE MICRO 31 (6) Nov 2011[Refereed]PEARL: Power-aware, Dependable, and High-Performance Communication Link Using PCI ExpressProc. of IEEE/ACM International Conference on Green Computing and Communitations (GreenCom2010) Nov 2011[Refereed]D302 都市街区を対象にした並列LES気象モデルの開発(大気境界層,一般口頭発表)大会講演予講集 100 Oct 2011D301 高解像度LES計算のGPUによる計算加速(大気境界層,一般口頭発表)大会講演予講集 100 Oct 2011都市街区を対象にした並列LES気象モデルの開発第13回非静力学モデルに関するワークショップ予稿集 Oct 2011高解像度LES計算のGPUによる高速化と性能評価第13回非静力学モデルに関するワークショップ予稿集 Oct 2011気象モデルの高解像度計算のGPU化研究報告ハイパフォーマンスコンピューティング(HPC) Oct 2011気象モデルの高解像度計算のGPU化IPSJ SIG Notes 2011 (2) Sep 2011XMCAPI: Inter-core Communication Interface on Multi-chip Embedded SystemsProc. of Embedded and Ubiquitous Computing (EUC) 2011 Sep 2011[Refereed]複雑地形・都市街区を対象にしたLES気象モデルの開発日本流体力学会年会2011予稿集 Sep 2011An Extension of XcalableMP PGAS Language for Multi-node GPU ClustersProc. of 9th Int. Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (Heteropar) 2011 CD-ROM Aug 2011[Refereed]スクリプト言語Xcryptによる格子QCDシミュレーションの最適化IPSJ SIG Notes 2011 (58) Jul 2011[Refereed]PGAS言語XcalableMPのmulti-node GPU向け拡張仕様の実装と評価IPSJ SIG Notes 2011 (53) Jul 2011PGAS言語XcalableMPとUnified Parallel Cの性能比較IPSJ SIG Notes 2011 (52) Jul 2011演算加速装置に基づく超並列クラスタHA-PACSによる大規模計算科学IPSJ SIG Notes 2011 (21) Jul 2011PCI Expressを用いた通信リンクPEARLにおけるネットワーク管理機構IPSJ SIG Notes 2011 (6) Jul 2011Development of Local Meteorological Model based on LES ModelAbstracts of International Workshop on Urban Weather and Climate:Observation and Modeling Jul 2011PEARL and PEACH: A Novel PCI Express Direct Link and Its ImplementationProc. of Seventh Workshop on High-Performance, Power-Aware Computing (HPPAC 2011) in IPDPS2011 CD-ROM May 2011[Refereed]B403 一般曲線座標系による並列LESモデルの開発(気象予報,一般口頭発表)大会講演予講集 99 Apr 2011Software Distributed Shared Memory for Embedded System by MCAPI情報処理学会研究報告. EMB, 組込みシステム 2011 (17) Mar 2011Software Distributed Shared Memory for Embedded System by MCAPI情報処理学会研究報告. SLDM, [システムLSI設計技術] 2011 (17) Mar 2011Extend to GPU for XcalableMP: A Parallel Programming LanguageIPSJ SIG Notes 2011 (12) Mar 2011Development of Local Meteorological Model based on CFD Model5th International symposium on wind effects on buildings and urban environment (ISWE5) Mar 2011一般曲線座標系による並列LES モデルの開発日本地理学会2011年春季学術大会予稿集 Mar 2011An 80Gb/s Dependable Communication SoC with PCI Express I/F and 8 CPUsProc. of ISSCC2011 CD-ROM Feb 2011[Refereed]Performance Evaluation for Inter-Core Communication Interface on Inter-/Intra-Chip on Embedded Parallel Systems情報処理学会研究報告. UBI, [ユビキタスコンピューティングシステム] 25 2011Performance Evaluation for Inter-Core Communication Interface on Inter-/Intra-Chip on Embedded Parallel Systems情報処理学会研究報告. SLDM, [システムLSI設計技術] 144 2011XcalableMP Implementation and Performance of NAS Parallel BenchmarksProc. of PGAS10 Oct 2010[Refereed]Implementation and Performance Evaluation of XcalableMP: A Parallel Programming Language for Distributed Memory System情報処理学会論文誌. コンピューティングシステム 3 (3) Sep 2010Power-aware, Dependable, and High-Performance Communication Link Using PCI Express: PEARLProc. of IEEE International Conference on Cluster Computing (Cluster2010), poster CD-ROM Sep 2010[Refereed]Implementation and Evaluation of NAS Parallel Benchmarks in XcalableMP情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 126 Jul 2010Implementation and Evaluation of NAS Parallel Benchmarks in XcalableMPIPSJ SIG Notes 2010 (7) Jul 2010Performance Optimization based on MPI Profiler on Multi Rail Interconnection Network情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 125 (6) Jun 2010Performance Optimization based on MPI Profiler on Multi Rail Interconnection NetworkIPSJ SIG Notes 2010 (6) Jun 2010C452 複雑地形・都市を対象とした並列LESモデルの開発(大気境界層II,一般口頭発表)大会講演予講集 97 Apr 2010PACS-CS: Bandwidth-Aware Massively Parallel Cluster計算工学 15 (2) Apr 2010Large Scale PC Cluster "T2K-Tsukuba" and GCM Code Execution on It(Current Status and Future Prospects of Large-Scale Numerical Simulations-Part2) Journal of Japan Society of Fluid Mechanics 29 (2) Apr 2010A massively-parallel electronic-structure calculations based on real-space density functional theoryJOURNAL OF COMPUTATIONAL PHYSICS 229 (6) Mar 2010[Refereed]Performance Evaluation for Inter-Core Communication Interface on Inter-/Intra-Chip on Embedded Parallel Systems情報処理学会研究報告. MBL, [モバイルコンピューティングとユビキタス通信研究会研究報告] = IPSJ SIG technical reports 53 Mar 2010Performance Evaluation for Inter-Core Communication Interface on Inter-/Intra-Chip on Embedded Parallel Systems情報処理学会研究報告. UBI, [ユビキタスコンピューティングシステム] 2010 (40) Mar 2010Performance Evaluation for Inter-Core Communication Interface on Inter-/Intra-Chip on Embedded Parallel Systems情報処理学会研究報告. SLDM, [システムLSI設計技術] 2010 (40) Mar 2010Performance Evaluation for Inter-Core Communication Interface on Inter-/Intra-Chip on Embedded Parallel SystemsIPSJ SIG technical reports 2010 (40) Mar 2010Performance Evaluation for Inter-Core Communication Interface on Inter-/Intra-Chip on Embedded Parallel Systems情報処理学会研究報告. EMB, 組込みシステム 2010 (40) Mar 2010Asymmetric Multi-link Ethernet Trunking System with Adaptive Traffic Control情報処理学会論文誌. コンピューティングシステム 3 (1) Mar 2010[Refereed]Performance Optimization based on Memory Bandwidth Characteristics on Multi-core Node情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 124 Feb 2010Performance Optimization based on Memory Bandwidth Characteristics on Multi-core NodeIPSJ SIG Notes 2010 (4) Feb 2010Communicator Chip for Power-aware, Dependable, and High-performance Communication Link Using PCI Express : PEACH情報処理学会研究報告. 計算機アーキテクチャ研究会報告 187 Jan 2010Communicator Chip for Power-aware, Dependable, and High-performance Communication Link Using PCI Express: PEACHIPSJ SIG Notes 2010 (12) Jan 2010Communicator Chip for Power-aware, Dependable, and High-performance Communication Link Using PCI Express: PEACH情報処理学会研究報告. EMB, 組込みシステム 2010 (12) Jan 2010Communicator chip for power-aware, dependable, and high-performance communication link using PCI Express: PEACHIEICE technical report 109 (405) Jan 2010A Fast and Portable Virtual Memory System Utilizing Memory Resource of a Cluster情報処理学会論文誌. コンピューティングシステム 2 (4) Dec 2009Flexible Multi-link Ethernet Binding System for PC Clusters with Asymmetric TopologyProc. of ICPADS2009 Memory Card Nov 2009[Refereed]A Design of User-Defined Data Distribution in XcalableMP情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 122 Oct 2009A Design of User-Defined Data Distribution in XcalableMPIPSJ SIG Notes 2009 (1) Oct 2009トラフィック量に適応する非対称マルチリンクEthernetトランキング第21回コンピュータシステムシンポジウムComSys2009論文集 CD-ROM Oct 2009[Refereed]Inter-Core Communication Interface on Inter-/Intra-Chip Communication for Embedded Parallel Systems情報処理学会研究報告. 計算機アーキテクチャ研究会報告 184 (2) Aug 2009Flexible Multi-link Ethernet Binding System for PC Clusters with Asymmetrical Topology情報処理学会研究報告. [システムソフトウェアとオペレーティング・システム] 112 Jul 2009Flexible Multi-link Ethernet Binding System for PC Clusters with Asymmetrical TopologyIPSJ SIG Notes 2009 (17) Jul 2009Inter-Core Communication Interface on Inter-/Intra-Chip Communication for Embedded Parallel SystemsIPSJ SIG Notes 2009 (2) Jul 2009XcalableMP: A Prallel Programming Model for Distributed Memory SystemIPSJ SIG Notes 2009 (6) Jul 2009A Fast and Large Virtual Memory on MPI for using a Cluster as a Memory ResourceIEICE technical report. Computer systems 109 (168) Jul 2009Foreword(The Heisei 20 IPSJ Outstanding Paper Award)Journal of Information Processing Society of Japan 50 (7) Jul 2009Performance evaluation on Ethernet Multilink Bonding System for High performance and Fault-tolerance情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 120 Jun 2009Performance evaluation on Ethernet Multilink Bonding System for High performance and Fault-toleranceIPSJ SIG Notes 2009 (9) Jun 2009Evaluation of Multicore Processor for Embedded Systems by Parallel Benchmark Program using OpenMPProc. of 5th International Workshop on OpenMP (IWOMP 2009), Lecture Notes in Computer Science 5568 Jun 2009[Refereed]Design and Power Performance Evaluation of On-chip Memory Processor with Arithmetic Accelerators情報処理学会論文誌. コンピューティングシステム 2 (1) Mar 2009Novel Computer Environment Created with T2K Open SupercomputerJournal of the Japan Society for Computational Engineering and Science 14 (1) Jan 2009Evaluation of NFS on Ethernet Multilink Bonding System for High performance and Fault-toleranceIPSJ SIG Notes 2008 (99) Oct 2008Evaluation of Multi-core Processor for Embedded Systems by Parallel Benchmark Program using OpenMPIPSJ SIG Notes 2008 (75) Jul 2008Software Distributed Shared Memory System with Page Prefetch Thread for Multi-core ProcessorsIPSJ SIG Notes 2008 (74) Jul 2008Performance Evaluation of Linpack on T2K-Tsukuba SystemIPSJ SIG Notes 2008 (74) Jul 2008JOURNAL OF GRID COMPUTING 6 (2) Jun 2008Efficient Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix Multiplication情報処理学会論文誌. コンピューティングシステム 1 (1) Jun 2008[Refereed]User-transparent Ethernet Multilink Bonding System for High Performance and Fault-tolerance情報処理学会論文誌. コンピューティングシステム 1 (1) Jun 2008[Refereed]OpenMPD: A Directive-Based Data Parallel Language Extension for Distributed Memory SystemsProc. of 1st Int. Workshop on Parallel Programming Models and System Software for High-End Computing (P2S2) (included in Proc. of ICPP08), Portland Jan 2008[Refereed]A Dynamic Routing Control System for High-Performance PC Cluster with Multi-path Ethernet ConnectionProc. of CAC2008 (in IPDPS2008), Miami Jan 2008[Refereed]A Dynamic Route Control System for PC Clusters with Multi-path Network(Cluster System)情報処理学会論文誌. コンピューティングシステム 48 (18) Dec 2007[Refereed]Low-Power and High-Performance Communication Mechanism for Dependable Embedded SystemsIPSJ SIG Notes 2007 (122) Dec 2007International Lattice Data Grid for computational particle physics and national Data Grid JLDGIPSJ SIG Notes 2007 (122) Dec 2007User-transparent Ethernet multilink bonding system for fault-tolerance and high performanceIPSJ SIG Notes 2007 (88) Sep 2007Power performance evaluation of on-chip memory processor with arithmetic acceleratorsIPSJ SIG Notes 2007 (79) Aug 2007A Dynamic Route Control System for PC Clusters with Multi-path Network Using tagged-VLAN TechnologyIPSJ SIG Notes 2007 (80) Aug 2007Increasing Neighbour Communication Performance Techniques for the PACS-CS SystemIPSJ SIG Notes 2007 (80) Aug 2007High Performance Computing : Toward a Helpful Match-maker between the System and Applications(1001SIG Nights)Journal of Information Processing Society of Japan 48 (7) Jul 2007RI2N/UDP : High Bandwidth and Fault-tolerant Network for PC-cluster Based on Multi-link Ethernet(Network)情報処理学会論文誌. コンピューティングシステム 48 (8) May 2007[Refereed]RI2N/UDP: High bandwidth and fault-tolerant network for PC-cluster based on multi-link EhternetProc. 2007 IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), The Workshop on Communication Architecture for Clusters (CAC 2007) Apr 2007[Refereed]A study on arithmetic accelerators for on-chip memory processorIPSJ SIG Notes 2007 (17) Mar 2007Design and Implementation of OpenMPD parallel programming language for distributed memoryIPSJ SIG Notes 2007 (17) Mar 2007A study on arithmetic accelerators for on-chip memory processorIPSJ SIG Notes 2007 (17) Mar 2007Design and Implementation of OpenMPD parallel programming language for distributed memoryIPSJ SIG Notes 2007 (17) Mar 2007Design and Implementation of OpenMPD: An OpenMP-like Programming Language for Distributed Memory SystemsProc. of IWOMP2007, Beijing Jan 2007[Refereed]Dividing program into regions for controlling DVFSIPSJ SIG Notes 2006 (106) Oct 2006Power Performance Evaluation and Power Performance Optimization on MegaProto/EIPSJ SIG Notes 2006 (106) Oct 2006High-bandwidth and Fault-tolerant Network for PC Clusters based on Tagged-VLAN and Multi-link Ethernet Technologies(Session 3:Cluster/Grid)IPSJ SIG Notes 2006 (106) Oct 2006Reducing Energy of Parallel Programs with Load Imbalance by Using DVS(Cluster Systems)IPSJ Transactions on Advanced Computing Systems 47 (12) Sep 2006[Refereed]Profile-based Optimization of Power Performance by Using Dynamic Voltage Scaling on a PC Cluster(Cluster Systems)IPSJ Transactions on Advanced Computing Systems 47 (12) Sep 2006[Refereed]VFREC-Net: Multi-path Network for PC Clusters Based on Tagged-VLAN Technology with Driver ControlIPSJ Transactions on Advanced Computing Systems 47 (SIG 12(ACS 15)) Sep 2006[Refereed]Empirical Study on Reducing Energy of Parallel Programs using Slack Reclamation by DVFSProc. 2006 IEEE International Conference on Cluster Computing (Cluster 2006) Sep 2006[Refereed]Power Performance Optimization using Total Power Profile on a PC clusterIPSJ SIG Notes 2006 (88) Jul 2006A Design of High Performance Communication Library for the PACS-CS SystemIPSJ SIG Notes 2006 (87) Jul 2006Implementation and Performance Evaluation of the Large Scale Cluster PACS-CS for Scientific ComputationIPSJ SIG Notes 2006 (87) Jul 2006P2P Overlay Network based on UDP Firewall TraversalIPSJ SIG Notes 2006 (87) Jul 2006Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix MultiplicationIPSJ SIG Notes 2006 (63) Jun 2006Design and Implementation of Grid RPC System Integrating Computing Resources on Multiple Grid-enabled Job Execution Systems(Grid System)情報処理学会論文誌. コンピューティングシステム 47 (7) May 2006[Refereed]Reducing energy of parallel programs with load imbalance by using DVSIPSJ SIG Notes 2006 (20) Feb 2006Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC clusterIPSJ SIG Notes 2006 (20) Feb 2006RI2N/UDP: Fault-tolerant network for PC-clusters based on multi-link EthernetIPSJ SIG Notes 2006 (20) Feb 2006Reducing energy of parallel programs with load imbalance by using DVSIPSJ SIG Notes 2006 (20) Feb 2006Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC clusterIPSJ SIG Notes 2006 (20) Feb 2006RI2N/UDP: Fault-tolerant network for PC-clusters based on multi-link EthernetIPSJ SIG Notes 2006 (20) Feb 2006Formation of dwarf galaxies in reionized universe with heterogeneous multicomputer systemINTERNATIONAL JOURNAL FOR MULTISCALE COMPUTATIONAL ENGINEERING 4 (2) 2006PACS-CS: A large-scale bandwidth-aware PC cluster for scientific computationsProc. Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06) Jan 2006[Refereed]複数Gigabit Ethernetを用いたPACS-CSのための高性能通信機構の設計と評価情報処理学会論文誌コンピューティングシステム 47 (SIG12(ACS15)) Jan 2006[Refereed]MegaProto/E: Power-Aware High-Performance Cluster with Commodity TechnologyProc. of HP-PAC06 (in IPDPS2006), Rhodes Jan 2006[Refereed]Scalable Communication Layer for Multi-Dimensional Crossbar Network Using Multiple Gigabit EthernetProc. of ICS2006, Cairns Jan 2006[Refereed]High-bandwidth Tree Network for PC Clusters based on Tagged-VLAN TechnologyIPSJ SIG Notes 2005 (97) Oct 2005Performance Improvement by Initial Data Management on Grid RPC System OmniRPCIPSJ SIG Notes 2005 (97) Oct 2005MegaProto : A Low-power and Compact Cluster for High-performance Computing(HPC Hardware)情報処理学会論文誌. コンピューティングシステム 46 (12) Aug 2005[Refereed]"FIRST"-a hybrid cluster system for the elucidation on the origin of FIRST generation objects in the universeIPSJ SIG Notes 2005 (81) Aug 2005A Design of High Performance Communication Facility Using Ethernet for the PACS-CS systemIPSJ SIG Notes 2005 (81) Aug 2005PACS-CS : A massively parallel cluster for computational sciencesIPSJ SIG Notes 2005 (81) Aug 2005Design and Implementation of a Grid RPC System on Multiple Grid MiddlewaresIPSJ SIG Notes 2005 (81) Aug 2005Optimization of Power-Performance by controlling DVS on a PC clusterIPSJ SIG Notes 2005 (80) Aug 2005Optimization and Evaluation of Power Performance by Using On-Chip RAMIPSJ SIG Notes 2005 (80) Aug 2005Design of Software Distributed Shared Memory System Using MPI Communication Layer(Software DSM)情報処理学会論文誌. コンピューティングシステム 46 (7) May 2005[Refereed]MegaProto : A Low-Power and Compact Cluster for High-Performance ComputingIPSJ SIG Notes 2005 (19) Mar 2005MegaProto : A Low-Power and Compact Cluster for High-Performance ComputingIPSJ SIG Notes 2005 (19) Mar 2005MegaProto : A Low-Power and Compact Cluster for High-Performance Computing情報処理学会研究報告. ARC,計算機アーキテクチャ研究会報告 162 Mar 2005MegaProto: A Low-Level and Compact Cluster for High-Performance ComputingProc. of HP-PAC05 (in IPDPS2005), Denver Jan 2005[Refereed]MegaProto: 1 TFlops/10kW Rack Is Feasible Even with Only Commodity TechnologyProc. of SC05, Seattle Jan 2005[Refereed]Measurement of Microprocessor's Power Consumption and Prototyping Low Power Cluster with Low Power Processors(Power Conservation)情報処理学会論文誌. コンピューティングシステム 45 (SIG11(ACS7)) Oct 2004[Refereed]Implementation and Evaluation of Parallel FFT Using Short Vector SIMD Instructions(Performance Optimization)情報処理学会論文誌. コンピューティングシステム 45 (11) Oct 2004[Refereed]Design of Grid RFC System OmniRPC on XtremWeb P2P GridIPSJ SIG Notes 2004 (81) Jul 2004Implementation and Performance Evaluation of CONFLEX-G : A Grid Enabled Conformational Space Search Program by OmniRPC(Grid Applications)情報処理学会論文誌. コンピューティングシステム 45 (6) May 2004[Refereed]Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous Clusters(Numerical Computation)情報処理学会論文誌. コンピューティングシステム 45 (SIG06(ACS6)) May 2004[Refereed]Software Distributed Shared Memory System on MPIIPSJ SIG Notes 2004 (38) Apr 2004Performance Evaluation of OmniRPC in a Grid EnvironmentProc. of Int. Workshop on High Performance Grid Computing and Networking, in Int. Symp. on Applications and Internet, Tokyo Jan 2004[Refereed]Heterogeneous Remote Computing System for Computational Astrophysics with OmniRPCProc. of Int. Workshop on High Performance Grid Computing and Networking, in Int. Symp. on Applications and Internet, Tokyo Jan 2004[Refereed]Parallel Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous ClustersProc. of Heterogeneous Computing Workshop 2004 (in IPDPS2004), Santa Fe Jan 2004[Refereed]MegaProto: A Prototype of the Ultra Low-Power Mega-Scale SystemIPSJ SIG Notes 2003 (102) Oct 2003Performance of a conformational space search method by grid technology: Development of 3D-structure database for drug discovery platform.ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY 226 (Part 1) Sep 2003Platform for drug discovery by grid technology: Large scale molecular calculations and utilization of 3D descriptors.ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY 226 (Part 1) Sep 2003Report on an efficient conformational space search method using parallel computing and grid technology.ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY 226 (Part 1) Sep 2003OmniRPC : A Grid RPC System for Parallel Programming in Grid Environment (Grid Middleware)情報処理学会論文誌. コンピューティングシステム 44 (11) Aug 2003HMCS-G : Grid-enabled Hybrid Computing System for Computational Astrophysics (Grid Applications)IPSJ Transactions on Computing Systems 44 (11) Aug 2003Implementation of First Touch page allocation on Omni/SCASHIPSJ SIG Notes 2003 (84) Aug 2003Low Power Cluster using Low Power CPUIPSJ SIG Notes 2003 (84) Aug 2003Perfomance Evaluation of Grid Applications by OmniRPC in Wide Area NetworkIPSJ SIG Notes 2003 (83) Aug 2003Performance evalusation of RI2N-Interconnection network system for clusters with wide-bandwidth and fault-tolerancyIPSJ SIG Notes 2003 (83) Aug 2003SMP Configuration and Performance Evaluation of SCIMA On-chip Memory Processor Architecture for HPC情報処理学会論文誌. コンピューティングシステム 44 (6) May 2003RI2N - Interconnection network system for clusters with wide-band width and fault-tolerancy based on multiple・linksIPSJ SIG Notes 2003 (29) Mar 2003HPC向けオンチップメモリプロセッサアーキテクチャSCIMAのSMP化の検討と性能評価情報処理学会論文誌コンピューティングシステム 44 (SIG6(ACS1)) Jan 2003[Refereed]OmniRPC: グリッド環境での並列プログラミングのためのGrid RPCシステム情報処理学会論文誌コンピューティングシステム 44 (SIG11(ACS3)) Jan 2003[Refereed]HMCS-G: グリッド環境における計算宇宙物理のためのハイブリッド計算システム情報処理学会論文誌コンピューティングシステム 44 (SIG11(ACS3)) Jan 2003[Refereed]OmniRPC: a Grid RPC System for Parallel Programming in Cluster and Grid EnvironmentProc. of Int. Workshop on Grid and Advanced Network (GAN'03) in CCGrid2003, Tokyo Jan 2003[Refereed]HMCS-G : grid enabled hybrid computing system for computational astrophysicsProc. of Int. Workshop on Grid and Advanced Network (GAN'03) in CCGrid2003, Tokyo Jan 2003[Refereed]RI2N - Interconnection network system for clusters with wide-bandwidth and fault-tolerancy based on multiple linksProc. of ISHPC-V, Tokyo LNCS (2858) Jan 2003[Refereed]OmniRPC : a Grid RPC System for Parallel Programming in Grid EnvironmentIPSJ SIG Notes 2002 (99) Oct 2002A Feasibility Study on an Itanium-based ClusterIPSJ SIG Notes 2002 (99) Oct 2002Space Radiative Transfer and Hydrodynamics Calculation with Self-gravity on Heterogeneous Multi-Computer System情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002Hybrid Parallelization for SPAM Particle Simulation on SMP-PC Clusters情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCs情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002SMP configuration and performance evaluation of SCIMA : on-chip memory processor architecture for HPCIPSJ SIG Notes 2002 (81) Aug 2002Performance Evaluation of Omni/SCASH Software Distributed Shared Memory System on Ethernet-based ClusterIPSJ SIG Notes 2002 (80) Aug 2002Performance Evaluation of the Hitachi SR8000 Under OpenMP BenchmarksIPSJ SIG Notes 2002 (22) Mar 2002Performance Evaluation of the Hitachi SR8000 Under OpenMP BenchmarksIPSJ SIG Notes 2002 (22) Mar 2002PCクラスタにおける並列一次元FFTのブロックアルゴリズム情報処理学会論文誌ハイパフォーマンスコンピューティングシステム 43 (SIG6(HPS5)) Jan 2002[Refereed]SMP-PCクラスタにおけるSPAM粒子シミュレーションのハイブリッド並列化情報処理学会論文誌ハイパフォーマンスコンピューティングシステム 43 (SIG6(HPS5)) Jan 2002[Refereed]Heterogeneous Multi-Computer System における重力効果を含む宇宙輻射流体計算情報処理学会論文誌ハイパフォーマンスコンピューティングシステム 43 (SIG6(HPS5)) Jan 2002[Refereed]Performance Evaluation of the Hitachi SR8000 Using OpenMP BenchmarksProc. of ISHPC2002 LNCS (2327) Jan 2002[Refereed]Heterogeneous Multi-Computer System: A New Platform for Multi-Paradigm Scientific SimulationProc. of ICS2002, New York Jan 2002[Refereed]Heterogeneous Multi-Computer System: A New Paradim of Parallel ProcessingProc. of Int. Conf. on Parallel Processing and Electrical Engineering, Warsaw Jan 2002[Refereed]Performance Optimization Techniques on SCIMA and Its Fvaluation情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 42 (12) Nov 2001Heterogeneous Multi-Computer System : A massively parallel processing system combining continuum and particle simulatorsIPSJ SIG Notes 2001 (102) Oct 2001Parallelization of AVS/Express可視化情報学会誌 = Journal of the Visualization Society of Japan 21 Jul 2001Hybrid Parallelization for SPAM particle codeIPSJ SIG Notes 2001 (77) Jul 2001TOP 500 (特集 ベンチマーク)Bit 33 (2) Feb 2001SCIMAにおける性能最適化手法の検討情報処理学会論文誌ハイパフォーマンスコンピューティングシステム 42 (SIG12(HPS4)) Jan 2001[Refereed]PIO: Parallel I/O System for Massively Parallel ProcessorsProc. of HPCN2001, Amsterdam LNCS (2110) Jan 2001[Refereed]Implementation and performance evaluation of SPAM particle code with OpenMP-MPI hybrid programmingProc. of EWOMP2001, Barcelona Jan 2001[Refereed]Performance evaluation of SCIMA for NASPB Kernel CG, FTIPSJ SIG Notes 2000 (93) Oct 2000Performance Evaluation of SMP-PC Cluster Based on Memory Bus Access Ratio情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 41 (5) Aug 2000Parallel I/O System on Distributed Memory Parallel Computers情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 41 (5) Aug 2000SCIMA : A New Architecture for High Performance Computing情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 41 (5) Aug 20002000-HPC-82-31 Performance evaluation of parallelized visualization tool AVS/ExpressIPSJ SIG Notes 2000 (73) Aug 2000Performance Evaluation of OpenMP+MPI on SMP-PC ClusterIPSJ SIG Notes 2000 (23) Mar 2000Performance Evaluation of OpenMP+MPI on SMP-PC ClusterIPSJ SIG Notes 2000 (23) Mar 2000Preliminary Performance Evaluation of New Memory Architecture for High Performance ComputingIPSJ SIG Notes 2000 (1) Jan 2000ハイパフォーマンスコンピューティング向けアーキテクチャSCIMA情報処理学会論文誌ハイパフォーマンスコンピューティングシステム 41 (SIG5(HPS1)) Jan 2000[Refereed]メモリバスアクセス率に基づくSMP-PCクラスタの性能評価情報処理学会論文誌ハイパフォーマンスコンピューティングシステム 41 (SIG5(HPS1)) Jan 2000[Refereed]分散メモリ型超並列計算機における並列入出力情報処理学会論文誌ハイパフォーマンスコンピューティングシステム 41 (SIG5(HPS1)) Jan 2000[Refereed]Performance Analysis of PC-CLUMP based on SMP-Bus UtilizationProc. of Int. Workshop on Cluster Based Computing (WCBC2000) in ICS2000, Santa Fe Jan 2000[Refereed]SCIMA: A Novel Processor Architecture for High Performance ComputingProc. of HPCAsia2000, Beijin Jan 2000[Refereed]Evaluation of TEA Expert-an automated performance tuning environment-on Real MPP SystemIPSJ SIG Notes 99 (66) Aug 1999Performance evaluation of hybrid parallel program on shared memory PC clusterIPSJ SIG Notes 99 (66) Aug 1999Developing Parallel Network-based Visualization SystemIPSJ SIG Notes 99 (66) Aug 1999Memory Architecture for HPC Processing ElementsIPSJ SIG Notes 99 (67) Aug 1999Large-scale Parallel Program Simulation Environment and Performance Analysis of NAS Parallel Benchmarks 2.3 (Special Issue on Parallel Processings)IPSJ Journal 40 (5) May 1999Molecular Dynamics Simulations on CP-PACS (Special Issue on Parallel Processings)IPSJ Journal 40 (5) May 1999Performance Analysis of Multistage Interconnection Network for Massively Parallel Processors (Special Issue on Parallel Processings)IPSJ Journal 40 (5) May 1999Commodity Network based Parallel I/O SystemIPSJ SIG Notes 99 (38) May 1999A study of HPC processor using On-Chip MemoryIPSJ SIG Notes 99 (21) Mar 1999A study of HPC processor using On-Chip MemoryIPSJ SIG Notes 99 (21) Mar 1999A study of HPC processor using On-Chip MemoryIPSJ SIG Notes 99 (21) Mar 1999Parallel I/O and Visualization System for Massively Parallel Processors based on Commodity NetworkIEICE technical report. Computer systems 98 (572) Jan 1999CP-PACS: A massively parallel processor at the University of TsukubaParallel Computing 25 Jan 1999[Refereed]大規模データ並列プログラムの性能予測手法とNPB2.3の性能評価情報処理学会論文誌 40 (5) Jan 1999[Refereed]超並列計算機CP-PACSにおける大規模分子動力学法シミュレーション情報処理学会論文誌 40 (5) Jan 1999[Refereed]超並列計算機用多段結合網における転送性能の解析情報処理学会論文誌 40 (5) Jan 1999[Refereed]Commodity Network based Parallel I/O System for Massively Parallel ProcessorsProc. of PDPTA'99, Las Vegas Jan 1999[Refereed]Automatic Adaptive Performance Tuning Tool, TEA ExpertIPSJ SIG Notes 98 (72) Aug 1998Performance Analysis of NAS Parallel Benchmarks on acculate Large-scale Parallel Program Simulation EnvironmentIPSJ SIG Notes 98 (72) Aug 1998Advanced Buffer Control Schemes in Massively Parallel Interconnection NetworksIPSJ SIG Notes 98 (70) Aug 1998Performance Evaluation of NPB Kernel CG on CP-PACS(Special Issue on Parallel Processing)IPSJ Journal 39 (6) Jun 1998超並列計算機CP-PACSにおけるNPB Kernel CGの評価情報処理学会論文誌 39 (6) Jan 1998[Refereed]Practical Simulation of Large-Scale Parallel Programs and Its Performance Analysis of the NAS Parallel BechmarksProc. of Euro-Par'98, Manchester LNCS (1470) Jan 1998[Refereed]Accuracy of fast performance prediction by instrumentation tool EXCITProc. of HPCAsia'98, Singapore Jan 1998[Refereed]Large Scale Molecular Dynamics Simulations on CP-PACSProc. of HPCAsia'98, Singapore Jan 1998[Refereed]VIPPES : A Virtual Parallel Processing System Simulation EnvironmentProc. of HPCAsia'98, Singapore Jan 1998[Refereed]Molecular Dynamics simulation with Spatial Decomposition method on CP-PACSIPSJ SIG Notes 97 (121) Dec 1997Design Automation System of Router Chip for Network of Parallel ComputerIPSJ SIG Notes 97 (119) Dec 1997Performance Evaluation of CP-PACS' Interconnection NetworkIPSJ SIG Notes 97 (75) Aug 1997Molecular dynamics simulation on CP-PACSIPSJ SIG Notes 97 (37) May 1997Accurate performance analysis based on code instrumentationIPSJ SIG Notes 97 (22) Mar 1997Network description verification system for massively parallel network simulator generation system INSPIRE全国大会講演論文集 54 (1) Mar 1997Automatic design system for MDX network router chip全国大会講演論文集 54 (1) Mar 1997Detour Routing Algorithm on Hyper-Crossbar Network全国大会講演論文集 54 (1) Mar 1997Evaluation of the basic performance of CP-PACSIPSJ SIG Notes 97 (22) Mar 1997Performance evaluation of CP-PACS on CG benchmarkProc. of HPCAsia'97, Seoul 96 (97) Jan 1997[Refereed]Advanced Processor Design Using Hardware Description Language AIDLProc. of Asia and South Pacific Design Automation Conference 1997, Makuhari Jan 1997[Refereed]The Architecture of Massively Parallel Processor CP-PACSProc. of 2nd pAs, Aizu Jan 1997[Refereed]Performance Improvement for Matrix Calculation on CP-PACS Node ProcessorProc. of HPCAsia'97, Seoul Jan 1997[Refereed]Performance evaluation of CP-PACS on CG benchmarkProc. of HPCAsia'97, Seoul Jan 1997[Refereed]CP-PACS: A massively parallel processor for large scale scientific calculationsProc. of ICS'97 Jan 1997[Refereed]Effectiveness of Register Preloading on CP-PACS Node ProcessorProc. of Innovative Architecture for Future Generation High-Performance Processors and Systems, Mauii Jan 1997Design Assistance for Advanced Processors Using Hardware Description Language AIDLIPSJ SIG Notes 96 (121) Dec 1996Design of Rotifer Chip for Hyper-Crossbar Network Using VHDLIPSJ SIG Notes 96 (121) Dec 1996Fast List Vector Computation on Pseudo Vector ProcessorIPSJ Journal 37 (10) Oct 1996VIPPES: A Performance Pre-Evaluation System for Parallel ProcessorsIPSJ SIG Notes 96 (81) Aug 1996Implementation PVM on CP-PACSIPSJ SIG Notes 96 (80) Aug 1996Adaptive Routing by Dynamic Selection of Virtual Channels on Hyper-Crossbar NetworkIPSJ Journal 37 (7) Jul 1996The Design of the "TEA Library" : A Performance Evaluation Tool-set for Parallel and Distributed SystemsIPSJ SIG Notes 96 (22) Mar 1996LINPACK Benchmark Evaluation on CP-PACS Pilot-ModelIPSJ SIG Notes 96 (23) Mar 1996The Architecture of Massively Parallel Processor CP-PACSIPSJ Magazine 37 (1) Jan 1996The MDX (Multi-Dimensional X'bar): A Class of Networks for Large Scale MultiprocessorsIEICE Trans. on Information and Systems E79-D (8) Jan 1996[Refereed]ハイパクロスバ・ネットワークにおけるVirtual Channelの動的選択による適応ルーティング情報処理学会論文誌 37 (7) Jan 1996[Refereed]擬似ベクトルプロセッサにおける高速リストベクトル処理情報処理学会論文誌 37 (10) Jan 1996[Refereed]VIPPES: A performance pre-evaluation system for parallel processorsProc. of HPCN'96, Brussel Jan 1996[Refereed]The MDX (Multi-Dimensional X'bar): A class of networks for large scale multiprocessorsProc. of PDCS'96 Jan 1996[Refereed]MDX-Baseline : an interconnection network with locality and large bandwidthTechnical report of IEICE. SSE 95 (327) Oct 1995Performance Evaluation of the Pseudo Vector Processor in List Vector Computation全国大会講演論文集 51 (6) Sep 1995Preliminary Evaluation of Cache Configurations for Multithreaded Architecture全国大会講演論文集 51 (6) Sep 1995NAS Parallel Benchmarks Evaluation on CP-PACS Pilot-ModelIPSJ SIG Notes 95 (81) Aug 1995A Network Performance Evaluation Simulator Generation System INSPIRE for Massively Parallel ProcessingIPSJ SIG Notes 95 (80) Aug 1995Advanced Techniques for Performance Improvement of Hyper-Crossbar NetworkIPSJ Journal 36 (7) Jul 1995MDX(MultiDimensional Crossbar) : A Class of Interconnection Networks for Large Scale Parallel MachinesTechnical report of IEICE. FTS 95 (23) Apr 1995MDX(MultiDimensional Crossbar) : A Class of Interconnection Networks for Large Scale Parallel MachinesIEICE technical report. Computer systems 95 (21) Apr 1995Theoretical Performance Analysis of Throughput of Hyper-Crossbar Network全国大会講演論文集 50 (6) Mar 1995Design and comparison of superscalar and VLIW processors by using hardware description language全国大会講演論文集 50 (6) Mar 1995Performance Evaluation of the Pseudo Vector Processor with Interleaved Multi-Bank MemoryIPSJ SIG Notes 95 (29) Mar 1995NAS Parallel Benchmarks Evaluation on CP-PACSIPSJ SIG Notes 95 (28) Mar 1995Adaptive Routing Technique on Hypercrossbar Network and Its EvaluationThe transactions of the Institute of Electronics, Information and Communication Engineers 78 (2) Feb 1995ハイパクロスバ網における適応ルーチングの導入とその評価電子情報通信学会論文誌 J78-D-I (2) Jan 1995[Refereed]ハイパクロスバ・ネットワークにおける転送性能向上のための手法とその評価情報処理学会論文誌 36 (7) Jan 1995[Refereed]INSPIRE : A general purpose network simulator generating system for massively parallel processorsProc. of PERMEAN'95, Beppu Jan 1995[Refereed]Preliminary evaluation of NAS Parallel Benchmarks on CP-PACSProc. of PERMEAN'95, Beppu Jan 1995[Refereed]The Architecture of CP-PACSIPSJ SIG Notes 94 (91) Oct 1994Performance Evaluation of the Pseudo Vector Processor by Simulation全国大会講演論文集 49 (6) Sep 1994Theoretical Performance Analysis of Hyper-Crossbar Network全国大会講演論文集 49 (6) Sep 1994Evaluation of PVP-SW and Hyper-Crossbar Network全国大会講演論文集 49 (6) Sep 1994Extensions for Hyper-Cross networkIEICE technical report. Computer systems 94 (164) Jul 1994Buffer Usage and Performance on Hyper-Crossbar NetworkIEICE technical report. Computer systems 94 (164) Jul 1994NAS Parallel Benchmarks Evaluation on Hyper-Crossbar NetworkIPSJ SIG Notes 94 (68) Jul 1994List Vector Processing on the Pseudo Vector Processor全国大会講演論文集 48 (6) Mar 1994Performance Evaluation of Hyper-Ccrossbar Network with Virtual Cut-Through Routing全国大会講演論文集 48 (6) Mar 1994All Adaptive Routing Algorithm on Hyper-Crossbar Network全国大会講演論文集 48 (6) Mar 1994Evaluation of Pseudo Vector Processor based on Slide-Windowed RegistersProc. of HICSS'94, Honolulu Jan 1994[Refereed]Pseudo Vector Processor for High-speed List Vector Computation with Hiding Memory Access Latency*EMPTY* Jan 1994[Refereed]Superscalar Processor Design with Hardware Description Language AIDLProc. of 2nd Asia Pacific Conf. on Hardware Description Language, Nagoya Jan 1994[Refereed]Pseudo Vector Processor Based on Slide-Windowed RegistersIPSJ Journal 34 (12) Dec 1993A Performance Evaluation of Hyper-Crossbar NetworkIEICE technical report. Computer systems 93 (320) Nov 1993Parallel Sorting Hyper-Crossbar Network全国大会講演論文集 47 (6) Sep 1993Performance Evaluation of a Pseudo Processor based on Slide-Windowed Rigisters全国大会講演論文集 47 (6) Sep 1993Pseudo Vector Processor based on Slide-Windowed RegistersIPSJ SIG Notes 93 (71) Aug 1993Improvement of System Level Description Language AIDL全国大会講演論文集 46 (6) Mar 1993Performance Evaluation of Random Transfer on Networks for Massively Parallel Processing全国大会講演論文集 46 (6) Mar 1993Pseudo Vector Processing based on Slide-Windowed Registers全国大会講演論文集 46 (6) Mar 1993スライドウィンドウ方式による擬似ベクトルプロセッサ情報処理学会論文誌 34 (12) Jan 1993[Refereed]A Scalar Architecture for Pseudo Vector Processing based on Slide-Windowed RegistersProc. of ICS'93, Tokyo Jan 1993[Refereed]The Technology of Cache and Virtual StorageIPSJ Magazine 33 (11) Nov 1992Algorithms for matrix transposing on paraliel proccssing systems with hyper crossbar network全国大会講演論文集 45 (6) Sep 1992List Vector Processing on the Pseudo Vector Processor全国大会講演論文集 45 (6) Sep 1992A Concurrent Program Restructuring System for Scientific CalculationsProc. of HICSS'91, Honolulu Jan 1991[Refereed]Why do experiments and theory disagree on the turbulence transition of the poiseuille flow ?Proc. of 4th Int. Symp. on Computational Fluid Dynamiscs, Davis Jan 1991[Refereed](SM)^2: A Large-Scale Multiprocessor for Sparse Matrix CalculationsIEEE Transactions on Computer 39 (7) Jan 1990[Refereed]Large-scale elastic-plastic indentation simulation via nonequilibrium molecular dynamicsPhysical Review A 42 (10) Jan 1990[Refereed]マルチプロセッサのための科学技術計算用並行記述言語NCC電子情報通信学会論文誌 J72-D-I (10) Jan 1989[Refereed]IMPULSE: A high performance processing unit for multiprocessors for scientific calculationProc. of ISCA'88, Honolulu Jan 1988[Refereed]DIPROS: A distributed processing system for NDL on (SM)^2-IIProc. of HICSS'87, Kona Jan 1987[Refereed](SM)^2-II: The new version of the sparse matrix solving machineProc. of ISCA'85 Jan 1985[Refereed]NDL: A language for solving scientific problems on MIMD machinesProc. of 1st Supercomputing Symposium, Miami Jan 1985[Refereed]Awards & Honors
Jun 2014Program Committee of HEART2014 HEART2014 Best Paper AwardNov 2011ACM 2011 ACM Gordon Bell Prize Best Performance AwardMay 2005情報処理学会平成16年度論文賞May 2004情報処理学会平成15年度論文賞Jan 2003情報処理学会HPCS2003実行委員会 2003年 ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2003), 最優秀論文賞Books etc
Advanced Software Technologies for Post-Peta Scale Computing(Role:Contributor, GPU-Accelerated Language and Communication Support by FPGA)OpenMP: Memory, Devices, and Tasks(Role:Contributor, Design and Preliminary Evaluation of Omni OpenACC Compiler for Massive MIMD Processor PEZY-SC)2013 IEEE 21ST ANNUAL SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS (HOTI)(Role:Contributor, Interconnection Network for Tightly Coupled Accelerators Architecture)2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5(Role:Contributor, RI2N/DRV: Multi-link Ethernet for High-Bandwidth and Fault-Tolerant Network on PC Clusters)EVOLVING OPENMP IN AN AGE OF EXTREME PARALLELISM(Role:Contributor, Evaluation of Multicore Processors for Embedded Systems by Parallel Benchmark Program Using OpenMP)2008 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING(Role:Contributor, RI2N: High-Bandwidth and Fault-Tolerant Network with Multi-link Ethernet for PC Clusters)2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8(Role:Contributor, A dynamic routing control system for high-performance PC cluster with multi-path Ethernet connection)PRACTICAL PROGRAMMING MODEL FOR THE MULTI-CORE ERA, PROCEEDINGS(Role:Contributor, Design and implementation of OpenMPD: An OpenMP-like programming language for distributed memory systems)ADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS(Role:Contributor, Performance improvement by data management layer in a grid RPC system)LARGE-SCALE SCIENTIFIC COMPUTING(Role:Contributor, Computation of high-precision mathematical constants in a combined cluster and grid environment)8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings(Role:Contributor, Low-cost high-bandwidth tree network for PC clusters based on tagged-VLAN technology)8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings(Role:Contributor, Design of a software distributed shared memory system using an MPI communication layer)2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS(Role:Contributor, Performance evaluation of OmniRPC in a grid environment)2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS(Role:Contributor, Heterogeneous remote computing system for computational astrophysics with OmniRPC)COMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS(Role:Contributor, Formation of dwarf galaxies in reionized universe with heterogeneous multi-computer system)HIGH PERFORMANCE COMPUTING(Role:Contributor, RI2N - Interconnection network system for clusters with wide-bandwidth and fault-tolerancy based on-multiple links)EURO-PAR 2002 PARALLEL PROCESSING, PROCEEDINGS(Role:Contributor, A blocking algorithm for parallel 1-D FFT on clusters of PCs)Research Grants & Projects
Performance evaluation of high performance computing on massively parallel processing systemsHigh performance computing on clustersStudy on interoperability of supercomputers on Grid environment
![]() |
氏名 | 高橋 大介 |
Name | Takahashi Daisuke | |
Faculty | ||
Section | ||
Position | Professor | |
Theme | High-performance computing: High-performance numerical algorithms on parallel computers and performance evaluation | |
Related Links | ||
daisuke![]() |
Research Interests
Academic & Professional Experience
Apr 2016-PresentUniversity of Tsukuba Center for Computational Sciences ProfessorMay 2012-Mar 2016University of Tsukuba Faculty of Engineering, Information and Systems ProfessorOct 2011-May 2012University of Tsukuba Faculty of Engineering, Information and Systems Associate ProfessorApr 2007-Sep 2011University of Tsukuba Graduate School of Systems and Information Engineering Associate ProfessorJul 2006-Mar 2007University of Tsukuba Graduate School of Systems and Information Engineering Associate ProfessorJun 2006-Mar 2007Toyohashi University of Technology Faculty of Engineering LecturerApr 2004-Jul 2006University of Tsukuba Graduate School of Systems and Information Engineering Assistant ProfessorOct 2001-Mar 2004University of Tsukuba Institute of Information Sciences and Electronics Assistant ProfessorFeb 2000-Sep 2001Saitama University Graduate School of Science and Engineering Research AssociateApr 2000-Mar 2001Nagoya University Graduate School of Engineering LecturerApr 1999-Jan 2000The University of Tokyo Information Technology Center Research AssociateApr 1997-Mar 1999The University of Tokyo Computer Centre Research AssociatePublished Papers
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 32 (7) Apr 2020[Refereed]RAMANUJAN JOURNAL 51 (1) Jan 2020[Refereed]Proceedings of the IEEE 106 (11) Nov 2018[Refereed]Parallel Computing 75 Jul 2018[Refereed]数学定数の特定の桁を計算するBBP型公式の⾼速計算法⽇本応用数理学会2017年度年会講演予稿集 Sep 2017Xeon Phiプロセッサにおける並列⼀次元実数FFTの実現と評価日本応用数理学会2017年度年会講演予稿集 Sep 2017Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017 Jun 2017[Refereed]Knights Landingクラスタにおける並列FFTの⾃動チューニング2017年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2017論文集 Jun 2017Xeon Phiクラスタ上の並列FFTにおける通信隠蔽の⾃動チューニング計算⼯学講演会論⽂集 22 May 2017Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10404 2017[Refereed]2016 IEEE 10TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC) 201623RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015) 2015Learning Weights of Training Data by Game ResultsIPSJ Journal 55 (11) Nov 2014[Refereed]INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 28 (3) Aug 2014[Refereed]INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 28 (3) Aug 2014[Refereed]JOURNAL OF COMPUTATIONAL CHEMISTRY 35 (18) Jul 2014[Refereed]GPU/MICクラスタにおける疎行列ベクトル積の性能評価IPSJ SIG Notes 2014 (4) May 2014PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I 8384 2014[Refereed]COMPUTER PHYSICS COMMUNICATIONS 184 (9) Sep 2013[Refereed]GPUにおける4倍精度浮動小数点演算を用いたクリロフ部分空間法の高速化IPSJ SIG Notes 2013 (35) Jul 2013GPUクラスタにおける幅優先探索の高速化IPSJ SIG Notes 2013 (12) May 2013GPUにおける高速なCRS形式疎行列ベクトル積の実装IPSJ SIG Notes 2013-HPC-138 (5) Feb 2013Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 7975 (5) 2013[Refereed]Implementation and Evaluation of Triple and Quadruple Precision Floating-point Operations on GPUs情報処理学会論文誌. コンピューティングシステム 6 (1) Jan 2013[Refereed]2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013) 2013Highly scalable implementation of an N-body code on a GPU clusterComputer Physics Communications 184 2013[Refereed]GPUにおける4倍精度演算を用いた疎行列反復解法の実装と評価IPSJ SIG Notes 2012 (37) Dec 2012GPUにおける4倍精度演算を用いた疎行列反復解法の実装と評価IPSJ SIG Notes 2012 (37) Dec 2012大規模GPUクラスタにおけるN体計算コードの演算性能とスケーラビリティの評価IPSJ SIG Notes 2012 (1) Sep 2012並列言語XcalableMPのアクセラレータ向け言語拡張のOpenCL実装IPSJ SIG Notes 2012 (9) Mar 2012PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012 9 2012[Refereed]Implementation and Evaluation of Quadruple Precision BLAS Functions on GPUsAPPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I 7133 (7133) 2012[Refereed]An Implementation of Parallel 2-D FFT Using Intel AVX Instructions on Multi-core ProcessorsProc. 12th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2012), Part II, Lecture Notes in Computer Scienc (7440) 2012[Refereed]2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW) 2012[Refereed]2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW) 2012[Refereed]2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS) 2012[Refereed]15TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2012) / 10TH IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2012) 2012[Refereed]Implementation of Multiple-Precision Floating-Point Arithmetic Library for GPU ComputingProc. 23rd IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2011) Dec 2011[Refereed]GPUによる3倍精度浮動小数点演算の検討IPSJ SIG Notes 2011 (23) Nov 2011GPUによる3倍精度浮動小数点演算の検討IPSJ SIG Notes 2011 (23) Nov 2011GPU上における多倍長精度浮動小数点演算の実装IPSJ SIG Notes 2011 (25) Nov 2011GPU上における多倍長精度浮動小数点演算の実装IPSJ SIG Notes 2011 (25) Nov 2011演算加速装置に基づく超並列クラスタHA-PACSによる大規模計算科学IPSJ SIG Notes 2011 (21) Jul 2011Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPUCOMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2011, PT II 6783 (6783) 2011[Refereed]Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPUIPSJ SIG Notes 2010 (19) Dec 2010Automatic Tuning for Parallel FFTs on Massively Parallel Platforms with Multi-Core Processors(Auto-Tuning for Numerical Computations (continued)) Bulletin of the Japan Society for Industrial and applied Mathematics 20 (4) Dec 2010Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPUIPSJ SIG Notes 2010 (19) Dec 2010The Realization Probability Search Based on Search ResultsTransactions of Information Processing Society of Japan 51 (11) Nov 2010[Refereed]PARALLEL COMPUTING 36 (8) Aug 2010[Refereed]A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCHICGA JOURNAL 33 (2) Jun 2010[Refereed]A massively-parallel electronic-structure calculations based on real-space density functional theoryJOURNAL OF COMPUTATIONAL PHYSICS 229 (6) Mar 2010[Refereed]An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core ProcessorsPARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I 6067 (6067) 2010[Refereed]Break the World Record of π : The Road to 2,576,980,370,000 Decimal DigitsJournal of Information Processing Society of Japan 50 (12) Dec 2009A Shogi Program Based on Monte-Carlo Tree SearchTransactions of Information Processing Society of Japan 50 (11) Nov 2009[Refereed]Implementation and Evaluation of Quadruple Precision BLAS on GPUIPSJ SIG Notes 2009 (13) Nov 2009Implementation and Evaluation of Quadruple Precision BLAS on GPUIPSJ SIG Notes 2009 (13) Nov 2009Application and Performance Evaluation of the Volumetric Parallel 3D-FFT to 3D-RISM on Massively Parallel ClusterIPSJ SIG Notes 2009 (3) Oct 200926aQL-3 Collaboration with Computer ScienceMeeting abstracts of the Physical Society of Japan 64 (2) Aug 200926aQL-3 Collaboration with Computer ScienceMeeting abstracts of the Physical Society of Japan 64 (2) Aug 200926aQL-3 Collaboration with Computer ScienceMeeting abstracts of the Physical Society of Japan 64 (2) Aug 2009Implementation of an Othello Program Based on Monte-Carlo Tree Search by Using a Multi-Core Processor and SIMD Instructions情報処理学会研究報告. GI, [ゲーム情報学] 2009 (7) Jun 2009Design and Power Performance Evaluation of On-chip Memory Processor with Arithmetic Accelerators情報処理学会論文誌. コンピューティングシステム 2 (1) Mar 2009[Refereed]Implementation and Evaluation of Volumetric Parallel 3-D FFT on Massively Parallel Cluster of Multi-Core ProcessorsIPSJ SIG Notes 2009 (14) Feb 2009Implementation and Evaluation of Volumetric Parallel 3-D FFT on Massively Parallel Cluster of Multi-Core ProcessorsIPSJ SIG Notes 2009 (14) Feb 2009Design and Power Performance Evaluation of On-Chip Memory Processor with Arithmetic AcceleratorsProc. 2008 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA 2008) Jan 2009[Refereed]Performance Evaluation of Linpack on T2K-Tsukuba SystemIPSJ SIG Notes 2008 (74) Jul 2008FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE 24 (6) Jun 2008[Refereed]Efficient Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix Multiplication情報処理学会論文誌. コンピューティングシステム 1 (1) Jun 2008[Refereed]2U-4 A Shogi program using Monte-Carlo method全国大会講演論文集 70 (2) Mar 2008Empirical Study for Optimization of Power-Performance with On-Chip MemoryProc. First International Workshop on Advanced Low Power Systems (ALPS 2006), Lecture Notes in Computer Science (4759) Jan 2008[Refereed]A Parallel Algorithm for Multiple-Precision Division by a Single-Precision IntegerProc. 6th International Conference on Large-Scale Scientific Computations (LSSC 2007), Lecture Notes in Computer Science (4818) Jan 2008[Refereed]Power performance evaluation of on-chip memory processor with arithmetic acceleratorsIPSJ SIG Notes 2007 (79) Aug 2007Increasing Neighbour Communication Performance Techniques for the PACS-CS SystemIPSJ SIG Notes 2007 (80) Aug 2007RI2N/UDP : High Bandwidth and Fault-tolerant Network for PC-cluster Based on Multi-link Ethernet(Network)情報処理学会論文誌. コンピューティングシステム 48 (8) May 2007[Refereed]Power-performance Evaluation on Ultra-Low Power High-performance Cluster System: MegaProto/EProc. IEEE Symposium on Low-Power and High-Speed Chips (COOL Chips X) Apr 2007[Refereed]RI2N/UDP: High bandwidth and fault-tolerant network for PC-cluster based on multi-link EhternetProc. 2007 IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), The Workshop on Communication Architecture for Clusters (CAC 2007) Apr 2007[Refereed]A study on arithmetic accelerators for on-chip memory processor情報処理学会研究報告 2007 (17) Mar 2007A study on arithmetic accelerators for on-chip memory processorIPSJ SIG Notes 2007 (17) Mar 2007A study on arithmetic accelerators for on-chip memory processorIPSJ SIG Notes 2007 (17) Mar 2007High performance FFT on SGI Altix 3700HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS 4782 (4782) 2007[Refereed]An implementation of parallel 1-D FFT using SSE3 instructions on dual-core processorsApplied Parallel Computing 4699 (4699) 2007[Refereed]Implementation and evaluation of parallel FFT using SIMD instructions on multi-core processorsINNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS 2007[Refereed]Power Performance Evaluation and Power Performance Optimization on MegaProto/EIPSJ SIG Notes 2006 (106) Oct 2006Dividing program into regions for controlling DVFSIPSJ SIG Notes 2006 (106) Oct 2006High-bandwidth and Fault-tolerant Network for PC Clusters based on Tagged-VLAN and Multi-link Ethernet Technologies(Session 3:Cluster/Grid)IPSJ SIG Notes 2006 (106) Oct 2006Profile-based Optimization of Power Performance by Using Dynamic Voltage Scaling on a PC Cluster(Cluster Systems)IPSJ Transactions on Advanced Computing Systems 47 (12) Sep 2006[Refereed]Reducing Energy of Parallel Programs with Load Imbalance by Using DVS(Cluster Systems)IPSJ Transactions on Advanced Computing Systems 47 (12) Sep 2006[Refereed]VFREC-Net: Multi-path Network for PC Clusters Based on Tagged-VLAN Technology with Driver ControlIPSJ Transactions on Advanced Computing Systems 47 (SIG 12(ACS 15)) Sep 2006[Refereed]Power Performance Optimization using Total Power Profile on a PC clusterIPSJ SIG Notes 2006 (88) Jul 2006Implementation and Performance Evaluation of the Large Scale Cluster PACS-CS for Scientific ComputationIPSJ SIG Notes 2006 (87) Jul 2006A Design of High Performance Communication Library for the PACS-CS SystemIPSJ SIG Notes 2006 (87) Jul 2006Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix MultiplicationIPSJ SIG Notes 2006 (63) Jun 2006EthernetマルチリンクによるPCクラスタ向け耐故障ネットワークRI2N/UDP情報処理学会シンポジウム論文集 2006 (5) May 2006Design and Implementation of Grid RPC System Integrating Computing Resources on Multiple Grid-enabled Job Execution Systems(Grid System)情報処理学会論文誌. コンピューティングシステム 47 (7) May 2006[Refereed]Report on SC|05計算工学 = Journal of The Japan Society for Computational Engineering and Science (JSCES) 11 (2) Apr 2006Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC clusterProc. 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2006), The Second Workshop on High-Performance, Power-Aware Computing (HP-PAC 2006) Apr 2006[Refereed]MegaProto/E: Power-Aware High-Performance Cluster with Commodity TechnologyProc. 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2006), The Second Workshop on High-Performance, Power-Aware Computing (HP-PAC 2006) Apr 2006[Refereed]Reducing energy of parallel programs with load imbalance by using DVSIPSJ SIG Notes 2006 (20) Feb 2006Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC clusterIPSJ SIG Notes 2006 (20) Feb 2006Reducing energy of parallel programs with load imbalance by using DVSIPSJ SIG Notes 2006 (20) Feb 2006RI2N/UDP: Fault-tolerant network for PC-clusters based on multi-link EthernetIPSJ SIG Notes 2006 (20) Feb 2006RI2N/UDP: Fault-tolerant network for PC-clusters based on multi-link EthernetIPSJ SIG Notes 2006 (20) Feb 2006Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC clusterIPSJ SIG Notes 2006 (20) Feb 2006PACS-CS: A large-scale bandwidth-aware PC cluster for scientific computationsSIXTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID 2006[Refereed]Formation of dwarf galaxies in reionized universe with heterogeneous multicomputer systemINTERNATIONAL JOURNAL FOR MULTISCALE COMPUTATIONAL ENGINEERING 4 (2) 2006Computation of high-precision mathematical constants in a combined cluster and grid environmentLARGE-SCALE SCIENTIFIC COMPUTING 3743 (3743) 2006[Refereed]A parallel method for large sparse generalized eigenvalue problems by OmniRPC in a grid environmentAPPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING 3732 (3732) 2006[Refereed]An implementation of parallel 3-D FFT using short vector SIMD instructions on clusters of PCsAPPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING 3732 (3732) 2006[Refereed]Performance improvement by data management layer in a grid RPC systemADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS 3947 (3947) 2006[Refereed]A hybrid MPI/OpenMP implementation of a parallel 3-D FFT on SMP clustersPARALLEL PROCESSING AND APPLIED MATHEMATICS 3911 (3911) 2006[Refereed]Emprical study on reducing energy of parallel programs using slack reclamation by DVFS in a power-scalable high performance cluster2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2 2006[Refereed]Design of a Software Distributed Shared Memory System using an MPI communication layerProc. 8th International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN 2005) 46 (7) Dec 2005[Refereed]Design of a Software Distributed Shared Memory System using an MPI communication layerProc. 8th International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN 2005) 46 (7) Dec 2005[Refereed]MegaProto: 1TFlops/10kW Rack Is Feasible Even with Only Commodity TechnologyProc. 2005 ACM/IEEE Conference on Supercomputing (SC|05) Nov 2005[Refereed]Performance Improvement by Initial Data Management on Grid RPC System OmniRPCIPSJ SIG Notes 2005 (97) Oct 2005High-bandwidth Tree Network for PC Clusters based on Tagged-VLAN TechnologyIPSJ SIG Notes 2005 (97) Oct 2005Optimization and Evaluation of Power Performance by Using On-Chip RAMIPSJ SIG Notes 2005 (80) Aug 2005Optimization of Power-Performance by controlling DVS on a PC clusterIPSJ SIG Notes 2005 (80) Aug 2005MegaProto : A Low-power and Compact Cluster for High-performance Computing(HPC Hardware)情報処理学会論文誌. コンピューティングシステム 46 (12) Aug 2005[Refereed]Design and Implementation of a Grid RPC System on Multiple Grid MiddlewaresIPSJ SIG Notes 2005 (81) Aug 2005"FIRST"-a hybrid cluster system for the elucidation on the origin of FIRST generation objects in the universeIPSJ SIG Notes 2005 (81) Aug 2005OmniRPC Grid Parallel Programming Environment for a Large Scale Numerical ComputationProc. 17th IMACS World Congress Scientific Computation, Applied Mathematics and Simulation Jul 2005APPLIED MATHEMATICS AND COMPUTATION 166 (2) Jul 2005[Refereed]A Master-worker Type Parallel Method for Large-scale Eigenvalue ProblemsIPSJ Transactions on Advanced Computing Systems 46 (SIG 7(ACS 10)) May 2005[Refereed]MegaProto: A Low-Power and Compact Cluster for High-Performance ComputingProc. 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05), Workshop on High Performance, Power-Aware Computing (HPPAC) 162 Apr 2005[Refereed]Grid environment for computational astrophysics driven by GRAPE-6 with HMCS-G and OmniRPCProc. 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05), Joint Workshop on High-Performance Grid Computing & High-Level Parallel Programming Models (HIPS-HPGC) Apr 2005[Refereed]MegaProto: A Low-Power and Compact Cluster for High-Performance ComputingProc. 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05), Workshop on High Performance, Power-Aware Computing (HPPAC) Apr 2005[Refereed]MegaProto : A Low-Power and Compact Cluster for High-Performance ComputingIPSJ SIG Notes 2005 (19) Mar 2005MegaProto : A Low-Power and Compact Cluster for High-Performance Computing情報処理学会研究報告. ARC,計算機アーキテクチャ研究会報告 162 (19) Mar 2005Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN 2005 2005[Refereed]MegaProto: A Low-Level and Compact Cluster for High-Performance ComputingProc. of HP-PAC05 (in IPDPS2005), Denver Jan 2005[Refereed]Design of a software distributed shared memory system using an MPI communication layer8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings 2005[Refereed]Design of a software distributed shared memory system using an MPI communication layer8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings 46 (7) 2005[Refereed]Design of a software distributed shared memory system using an MPI communication layer8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings 2005[Refereed]Computing Environment Independent Interface for Matrix Computation LibraryIPSJ SIG Notes 2004 (128) Dec 2004OpenMPI --- OpenMP like tool for easy programming in MPIProc. 6th European Workshop on OpenMP (EWOMP 2004) Nov 2004[Refereed]Implementation and Evaluation of Parallel FFT Using Short Vector SIMD Instructions(Performance Optimization)情報処理学会論文誌. コンピューティングシステム 45 (11) Oct 2004[Refereed]Measurement of Microprocessor's Power Consumption and Prototyping Low Power Cluster with Low Power Processors(Power Conservation)情報処理学会論文誌. コンピューティングシステム 45 (SIG11(ACS7)) Oct 2004[Refereed]Design of Grid RFC System OmniRPC on XtremWeb P2P GridIPSJ SIG Notes 2004 (81) Jul 2004Implementation and Performance Evaluation of CONFLEX-G: Grid-enabled Molecular Conformational Space Search Program with OmniRPCProc. 18th International Conference on Supercomputing (ICS'04) Jun 2004[Refereed]SCIMA-SMP: on-chip memory processor architecture for SMPProc. of 3rd Workshop on Memory Performance Issues (WMPI-2004) in ISCA2004, Munich Jun 2004[Refereed]Implementation and Performance Evaluation of CONFLEX-G : A Grid Enabled Conformational Space Search Program by OmniRPC(Grid Applications)情報処理学会論文誌. コンピューティングシステム 45 (6) May 2004[Refereed]Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous Clusters(Numerical Computation)情報処理学会論文誌. コンピューティングシステム 45 (SIG06(ACS6)) May 2004OmniRPCによるグリッド環境での大規模固有値問題の並列解法 (数値解析と新しい情報技術)RIMS Kokyuroku 1362 Apr 2004Parallel Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous ClustersProc. 18th International Parallel and Distributed Processing Symposium (IPDPS'04), The 13th Heterogeneous Computing Workshop (HCW 2004) Apr 2004[Refereed]Software Distributed Shared Memory System on MPIIPSJ SIG Notes 2004 (38) Apr 2004Measurement and Characterization for Power Consumption of Microprocessors for Power-aware ClusterProc. An International Symposium on Low-Power and High-Speed Chips (COOL Chips VII) Apr 2004[Refereed]Heterogeneous remote computing system for computational astrophysics with OmniRPC2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS 2004[Refereed]Formation of dwarf galaxies in reionized universe with heterogeneous multi-computer systemCOMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS 3039 2004[Refereed]Performance evaluation of OmniRPC in a grid environment2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS 2004[Refereed]Formation of dwarf galaxies in reionized universe with heterogeneous multi-computer systemCOMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS 3039 (2) 2004[Refereed]Implementation of First Touch page allocation on Omni/SCASHIPSJ SIG Notes 2003 (84) Aug 2003HMCS-G: Grid-enabled Hybrid Computing System for Computatinal AstrophysicsIPSJ Transactions on Advanced Computing Systems 44 (SIG 11(ACS 3)) Aug 2003[Refereed]OmniRPC: A Grid RPC System for Parallel Programming in Grid EnvironmentIPSJ Transactions on Advanced Computing Systems 44 (SIG 11(ACS 3)) Aug 2003[Refereed]Low Power Cluster using Low Power CPUIPSJ SIG Notes 2003 (84) Aug 2003Performance evalusation of RI2N-Interconnection network system for clusters with wide-bandwidth and fault-tolerancyIPSJ SIG Notes 2003 (83) Aug 2003Perfomance Evaluation of Grid Applications by OmniRPC in Wide Area NetworkIPSJ SIG Notes 2003 (83) Aug 2003HMCS-G : Grid-enabled Hybrid Computing System for Computational Astrophysics (Grid Applications)IPSJ Transactions on Computing Systems 44 (11) Aug 2003PARALLEL COMPUTING 29 (6) Jun 2003[Refereed]Performance evaluation of the Hitachi SR8000 using SPEC OMP2001 benchmarksINTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING 31 (3) Jun 2003[Refereed]Remote accessing environment of GRAPE-6 gravity engineIPSJ SIG Notes 2003 (62) Jun 2003高バンド幅/耐故障性を持つクラスタ向け結合ネットワークRI2N情報処理学会シンポジウム論文集 2003 (8) May 2003SMP Configuration and Performance Evaluation of SCIMA --- On-chip Memory Processor Architecture for HPCIPSJ Transactions on Advanced Computing Systems 44 (SIG 6(ACS 1)) May 2003[Refereed]COMPUTER PHYSICS COMMUNICATIONS 152 (2) May 2003[Refereed]SMP Configuration and Performance Evaluation of SCIMA On-chip Memory Processor Architecture for HPC情報処理学会論文誌. コンピューティングシステム 44 (6) May 2003RI2N - Interconnection network system for clusters with wide-band width and fault-tolerancy based on multiple・linksIPSJ SIG Notes 2003 (29) Mar 2003Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous ClustersIPSJ SIG Notes 2003 (29) Mar 2003HMCS-G: Grid-enabled hybrid computing system for computational astrophysicsCCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS 2003[Refereed]OmniRPC: A grid RPC system for parallel programming in cluster and grid environmentCCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS 2003[Refereed]An OpenMP implementation of parallel FFT and its performance on IA-64 processorsOPENMP SHARED MEMORY PARALLEL PROGRAMMING 2716 (2716) 2003[Refereed]A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS 2 2003[Refereed]OmniRPC: A grid RPC system for parallel programming in cluster and grid environmentCCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS 44 (SIG 11(ACS 3)) 2003[Refereed]RI2N - Interconnection network system for clusters with wide-bandwidth and fault-tolerancy based on-multiple linksHIGH PERFORMANCE COMPUTING 2858 (2858) 2003[Refereed]A Feasibility Study on an Itanium-based ClusterIPSJ SIG Notes 2002 (99) Oct 2002Parallel Forward Deduction System for General-Purpose Entailment Calculus on Clusters of PCsProc. IASTED International Conference on Parallel and Distributed Computing, Applications and Technologies (NPDPA 2002) Oct 2002[Refereed]OmniRPC : a Grid RPC System for Parallel Programming in Grid EnvironmentIPSJ SIG Notes 2002 (99) Oct 2002A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCsIPSJ Transactions on High Performance Computing Systems 43 (SIG 6(HPS 5)) Sep 2002[Refereed]Hybrid Parallelization for SPAM Particle Simulation on SMP-PC Clusters情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002Hybrid Parallelization for SPAM Particle Simulation on SMP-PC ClustersIPSJ Transactions on High Performance Computing Systems 43 (SIG6(HPS 5)) Sep 2002[Refereed]Improving Performance of Automated Forward Deduction System EnCal on Shared-Memory Parallel ComputersProc. Third International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT 2002) Sep 2002[Refereed]A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCs情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002SMP configuration and performance evaluation of SCIMA : on-chip memory processor architecture for HPCIPSJ SIG Notes 2002 (81) Aug 2002Performance Evaluation of Omni/SCASH Software Distributed Shared Memory System on Ethernet-based ClusterIPSJ SIG Notes 2002 (80) Aug 2002Performance Evaluation of the Hitachi SR8000 Using OpenMP BenchmarksProc. 4th International Symposium on High Performance Computing (ISHPC 2002), Lecture Notes in Computer Science (2327) May 2002[Refereed]A Blocking Algorithm for Parallel FFT on Shared-memory Parallel ComputersIPSJ Journal 43 (4) Apr 2002[Refereed]A Blocking Algorithm for Parallel FFT on Shared-memory Parallel Computers(Parallel Processing) IPSJ Journal 43 (4) Apr 2002Performance Evaluation of the Hitachi SR8000 Under OpenMP BenchmarksIPSJ SIG Notes 2002 (22) Mar 2002Performance Evaluation of the Hitachi SR8000 Under OpenMP BenchmarksIPSJ SIG Notes 2002 (22) Mar 2002A blocking algorithm for parallel 1-D FFT on shared-memory parallel computersAPPLIED PARALLEL COMPUTING 2367 (2367) 2002[Refereed]A blocking algorithm for parallel 1-D FFT on clusters of PCsEURO-PAR 2002 PARALLEL PROCESSING, PROCEEDINGS 2400 (2400) 2002[Refereed]Parallel Forward Deduction Algorithms of General-Purpose Entailment Calculus on Shared-Memory Parallel ComputersProc. 2nd International Conference on Software Engineering, Artificial Intelligence, Networking & Parallel/Distributed Computing (SNPD'01) Aug 2001[Refereed]A Blocking Algorithm for Parallel FFT on SMP ClustersIPSJ SIG Notes 2001 (77) Jul 2001A Blocking Algorithm for FFT on Cache-Based ProcessorsProc. 9th International Conference on High Performance Computing and Networking Europe (HPCN Europe 2001), Lecture Notes in Computer Science (2110) Jun 2001[Refereed]An Extended Split-Radix FFT AlgorithmIEEE Signal Processing Letters 8 (5) May 2001[Refereed]A Mixed-Radix Parallel Three-Dimensional FFT Algorithm on Clusters of Vector SMPsProc. Tenth SIAM Conference on Parallel Processing for Scientific Computing (PP01) Mar 2001[Refereed]A Parallel 3-D FFT Algorithm on Clusters of Vector SMPsProc. 5th International Workshop on Applied Parallel Computing (PARA 2000), Lecture Notes in Computer Science (1947) Jan 2001[Refereed]A Performance Study on a Single Processing Node of the HITACHI SR8000Proc. Second International Conference on Numerical Analysis and Its Applications (NAA 2000), Lecture Notes in Computer Science (1988) Jan 2001[Refereed]A fast algorithm for computing large Fibonacci numbersINFORMATION PROCESSING LETTERS 75 (6) Nov 2000[Refereed]An Extended Split-Radix FFT AlgorithmIPSJ SIG Notes 2000 (93) Oct 2000Efficient Implementation of CG & CR Methods for Linear Systems on a Single Processing Node of HITACHI SR8000Proc. 2000 International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC2000) Jul 2000[Refereed]A New Radix-8 FFT Kernel Suitable for Multiply-Add InstructionIPSJ Journal 41 (7) Jul 2000[Refereed]A Fast Algorithm for Computing Fibonacci NumbersIPSJ Journal 41 (6) Jun 2000A Fast Algorithm for Computing Fibonacci NumbersIPSJ Journal 41 (6) Jun 2000[Refereed]A Divide and Rationalize Method which Improves the Multiple-Precision Function Computation with Series ExpansionIPSJ Journal 41 (6) Jun 2000[Refereed]High-Performance Parallel FFT Algorithms for the HITACHI SR8000Proc. Fourth International Conference/Exhibition on High-Performance Computing in Asia-Pacific Region (HPC-Asia 2000) 1 May 2000[Refereed]High-performance radix-2, 3 and 5 parallel 1-D complex FFT algorithms for distributed-memory parallel computersJOURNAL OF SUPERCOMPUTING 15 (2) Feb 2000[Refereed]A new radix-6 FFT algorithm suitable for multiply-add instruction2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI 6 2000[Refereed]Implementation of multiple-precision parallel division and square root on distributed-memory parallel computers2000 INTERNATIONAL WORKSHOPS ON PARALLEL PROCESSING, PROCEEDINGS 2000[Refereed]Fast High-Precision Arithmetic on Distributed Memory Parallel MachinesProc. Ninth SIAM Conference on Parallel Processing for Scientific Computing Mar 1999[Refereed]Calculation of pi to 51.5 Billion Decimal Digits on Distributed Memory Parallel ProcessorsTransactions of Information Processing Society of Japan 39 (7) Jul 1998[Refereed]Implementation and Evaluation of Radix-2, 3 and 5 1-D FFT on Distributed Memory Parallel ComputersTransactions of Information Processing Society of Japan 39 (3) Mar 1998[Refereed]Improvement of the Algorithms for pi Calculation: The Gauss-Legendre Algorithm and the Borwein's Quartically COnvergent AlgorithmTransactions of Information Processing Society of Japan 38 (11) Nov 1997[Refereed]An Implementation of Factorization on Massively Parallel SIMD ComputersTransactions of Information Processing Society of Japan 36 (11) Nov 1995[Refereed]Awards & Honors
Jul 2016NVIDIA Best Paper Award 16th International Conference on Computational Science and Its Applications (ICCSA 2016)Nov 2011Association for Computing Machinery ACM Gordon Bell PrizeApr 2010Ministry of Education, Culture, Sport, Science and Technology The Commendation for Science and Technology by the Minister of Education, Culture, Sports, Science and TechnologyNov 2009情報処理学会第14回ゲームプログラミングワークショップ優秀論文賞Nov 2008情報処理学会第13回ゲームプログラミングワークショップ優秀論文賞May 2004Information Processing Society of Japan IPSJ Best Paper AwardJan 2003情報処理学会 情報処理学会2003年ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2003)最優秀論文賞May 1999Information Processing Society of Japan IPSJ Best Paper AwardOct 1998Information Processing Society of Japan IPSJ Yamashita SIG Research AwardBooks etc
計算科学のためのHPC技術2(Role:Contributor, 計算科学のためのHPC技術2)COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II(Role:Contributor, Parallel Sparse Matrix-Vector Multiplication Using Accelerators)COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II(Role:Contributor, Implementation of Multiple-Precision Floating-Point Arithmetic on Intel Xeon Phi Coprocessors)COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2013, PT V(Role:Contributor, Optimization of Sparse Matrix-Vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs)PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012(Role:Contributor, A Fast Implementation and Performance Analysis of Collisionless N-body Code Based on GPGPU)Software Automatic Tuning: From Concepts to State-of-the-Art Results(Role:Sole author)PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I(Role:Contributor, An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core Processors)IT Text HPCプログラミング(Role:Sole author)LARGE-SCALE SCIENTIFIC COMPUTING(Role:Contributor, A parallel algorithm for multiple-precision division by a single-precision integer)HIGH-PERFORMANCE COMPUTING(Role:Contributor, Empirical study for optimization of power-performance with on-chip memory)INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS(Role:Contributor, Implementation and evaluation of parallel FFT using SIMD instructions on multi-core processors)Applied Parallel Computing - STATE OF THE ART IN SCIENTIFIC COMPUTING(Role:Contributor, An implementation of parallel 1-D FFT using SSE3 instructions on dual-core processors)Sixth IEEE International Symposium on Cluster Computing and the Grid - SPANNING THE WORLD AND BEYOND(Role:Contributor, PACS-CS: A large-scale bandwidth-aware PC cluster for scientific computations)PARALLEL PROCESSING AND APPLIED MATHEMATICS(Role:Contributor, A hybrid MPI/OpenMP implementation of a parallel 3-D FFT on SMP clusters)ADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS(Role:Contributor, Performance improvement by data management layer in a grid RPC system)APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING(Role:Contributor, A parallel method for large sparse generalized eigenvalue problems by OmniRPC in a grid environment)APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING(Role:Contributor, A parallel method for large sparse generalized eigenvalue problems by OmniRPC in a grid environment)LARGE-SCALE SCIENTIFIC COMPUTING(Role:Contributor, Computation of high-precision mathematical constants in a combined cluster and grid environment)2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2(Role:Contributor, Emprical study on reducing energy of parallel programs using slack reclamation by DVFS in a power-scalable high performance cluster)8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings(Role:Contributor, Design of a software distributed shared memory system using an MPI communication layer)8th International Symposium on Parallel Architectures, Algorithms and Networks, Proceedings(Role:Contributor, Design of a software distributed shared memory system using an MPI communication layer)Parallel and Distributed Scientific and Engineering Computing: Practice and Experience(Role:Sole author)2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS(Role:Contributor, Heterogeneous remote computing system for computational astrophysics with OmniRPC)2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET WORKSHOPS, PROCEEDINGS(Role:Contributor, Heterogeneous remote computing system for computational astrophysics with OmniRPC)COMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS(Role:Contributor, Formation of dwarf galaxies in reionized universe with heterogeneous multi-computer system)HIGH PERFORMANCE COMPUTING(Role:Contributor, RI2N - Interconnection network system for clusters with wide-bandwidth and fault-tolerancy based on-multiple links)EURO-PAR 2002 PARALLEL PROCESSING, PROCEEDINGS(Role:Contributor, A blocking algorithm for parallel 1-D FFT on clusters of PCs)Research Grants & Projects
メニーコア超並列クラスタにおける有理数演算ライブラリに関する研究基盤研究(C)Research period: 2016 - 2018エクサスケール計算環境に向けた高速フーリエ変換のアルゴリズムに関する研究基盤研究(C)Research period: 2012 - 2014大規模並列環境における数値計算アルゴリズム新学術領域研究Research period: 2010 - 2014ペタスケール計算環境に向けた高速フーリエ変換のアルゴリズムに関する研究若手研究(B)Research period: 2010 - 2011メニーコア超並列クラスタに向けた高速フーリエ変換のアルゴリズムに関する研究若手研究(B)Research period: 2008 - 2009ヘテロジニアス環境における高速フーリエ変換の並列アルゴリズムに関する研究若手研究(A)Research period: 2004 - 2006大規模クラスタにおける並列FFTライブラリの開発出資金による受託研究Research period: Aug 2004 - Jan 2005PCクラスタにおける高速フーリエ変換の並列アルゴリズムに関する研究若手研究(B)Research period: 2002 - 2003並列計算機における高速フーリエ変換のアルゴリズムに関する研究奨励研究(A)Research period: 2000 - 2001並列計算機による高精度数学定数の高速計算法に関する研究奨励研究(A)Research period: 1999Study on High Performance Computing
![]() |
氏名 | 建部 修見 |
Name | Tatebe Osamu | |
Faculty | ||
Section | ||
Position | Professor | |
Theme | High Performance Computing, Grid Computing, Distributed File System | |
Related Links | ||
tatebe![]() |
Research Interests
Academic & Professional Experience
Apr 2015-PresentUniversity of Tsukuba ProfessorApr 2006-Mar 2015University of Tsukuba Associate ProfessorOct 2005-Mar 2006National Institute of Advanced Industrial Science and Technology (AIST) Senior ResearcherApr 2001-Sep 2005National Institute of Advanced Industrial Science and Technology (AIST) ResearcherApr 1997-Mar 2001Electrotechnical Laboratory ResearcherPublished Papers
Prooceedings of the IEEE International Workshop on High-Performance Storage May 2020[Refereed]Journal of Computer Science and Technology 35 (1) Jan 2020[Refereed]Scalable Distributed Metadata Server Based on Nonblocking TransactionsJournal of Universal Computer Science 26 (1) Jan 2020[Refereed]IEICE Transactions on Information and Systems E102-D (12) Dec 2019[Refereed]Proceedings of the 6th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies Dec 2019[Refereed]Proceedings of the 6th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies Dec 2019[Refereed]GHOSTZ PW/GF: Distributed Parallel Homology Search System for Large-scale Metagenomic AnalysisProceedings of the Third IEEE International Workshop on Benchmarking, Performance Tuning and Optimization for Big Data Applications Dec 2019[Refereed]International Journal of Networking and Computing 9 (2) Jul 2019[Refereed]Proceedings of the 6th Workshop on Scalable Cloud Data Management Dec 2018[Refereed]Proceedings of 2018 Sixth International Symposium on Computing and Networking Workshops (CANDARW) Nov 2018[Refereed]Proceedings of 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS) Oct 2018[Refereed]pbdMPIを用いたエントロピー推定プログラムの並列化と性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-163 (13) Mar 2018ACM International Conference Proceeding Series Jan 2018[Refereed]Proceedings of 2018 IEEE International Conference on Cluster Computing (CLUSTER) 2018[Refereed]Proceedings of the 2nd IEEE International Workshop on Big Data and IoT Security in Smart Computing 2018[Refereed]Oakforest-PACSにおけるIO-500の評価研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-162 (6) Dec 2017すばるHSCパイプラインのPwrake/Gfarmによる高速化手法の提案研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-162 (9) Dec 2017Pwrake/Gfarmによる分散並列相同性検索システムの提案研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-162 (10) Dec 2017Oracle Storage Cloudの性能評価とGfarmファイルシステムへの組み込みの検討研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-162 (11) Dec 2017Burst BufferのためのGfarmファイルシステム研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-161 (2) Sep 2017並列離散イベントシミュレータを用いた分散メタデータサーバのベンチマーク研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-160 (26) Jul 2017同時複数タスク実行フレームワークSMTEFを用いたメニータスク並列ベンチマークとその評価研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-160 (33) Jul 2017RDMAの適用によるRAMPトランザクション処理の高速化情報処理学会論文誌データベース 10 (2) Jun 2017[Refereed]IEEE SYSTEMS JOURNAL 11 (2) Jun 2017[Refereed]IEEE SYSTEMS JOURNAL 11 (2) Jun 2017[Refereed]GfarmファイルシステムにおけるRDMAアクセスの設計情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-158 (12) Mar 2017Journal of Information Processing 25 2017[Refereed]Efficient Parallel Summation on Encrypted Database System2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP) 2017[Refereed]Accelerating Read Atomic Multi-partition Transaction with Remote Direct Memory Access2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP) 2017[Refereed]PPFS: a Scale-out Distributed File System for Post-Petascale SystemsProceedings of IEEE International Conference on Data Science Systems (DSS) Dec 2016[Refereed]Fault Tolerance of Pwrake Workflow System Supported by Gfarm File SystemProceedings of 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) Nov 2016[Refereed]Fast Window Aggregate on Array Database by Recursive Incremental ComputationThe IEEE 12th International Conference on eScience, Aug 2016[Refereed]JOURNAL OF SUPERCOMPUTING 72 (5) May 2016[Refereed]International Journal of High Performance Computing and Networking 9 (3) 2016[Refereed]Real-time 3D Visualization of Phased Array Weather Radar Data via Concurrent Processing in Science Cloud7TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE IEEE IEMCON-2016 2016[Refereed]Multiple Streams of UDT and HpFP Protocols for High-bandwidth Remote Storage System in Long Fat Network7TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE IEEE IEMCON-2016 2016[Refereed]PROCEEDINGS OF 7TH INTERNATIONAL WORKSHOP ON DATA-INTENSIVE COMPUTING IN THE CLOUDS (DATACLOUD 2016) 2016[Refereed]Journal of Information Processing 24 (5) 2016[Refereed]Journal of Information Processing 24 (6) 2016[Refereed]PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS) 2016[Refereed]広域分散ファイルシステムにおけるUDTマルチストリームファイル転送ツール電子情報通信学会論文誌D J99-D (5) 2016[Refereed]Three-Dimensional Spatial Join Count Exploiting CPU Optimized STR R-Tree2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) 2016PROCEEDINGS OF 7TH INTERNATIONAL WORKSHOP ON DATA-INTENSIVE COMPUTING IN THE CLOUDS (DATACLOUD 2016) 2016[Refereed]EURO-PAR 2015: PARALLEL PROCESSING WORKSHOPS 9523 2015[Refereed]2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015 2015[Refereed]2015 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM) 2015[Refereed]2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND DATA INTENSIVE SYSTEMS 2015[Refereed]2015 IEEE 21ST INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) 2015[Refereed]21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9 664 2015[Refereed]Cluster-wide RAID向けの集中型コントローラIPSJ SIG Notes 2014 (14) Dec 2014分散ファイルシステムGfarmにおけるMTCアプリケーションの性能予測モデルの構築IPSJ SIG Notes 2014 (13) Dec 2014科学クラウドを活用した3D降雨レーダのリアルタイム3次元可視化(高速スキャンレーダーによる激しい大気現象の観測:現状と将来展望,スペシャル・セッション)大会講演予講集 106 Sep 2014Cluster-wide RAIDの実装と評価IPSJ SIG Notes 2014 (31) Jul 2014WebGfarm: 分散ファイルシステムGfarmのHTTPインタフェースIPSJ SIG Notes 2014 (34) Jul 2014High-performance Processing of Sensing Data via NICT Science CloudTechnical report of IEICE. SANE 114 (87) Jun 2014Wide-area Distributed Storage System on The NICT Science CloudIEICE technical report. SC, Services Computing 114 (50) May 2014The NICT Science Cloud-A Proposal of Cloud System for Scienti.c Researches-JAXA research and development report 13 Mar 2014High Performance Visualization Processing of Large-Scale Computer Simulation Data via NICT Science CloudJAXA research and development report 13 Mar 2014不揮発性デバイス向けObject Storageの実装と評価IPSJ SIG Notes 2014 (1) Feb 2014ワークフローシステムPwrakeにおけるI/O性能を考慮したタスクスケジューリングIPSJ SIG Notes 2014 (3) Feb 2014Incremental Window Aggregates over Array Database2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) 2014[Refereed]IEICE Communications Express 3 (2) 2014[Refereed]A Web Application of Interdisciplinary Data Analysis Designed for ICSU World Data SystemJournal of Japan Society of Information and Knowledge 24 (3) 2014Supercomputing Frontiers and Innovations 1 (2) 2014[Refereed]Proceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2014 2014[Refereed]2014 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) 2014[Refereed]Proceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2014 2014[Refereed]New techniques of high-speed data processing for spacecraft observation data via NICT Science Cloud(ICSANE 2013(International Conference on Space, Aeronautical and Navigational Electronics)Technical report of IEICE. SANE 113 (335) Nov 2013Basic Evaluation of the Parallel File Transfer Tool with UDTIEICE technical report. Internet Architecture 113 (256) Oct 2013NICT Science Cloud : Big-data Analyses via Parallel and Distributed Processing Technique for Earth and Space ScienceTechnical report of IEICE. PRMU 113 (230) Sep 2013NICT Science Cloud : Data Collection, Database and Data Processing of Global Earth and Space Observation NetworksTechnical report of IEICE. PRMU 113 (230) Sep 2013不揮発性デバイス向けのObject Storageの設計IPSJ SIG Notes 2013 (12) Jul 2013Fault Tolerance Design for Hadoop MapReduce on Gfarm Distributed FilesystemIPSJ SIG Notes 2013 (14) Jul 2013データ配置を考慮したタスクスケジューリングIPSJ SIG Notes 2013 (19) Jul 2013The NICT Science Cloud : A Proposal of Cloud System for Scientific ResearchesIEICE technical report. SC, Services Computing 113 (86) Jun 2013High Performance of Big Data Processing via NICT Science CloudIEICE technical report. SC, Services Computing 113 (86) Jun 2013NICT Science Cloud : An Extension of Security Functions in a Wide-Area Distribution File SystemIEICE technical report. SC, Services Computing 113 (86) Jun 2013A Parallel Processing Technique on the NICT Science Cloud via Gfarm/PwrakeIPSJ SIG Notes 2013 (9) May 2013Parallel File Transfer using UDTIPSJ SIG Notes 2013 (8) May 2013A Science Cloud for Data Intensive SciencesData Science Journal 12 Mar 2013[Refereed]広域分散ファイルシステムBlobSeer-wan/HGMDSの設計と評価IPSJ SIG Notes 2013-HPC-138 (22) Feb 2013RDMAによる低オーバヘッドファイルアクセスと冗長記録IPSJ SIG Notes 2012 (29) Dec 2012Design of Authentication System for High Performance Distributed Computing Environment情報処理学会論文誌. コンピューティングシステム 5 (5) Oct 2012[Refereed]Workflow Scheduling to Minimize Data Movement using Multi-constraint Graph PartitioningProceedings of IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) May 2012[Refereed]Large-scale data processing with Pwrake, a parallel and distributed workflow system (Journal of Space Science Informatics Japan No.1)JAXA research and development report 11 Mar 2012Study of a file system for high-speed storageIPSJ SIG Notes 2012 (17) Mar 20122012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC) 2012[Refereed]MPI-IO/Gfarm: An Implementation and Evaluation of MPI-IO for the Gfarm File SystemTransactions of Information Processing Society of Japan 52 (12) Dec 2011[Refereed]Non-blocking RPCを用いた遠隔ファイルアクセスの実装と性能評価IPSJ SIG Notes 2011 (16) Nov 2011Non-blocking RPCを用いた遠隔ファイルアクセスの実装と性能評価IPSJ SIG Notes 2011 (16) Nov 2011Optimization of Remote File Access Considering Access Pattern and Network Delay論文誌コンピューティングシステム(ACS) 4 (4) Oct 2011[Refereed]Using the Gfarm File System as a POSIX compatible storage platform for Hadoop MapReduce applicationsProceedings of 12th IEEE/ACM International Conference on Grid Computing (Grid 2011) Sep 2011[Refereed]演算加速装置に基づく超並列クラスタHA-PACSによる大規模計算科学IPSJ SIG Notes 2011 (21) Jul 2011Non-blocking RPCを用いた遠隔ファイルアクセスの最適化IPSJ SIG Notes 2011 (30) Jul 2011Development and Evaluation of a Distributed Storage System Improved Fault ResistanceIPSJ SIG Notes 2011 (36) Jul 2011New Method of Task Assignment for Mininum Data Movement in WorkflowIPSJ SIG Notes 130 (61) Jul 2011Design Overview of System Software and Shared Global Storage in HPCI Wide Area Distributed EnvironmentIPSJ SIG Notes 130 (67) Jul 2011MPI-IO/Gfarmにおけるデータ配置を考慮したプロセススケジューリングの検討IPSJ SIG Notes 2011 (33) Jul 2011分散ファイルシステムにおけるメタデータサーバの冗長化手法の検討IPSJ SIG Notes 130 (37) Jul 2011Necessity and examination of large-scale storage system in Cloud-System of ExaBytes class that enhances Gfarm v2.4-IPSJ SIG Notes 2011 (34) Jul 2011COMPUTER PHYSICS COMMUNICATIONS 182 (6) Jun 2011[Refereed]The Gfarm File System on Compute CloudsProceedings of 1st International Workshop on Data Intensive Computing in the Clouds (DataCloud 2011) May 2011[Refereed]アクセスパターンと回線遅延を考慮した遠隔ファイルアクセスの最適化先進的計算基盤システムシンポジウムSACSIS 2011論文集 May 2011[Refereed]POSIX準拠の広域分散ファイルシステムGfarm上でのHadoop MapReduceアプリケーション先進的計算基盤システムシンポジウムSACSIS 2011論文集 May 2011[Refereed]MPI-IO/Gfarm: An Optimized Implementation of MPI-IO for the Gfarm File SystemProceedings of 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) May 2011[Refereed]StableSearch: A Searchable File Content Metadata System for the Gfarm File SystemIPSJ SIG Notes 2011 (7) Mar 2011Performance Monitoring and Bottleneck Identification for a Distributed File SystemIPSJ SIG Notes 129 (5) Mar 2011Research for redundancy of Ceph's metadata serversIPSJ SIG Notes 2011 (8) Mar 2011[Refereed]クラスタ間並列複製作成のためのファイル分割を許さないスケジューリングハイパフォーマンスコンピューティングと計算科学シンポジウム HPCS2011論文集 Jan 2011[Refereed]Effects of Access Patterns and Delay on Remote File AccessIPSJ SIG Notes 2010 (6) Dec 2010Effects of Access Patterns and Delay on Remote File AccessIPSJ SIG Notes 2010 (6) Dec 2010Replication Scheduling for a Set of Files between PC Clusters情報処理学会論文誌. コンピューティングシステム 3 (3) Sep 2010[Refereed]High Performance File Service for Cloud Computing情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 126 Jul 2010High Performance File Service for Cloud ComputingIPSJ SIG Notes 2010 (38) Jul 2010Data Intensive distributed computing using MapReduce on Gfarm file systemIPSJ SIG Notes 126 (4) Jul 2010NEW GENERATION COMPUTING 28 (3) Jul 2010[Refereed]Gfarm Grid File SystemNew Generation Computing 28 Jul 2010[Refereed]High Performance File Service for Cloud Computing情報処理学会研究報告 2010-HPC-126 (38) Jul 2010Distributed Metadata Management System for Global File System HGFSIPSJ SIG Notes 2010 (29) Jul 2010Data Intensive distributed computing using MapReduce on Gfarm file system情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 126 (4) Jul 2010An Implementation and evaluation of MPI-IO for Gfarm file systemIPSJ SIG Notes 126 (21) Jul 2010Distributed Metadata Management System for Global File System HGFS情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 126 Jul 2010Pwrake: A parallel and distributed flexible workflow management tool for wide-area data intensive computingProceedings of ACM International Symposium on High Performance Distributed Computing (HPDC) Jun 2010[Refereed]PCクラスタ間ファイル複製スケジューリング先進的計算基盤システムシンポジウムSACSIS 2010論文集 May 2010[Refereed]グラフ分割による広域分散並列ワークフローの効率的な実行先進的計算基盤システムシンポジウムSACSIS 2010論文集 May 2010[Refereed]2L-2 Performance Evaluation for FileSystem on MapReduce全国大会講演論文集 72 (1) Mar 20101ZC-2 Study of High Paformance File System on Cloud Computing全国大会講演論文集 72 (3) Mar 2010An Implementation of MPI-IO for Gfarm Global File SystemIPSJ SIG Notes 2010 (15) Feb 2010クラスタ間広域ファイル複製作成におけるスケジューリング2010年ハイパフォーマンスコンピューティングと計算科学シンポジウム論文集 Jan 2010[Refereed]A Study on Efficient Monitoring System in Multi-Grid EnvironmentIPSJ SIG Notes 2009 (8) Oct 2009Design of Distributed Metadata Management System for Global File SystemIPSJ SIG Notes 2009 (32) Jul 2009Application and Evaluation of Large-Area Distributed File System for e-ScienceIPSJ SIG Notes 2009 (41) Jul 2009A Proposal of Efficient File Transfer Between ClustersIPSJ SIG Notes 2009 (31) Jul 2009Distributed metadata management system for wide area file system情報処理学会研究報告 2009 (14) Feb 2009Distributed Metadata management system for wide area filesystem研究報告 - ハイパフォーマンスコンピューティング(HPC) 2009 (14) Feb 2009The task scheduling algorithm for parallel file transfer system情報処理学会研究報告 2009 (14) Feb 2009The Task Scheduling Algorithm for Parallel File Transfer SystemIPSJ SIG Notes 2009 (14) Feb 2009Development and Performance Evaluation of Distributed Parallel Processing System for Solar-Terrestrial Physics Observation Data and 3-D Computer Simulation Data based on Grid Datafarm Architecture(Session 1A)IPSJ SIG Notes 2009 (19) Feb 2009リソースネームスペース管理サービスの負荷分散手法の提案情報処理学会研究報告 2008-HPC-116 Aug 2008T2K筑波システムにおけるLinpack性能評価情報処理学会研究報告 2008-HPC-116 Aug 2008Load balancing of Resource Namespace Management ServiceIPSJ SIG Notes 2008 (74) Jul 2008Performance Evaluation of Linpack on T2K-Tsukuba SystemIPSJ SIG Notes 2008 (74) Jul 2008XMLデータを対象としたファセット検索インタフェースの生成情報処理学会研究報告 2008-DD-066 May 2008Semi-Automated Generation of Faceted Navigation Interfaces for XML DataIPSJ SIG Notes 2008 (53) May 20083ZL-4 仮想IPアドレスを用いたプライベートネットワーク内のノードへの透過的アクセス(情報爆発時代における安全,安心ネットワーク技術,学生セッション,「情報爆発」時代に向けた新しいIT基盤技術)全国大会講演論文集 70 (5) Mar 20086ZJ-4 Implementation of Resource Namespace Service全国大会講演論文集 70 (5) Mar 2008High Performance Data Analysis for Particle Physics using the Gfarm file systemJournal of Physics: Conference Series 119 Jan 2008[Refereed]Performance Evaluation of Data Management Layer by Data Sharing Patterns for Grid RPC ApplicationsLecture Notes in Computer Science 5168 Jan 2008[Refereed]Building Hierarchical Grid Storage Using the GFARM Global File System and the JUXMEM Grid Data-Sharing ServiceLecture Notes in Computer Science 5168 Jan 2008[Refereed]International Lattice Data Grid for computational particle physics and national Data Grid JLDGIPSJ SIG Notes 2007 (122) Dec 2007Implementation and Evaluation of Gfarm v2 Global Distributed File SystemIPSJ SIG Notes 2007 (122) Dec 2007Performance Improvement by Distributed Data Management Layer on Grid RPC System(Grid)情報処理学会論文誌. コンピューティングシステム 48 (13) Aug 2007[Refereed]Mechanism for Using Nodes Inside Private Network as Public ServersIPSJ SIG Notes 2007 (80) Aug 2007Resource Namespace Service Specification*EMPTY* GFD.101 Jan 2007Lessons Learned Through Driving Science Applications in the PRAGMA GridInt. J. Web and Grid Services 3 (3) Jan 2007[Refereed]Performance Evaluation of Gfarm Version 1.4 as a Cluster FilesystemProceedings of the 3rd International Workshop on Grid Computing and Applications (GCA 2007) Jan 2007[Refereed]P2P Overlay Network based on UDP Firewall TraversalIPSJ SIG Notes 2006 (87) Jul 2006Design of a scalable programming environment for large capacity computing in a large P2P grid environmentIPSJ SIG Notes 2006 (87) Jul 2006GDIA: A Scalable Grid Infrastructure for Data Intensive ApplicationsProceedings of IEEE International Conference on Hybrid Information Technology (ICHIT 2006) Jan 2006[Refereed]Building Cyberinfrastructure for Bioinformatics Using Service Oriented ArchitectureProceedings of Fourth International Workshop on Biomedical Computations on the Grid (BioGrid) Jan 2006[Refereed]P2P Overlay Network for TCP Programming with UDP Hole PunchingProceedings of IFIP International Conference on Network and Parallel Computing (NPC 2006) Jan 2006[Refereed]My WorkSphere: Integrative Work Environment for Grid-unaware Biomedical Researchers and ApplicationsProceedings of 2nd Grid Computing Environment Workshop Jan 2006[Refereed]The PRAGMA Testbed - Building a Multi-Application International GridProceedings of International Workshop on Grid Testbeds (Grid Testbeds) Jan 2006[Refereed]Implementing data aware scheduling in Gfarm(R) using LSF (TM) scheduler plugin mechanismGCA '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON GRID COMPUTING AND APPLICATIONS 2005[Refereed]Proteome Analysis using iGAP in GfarmProceedings of 2nd International Workshop on Life Science Grid (LSGRID 2005) Jan 2005[Refereed]Integrating Local Job Scheduler - LSF with GfarmLecture Notes in Computer Science 3758 Jan 2005[Refereed]Gfarm v2: A Grid file system that supports high-performance distributed and parallel data computingProceedings of the 2004 Computing in High Energy and Nuclear Physics (CHEP04) Sep 2004[Refereed]Performance Evaluation of AIST Supercluster P-32 by LinpackIPSJ SIG Notes 2004 (81) Jul 2004Gfarm v2: Design and Implementation of Global Virtual File SystemIPSJ SIG Notes 2004 (81) Jul 2004VLAN-based Multi-path L2 Ethernet Network for ClustersIPSJ Transaction 45 (6) May 2004[Refereed]The Effects of Cooling Fan Vibration on Hard Disk Drives(Aarchitecture)情報処理学会論文誌. コンピューティングシステム 45 (6) May 2004[Refereed]Trans-pacific fast file replication using GNET-1 on Grid DatafarmIPSJ SIG Notes 2004 (20) Mar 2004Performance Evaluation of PVFS-NFS proxyIPSJ SIG Notes 2004 (20) Mar 2004GNET-1: gigabit Ethernet network testbedProceedings of 2004 IEEE International Conference on Cluster Computing (CLUSTER'04) Jan 2004[Refereed]The Second Trans-Pacific Grid Datafarm Testbed and Experiments for SC2003Proceedings of 2004 International Symposium on Applications and the Internet - Workshops (SAINT 2004 Workshops) Jan 2004[Refereed]Parallel and Distributed Astronomical Data Analysis on Grid DatafarmProceedings of 5th IEEE/ACM International Workshop on Grid Computing (Grid 2004) Jan 2004[Refereed]VLAN-based multi-path L2 Ethenet networkIPSJ SIG Notes 2003 (102) Oct 2003Mechanical Vibration of Cooling Fan makes some problems in High Density Cluster NodesIPSJ SIG Notes 2003 (84) Aug 2003Performance Analysis of Scheduling and Replication Algorithms on Grid Datafarm Architecture (Grid Middleware)情報処理学会論文誌. コンピューティングシステム 44 (11) Aug 2003[Refereed]Grid Datafarmにおけるスケジューリング・複製手法の性能評価情報処理学会論文誌:コンピューティングシステム 44 (SIG11(ACS3)) Aug 2003[Refereed]Performance Evaluation of Astronomical Data Analysis Tools on Grid Datafarm ArchitectureIPSJ SIG Notes 2003 (83) Aug 2003Implementation and Performance Evaluation of PVFS-PMIPSJ SIG Notes 2003 (29) Mar 2003Cluster-wide Precise Performance Measurement using SNMPIPSJ SIG Notes 2003 (29) Mar 2003Performance Analysis of Scheduling and Replication Algorithms on Grid Datafarm Architecture for High Energy Physics ApplicationsProceedings of the 12th IEEE International Symposium on High Performance Distributed Computing (HPDC-12) Jan 2003[Refereed]Design and implementation of PVFS-PM: a cluster file system on SCoreProceedings of Workshop on Parallel I/O in Cluster Computing and Computational Grids Jan 2003[Refereed]Building A High Performance Parallel File System Using Grid Datafarm and ROOT I/OProceedings of the 2003 Computing in High Energy and Nuclear Physics (CHEP03) Jan 2003[Refereed]Worldwide Fast File Replication on Grid DatafarmProceedings of the 2003 Computing in High Energy and Nuclear Physics (CHEP03) Jan 2003[Refereed]Grid Datafarm Architecture for Global Petascale Data-intensive Computing情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002[Refereed]Grid Datafarm Architecture for Global Petascale Data-intensive Computing情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 43 (6) Sep 2002[Refereed]Performance Analysis of Scheduling and Replication Algorithms on Grid Datafarm Architecture for High Energy Physics ApplicationsIPSJ SIG Notes 2002 (80) Aug 2002Basic Concept of AIST Super ClusterIPSJ SIG Notes 2002 (80) Aug 2002Performance Evaluation of PVFSIPSJ SIG Notes 2002 (80) Aug 2002Implementation and Evaluation of a Scalable Job Management Architecture for Large-Scale PC Cluster on the Grid EnvironmentIPSJ SIG Notes 2002 (22) Mar 2002Implementation and Evaluation of a Scalable Job Management Architecture for Large-Scale PC Cluster on the Grid EnvironmentIPSJ SIG Notes 2002 (22) Mar 2002Grid Datafarm Architecture for Petascale Data Intensive ComputingProceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2002), Jan 2002[Refereed]遠隔メモリ操作に基づく高速メッセージパッシングライブラリFMPLの設計と開発情報処理学会論文誌 43 (4) Jan 2002[Refereed]遠隔メモリ操作に基づく高速メッセージパッシングライブラリFMPLの設計と開発情報処理学会論文誌 43 (4) Jan 2002[Refereed]Performance of a Network Enabled Server on a Real World-wide Grid情報処理学会論文誌. ハイパフォーマンスコンピューティングシステム 42 (9) Aug 2001[Refereed]OpenMP compiler for PC clusters using TDL distributed array libraryIPSJ SIG Notes 2001 (77) Jul 2001Grid Datafarm Architecture for Petascale Data Intensive ComputingIPSJ SIG Notes 2001 (77) Jul 2001The Optimization of The LINPACK Benchmark for Heterogeneous ClustersIPSJ SIG Notes 2001 (49) May 2001Multiple precision floating point number extension for Fortran compiler omf77 with GNU MPIPSJ SIG Notes 2001 (22) Mar 2001Multiple precision floating point number extension for Fortran compiler omf77 with GNU MPIPSJ SIG Notes 2001 (22) Mar 2001Design and implementation of FMPL, a fast message-passing library for remote memory operationsProceedings of Conference on High Performance Networking and Computing (SC2001) Jan 2001[Refereed]Network Enabled Server の World-wide Grid における性能情報処理学会論文誌:ハイパフォーマンスコンピューティングシステム 42 (SIG9(HPS3)) Jan 2001[Refereed]Numerical simulation of shockwave by KrF laser ablationECLIM 2000: 26th European Conference on Laser Interaction with Matter 4424 Jan 2001[Refereed]Implementation and evaluation of RHiNET-1, a network for LASNIPSJ SIG Notes 2000 (23) Mar 2000Impact of OpenMP Optimizations for the MGCG MethodLecture Notes in Computer Science 1940 Jan 2000[Refereed]Multi-dimensional simulation of target acceleration and hydrodynamic instabilities by KrF laser irradiationProceedings of 2000 International Congress on Plasma Physics (ICPP-2000) Jan 2000[Refereed]Evaluation of TEA Expert-an automated performance tuning environment-on Real MPP SystemIPSJ SIG Notes 99 (66) Aug 1999Shared-memory programming support on RHiNETIPSJ SIG Notes 99 (67) Aug 1999Calculation of Partial Charges of Huge Molecular Systems by Parallel ComputingIPSJ SIG Notes 99 (38) May 1999Performance analysis of parallel computers and parallel programs using clock-level profiling systemIPSJ SIG Notes 99 (21) Mar 1999Parallel processing system using network-connected PCsIPSJ SIG Notes 99 (21) Mar 1999RHiNET: A network for high performance parallel processing using locally distributed computersProceedings of 1999 International Workshop on Innovative Architecture (IWIA99) Jan 1999[Refereed]ウェーブフロント型並列処理における分散メモリ型並列計算機の通信機構の評価情報処理学会論文誌 40 (5) Jan 1999[Refereed]リモートメモリ書き込みを用いたMPIの効率的実装情報処理学会論文誌 40 (5) Jan 1999[Refereed]ウェーブフロント型並列処理における分散メモリ型並列計算機の通信機構の評価情報処理学会論文誌 40 (5) Jan 1999[Refereed]リモートメモリ書き込みを用いたMPIの効率的実装情報処理学会論文誌 40 (5) Jan 1999[Refereed]Automatic Adaptive Performance Tuning Tool, TEA ExpertIPSJ SIG Notes 98 (72) Aug 1998Matrix Workshop : a Matrix Generator on WWWIPSJ SIG Notes 98 (72) Aug 1998Local synchronization facility for shared memory multiprocessorsIPSJ SIG Notes 98 (72) Aug 1998Preliminary performance evaluation of etlwiz: a dedicated cluster of Alpha workstationsIPSJ SIG Notes 98 (18) Mar 1998Loopacross: Beyond Doacross for Distributed Memory MultiprocessorsProceedings of IASTED Int. Conf. of PDCN'98 Jan 1998[Refereed]Highly Efficient Implementation of MPI Point-to-point Communication Using Remote Memory OperationsProceedings of International Conference on Supercomputing (ICS98) Jan 1998[Refereed]LU Decomposition on Distributed Memory MachinesIPSJ SIG Notes 95 (81) Aug 1995Efficient Implementation of the Multigrid Preconditioned Conjugate Gradient Method on Distributed Memory MachinesProceedings of Supercomputing '94 Jan 1994[Refereed]The Multigrid Preconditioned Conjugate Gradient MethodProceedings of Sixth Copper Mountain Conference on Multigrid Methods 3224 Apr 1993[Refereed]Awards & Honors
Nov 2018Workshop Best PaperOct 2018IEEE Espana Best Paper AwardNov 2006IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC06) SC 2006 HPC Storage Challenge, Winner – Large SystemsNov 2006IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC06) SC 2006 HPC Storage Challenge, Winner – Large SystemsBooks etc
Advanced Software Technologies for Post-Peta Scale Computing(Role:Contributor, System Software for Data-Intensive Science)ASTONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XIX(Role:Contributor, Impact of Gfarm, a Wide-area Distributed File System, upon Astronomical Data Analysis and Virtual Observatory)ファイル共有とグリッド技術(Role:Sole author)Introduction to Grid Technology(Role:Sole author)MGCG METHOD: A ROBUST AND HIGHLY PARALLEL ITERATIVE METHOD(Role:Sole author)Research Grants & Projects
EBD:次世代の年ヨッタバイト処理に向けたエクストリームビッグデータの基盤技術Research period: Sep 2013 - Mar 2019極端気象予測を拓くビッグデータ機械学習基盤の研究基盤研究(B)Research period: 2017 - 2019System Software for Post Petascale Data Intensive ScienceResearch period: Apr 2011 - Mar 2017スケーラブルな広域ファイルシステムの研究特定領域研究Research period: 2009 - 2010情報爆発時代を支えるスケーラブルな広域分散ファイルシステムの研究特定領域研究Research period: 2007 - 2008
![]() |
氏名 | 額田 彰 |
Name | Nukada Akira | |
Faculty | ||
Section | ||
Position | Professor | |
Theme | High Performance Computing, Performance Optimization, GPU Computing | |
Related Links | ||
nukada![]() |
Research Interests
Academic & Professional Experience
Apr 2020-PresentUniversity of Tsukuba Center for Computational SciencesApr 2018-Mar 2020Tokyo Institute of Technology Global Scientific Information and Computing CenterApr 2013-Mar 2018Tokyo Institute of Technology Global Scientific Information and Computing CenterNov 2007-Mar 2013Tokyo Institute of Technology Global Scientific Information and Computing CenterApr 2004-Oct 2007科学技術振興機構Published Papers
TSUBAME3.0におけるストレージ利用効率化のためのファイルシステムベンチマーク情報処理学会研究報告 2019-HPC-170 (24) Jul 201919th Annual IEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing (CCGrid 2019) May 2019[Refereed]小疎行列積計算のGPU最適化情報処理学会研究報告 2019-HPC-168 (19) Mar 2019GraphCNN向けの疎行列積計算Batch最適化情報処理学会研究報告 2018-HPC-167 (7) Dec 2018Parallel Computing 77 Sep 2018[Refereed]PASC 2018: Platform for Advanced Scientific Computing Conference Jul 2018[Refereed]18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2018) May 2018[Refereed]32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2018) May 2018[Refereed]Overview of TSUBAME3.0, Green Cloud Supercomputer for Convergence of HPC, AI and Big-DataTsubame ESJ. : e-science journal 16 Nov 2017Proceedings of the International Conference on Parallel Processing Sep 2017[Refereed]High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPUProceedings of the International Conference on Parallel Processing Sep 2017[Refereed]Procedia Computer Science 80 2016[Refereed]疎行列ベクトル積計算を対象としたGPU向けメモリアクセス削減手法情報処理学会研究報告 2015-HPC-151 (8) Sep 2015EURO-PAR 2015: PARALLEL PROCESSING 9233 2015[Refereed]ACM International Conference Proceeding Series 09-12- Sep 2014[Refereed]超省エネスーパーコンピューター TSUBAMEペトロテック 37 (8) Aug 2014GPU間マイグレーションによる効率的な並列実行情報処理学会研究報告 2014-HPC-145 (42) Jul 2014TSUBAME-KFC : the Greenest Supercomputer in the World With Liquid Submersion CoolingTsubame ESJ. : e-science journal 11 Jun 2014GPUのキャッシュを考慮した疎行列ベクトル積計算手法の性能評価情報処理学会研究報告 2014-HPC-144 (5) May 2014Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS 2015- 2014[Refereed]Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS 2015- 2014[Refereed]TSUBAME-KFC: 液浸冷却を用いたウルトラグリーンスパコン研究設備情報処理学会研究報告 2013-ARC-199/HPC-142 Dec 2013APU上の混合精度AMG法IPSJ SIG Notes 2013 (13) Sep 2013ウルトラグリーンスパコンTSUBAME2.5/TSUBAME-KFC大学ICT推進協議会年次大会論文集 2013SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis Nov 2012[Refereed]GPU スパコンTSUBAME 2.0 によるフェーズフィールド法を用いた2 petaflops樹枝状凝固成長計算第17回計算工学講演会論文集 17 May 2012ACM International Conference Proceeding Series 2012[Refereed]Operation of TSUBAME 2.0 Green Supercomputer dealing with Power Crisis研究報告ハイパフォーマンスコンピューティング(HPC) 2011 (12) Nov 2011Achievement of Linpack Performance of over 1PFlops on TSUBAME 2.0 Supercomputer情報処理学会論文誌コンピューティングシステム(ACS) 4 (4) Oct 2011[Refereed]Achievement of Linpack Performance of over 1PFlops on TSUBAME 2.0 Supercomputer先進的計算基盤システムシンポジウム論文集 (2011) May 2011[Refereed]Fast fourier transform using GPUTsubame ESJ. 3 Feb 2011Proceedings of 2011 SC - International Conference for High Performance Computing, Networking, Storage and Analysis 2011[Refereed]PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011 2011[Refereed]IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum 2011[Refereed]Efficient PageRank on GPU Clusters情報処理学会研究報告 2010-HPC-128 (21) Dec 2010Performance Evaluation of TSUBAME 2.0 Heterogeneous Supercomputer with Linpack Benchmark情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2010-HPC-128 (5) Dec 2010Optimization of electric power efficiecy based on model in GPU情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2010-HPC-128 (5) Dec 2010CUDAによる高速フーリエ変換応用数理 20 (2) Jun 2010異種アクセラレータを持つTSUBAMEスーパーコンピュータのLinpack評価応用数理 20 (2) Jun 2010Power-Aware Task Scheduling on GPU Accelerated Clusters情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 124 Feb 2010Bulletin of the Japan Society for Industrial and Applied Mathematics 20 (2) 2010Bulletin of the Japan Society for Industrial and Applied Mathematics 20 (2) 201017th International Conference on High Performance Computing, HiPC 2010 2010[Refereed]2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2010 2010[Refereed]2010 International Conference on Green Computing, Green Comp 2010 2010[Refereed]Computer Science - Research and Development 25 (1-2) 2010[Refereed]Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010 2010[Refereed]Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010 2010[Refereed]CG on GPU-enhanced Clusters情報処理学会研究報告 2009-HPC-123 Dec 2009Software Framework for GPU Memory Errors情報処理学会研究報告. 計算機アーキテクチャ研究会報告 186 Nov 2009SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis Nov 2009[Refereed]Linpack Tuning Method on a Heterogeneous Supercomputer with Hybrid AcceleratorsProc. Summer United Workshops on Parallel, Distributed and Cooperative Processing, SWoPP2009, Sendai, Aug. 2009-HPC-121 (3) Oct 2009CUDA GPU向けの自動最適化FFTライブラリ情報処理学会論文誌コンピューティングシステム(ACS) 2 (3) Sep 2009[Refereed]GPUにおける性能と消費電力の相関性の解析情報処理学会研究報告 2009-HPC-121 Jul 2009GPUにおける耐故障性を考慮した数値計算の電力性能情報処理学会研究報告 2009-HPC-121 Jul 2009Acceleration of Himeno Benchmark on Multi-node GPU System by Overlapping Communication with Calculation : Over 700 GFLOPS of Sustained Performance is Achieved with 32 GPUs情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 120 Jun 2009An Efficient Conjugate Gradient Solver on Double Precision Multi-GPU Systems先進的計算基盤システムシンポジウムSACSIS2009論文集 May 2009[Refereed]CUDA GPU向けの自動最適化FFTライブラリ先進的計算基盤システムシンポジウムSACSIS2009論文集 May 2009[Refereed]Linpack Tuning on a Heterogeneous Supercomputer with Four Types of ProcessorsIPSJ SIG Notes 182 Feb 2009Performance Evaluation of Software-Based ECC for GPUsIPSJ SIG Notes 2009 2009COMPUTATIONAL SCIENCE - ICCS 2009, PART I 5544 2009[Refereed]SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing Nov 2008[Refereed]High Performance 3-D FFT in CUDA Environment情報処理学会論文誌コンピューティングシステム(ACS) 1 (2) Aug 2008[Refereed]ソフトウェアECCによるGPUメモリの耐故障性の実現と評価IEICE technical report 108 (181) Aug 2008Lecture Notes in Computer Science 4967(PPAM2007) May 2008[Refereed]High Performance FFT on SGI Altix 3700Proc. 3rd International Conference on High Performance Computing and Communications (HPCC 2007), Lecture Notes in Computer Science (4782) Sep 2007[Refereed]Toward Automatic Performance Tuning for Numerical Simulations in the SILC Matrix Computation FrameworkProceedings of the Second international Workshop on Automatic Performance Tuning (iWAPT 2007) Sep 2007[Refereed]Distributed SILC: An easy-to-use interface for MPI-based parallel matrix computation librariesLecture Notes in Computer Science 4699 (PARA06) Jan 2007[Refereed]PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS 4742 2007[Refereed]A Performance Evaluation Model for the SILC Matrix Computation FrameworkProceeding of the IFIP International Conference on Network and Parallel Computing (NPC2006) Oct 2006[Refereed]SILC: A Flexible and Environment Independent Interface to Matrix Computation LibrariesLecture Notes in Computer Science 3911 (PPAM2005) Sep 2006[Refereed]Implementation of the Matrix Computation Library Interface SILC in Distributed Parallel EnvironmentsIPSJ SIG Notes 2006 (87) Jul 20062006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings III May 2006[Refereed]分散型 SILC の設計: MPI ベースの行列計算ライブラリを使いやすくするインタフェース2006年ハイパフォーマンスコンピューティングと計算科学シンポジウム HPCS2006 ポスター論文集 Jan 2006LAPACK in SILC: Use of a Flexible Application Framework for Matrix Computation LibrariesProceedings on the Eighth International Conference on High-Performance Computing in Asia-Pacific Region (HPC Asia 2005) Dec 2005[Refereed]共有メモリ並列環境における SILC の実現と利用第 34 回数値解析シンポジウム講演予稿集 Jun 2005SILC: 行列計算ライブラリの利用を簡単化するフレームワーク第10回計算工学講演会講演論文集 10 (2) Jun 2005行列計算ライブラリに対する計算環境に依存しないインタフェースの開発2005年ハイパフォーマンスコンピューティングと計算科学シンポジウム HPCS2005 ポスター論文集 Jan 2005Computing Environment Independent Interface for Matrix Computation LibraryIPSJ SIG Notes 2004 (128) Dec 2004Parallel Implementation of FFT Algorithms on Distributed Shared Memory Architecture and Its Optimization情報処理学会論文誌コンピューティングシステム(ACS) 44 (6) May 2003[Refereed]Performance Evaluation of Commodity Distributed Shared Memory IBM x440IPSJ SIG Notes 93 Mar 2003Fine Grain Parallel Implementation of Sparse Matrix Algorithms and its OptimizationIPSJ SIG Notes 91 Aug 2002Awards & Honors
Jul 2012日本計算工学会 第17回 計算工学講演会ベストペーパーアワードNov 2011ACM ACM Gordon Bell Prize – Special Achievements in Scalability and Time-to-SolutionDec 2010IEEE Computer Society Japan Chapter IEEE Computer Society Japan Chapter Young Author Award 2010Mar 2010情報処理学会 平成21年度山下記念研究賞May 2009情報処理学会 第7回先進的基盤システムシンポジウムSACSIS2009最優秀論文賞Jun 2008IEEE Computer Society Japan Chapter 第6回先進的基盤システムシンポジウムSACSIS2008優秀若手研究賞Books etc
はじめてのCUDAプログラミング(Role:Sole author)Research Grants & Projects
GPUアプリケーションに対するシステムレベルのチェックポイント技術の確立科学研究費補助金 若手研究Research period: Apr 2020 - Mar 2022高度なGPUプログラミング手法の開拓科学研究費助成金 挑戦的萌芽研究Research period: Apr 2011 - Mar 2013GPUによるFFT計算の自動チューニング手法の研究科学研究費補助金 若手研究(A)Research period: Apr 2010 - Mar 2012
![]() |
氏名 | 多田野 寛人 |
Name | Tadano Hiroto | |
Faculty | ||
Section | ||
Position | Assistant Professor | |
Theme | Numerical analysis: Numerical algorithms for large scale linear systems. Parallel computing for eigenvalue problems. | |
Related Links | ||
tadano![]() |
Research Interests
Academic & Professional Experience
Oct 2011-Mar 2016University of Tsukuba Faculty of Engineering, Information and Systems Assistant ProfessorMar 2008-Sep 2011Graduate School of Systems and Information Engineering University of Tsukuba Assistant ProfessorApr 2007-Feb 2008Kyoto University Graduate School of Informatics JSPS Research AssociateApr 2006-Mar 2007Japan Science and Technology Agency ResearcherPublished Papers
都市気象コードCity-LESの並列GPU実装の最適化と性能評価情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-170(5) Jul 2019JAPAN JOURNAL OF INDUSTRIAL AND APPLIED MATHEMATICS 36 (2) Jul 2019Journal of Advanced Simulation in Science and Engineering 6 (1) Mar 2019[Refereed]Development of the Shifted Block BiCGSTAB(l) Method and Improvement of Its AccuracyTransactions of the Japan Society for Industrial and Applied Mathematics 26 (3) 2016[Refereed]Journal of Computational Chemistry 35 (18) Jul 2014[Refereed]Improvement of the accuracy of the approximate solution of the Block BiCR methodJSIAM Letters 6 2014Journal of Algorithms and Computational Technology 7 (3) Sep 2013[Refereed]A modified Block IDR($s$) method for computing high accuracy solutionsJSIAM Letters 4 Aug 2012[Refereed]COMPUTER PHYSICS COMMUNICATIONS 183 (1) Jan 2012[Refereed]Numerical Solvers for Solving Linear Systems with Multiple Right-hand Sides応用数理 21 (4) Dec 2011局地気象シミュレーションで現れる線形方程式に対する前処理の評価日本応用数理学会2011年度年会予稿集 Sep 2011演算加速装置に基づく超並列クラスタHA-PACSによる大規模計算科学IPSJ SIG Notes 2011 (21) Jul 2011A convergence improvement of the BSAIC preconditioner by deflationJSIAM Letters 3 Jan 2011[Refereed]JOURNAL OF COMPUTATIONAL CHEMISTRY 31 (13) Oct 2010[Refereed]固有値分布の確率的推定法日本応用数理学会年会講演予稿集 2010 Sep 2010独立並列計算による行列固有値分布の確率的推定法IPSJ SIG Notes 2010 (35) Jul 2010JAPAN JOURNAL OF INDUSTRIAL AND APPLIED MATHEMATICS 27 (1) Jun 2010[Refereed]A PARALLEL EIGENSOLVER USING CONTOUR INTEGRATION FOR GENERALIZED EIGENVALUE PROBLEMS IN MOLECULAR SIMULATIONTAIWANESE JOURNAL OF MATHEMATICS 14 (3A) Jun 2010[Refereed]COMPUTER PHYSICS COMMUNICATIONS 181 (5) May 2010[Refereed]Parallel Eigensolver for Large Scale Non-linear SystemsNUMERICAL ANALYSIS AND APPLIED MATHEMATICS, VOLS I-III 1281 2010[Refereed]Parallel stochastic estimation method of eigenvalue distributionJSIAM Letters 2 Jan 2010[Refereed]A quadrature-based eigensolver with a Krylov subspace method for shifted linear systems for Hermitian eigenproblems in lattice QCDJSIAM Letters 2 Jan 2010[Refereed]A block sparse approximate inverse with cutoff preconditioner for semi-sparse linear systems derived from Molecular Orbital calculationsJSIAM Letters 2 Jan 2010[Refereed]Performance of a Contour Integral Based Eigensolver with a Complete Sparse Factorization Preconditioner on Multi-Core ClustersLecture Notes in Computer Science Jan 2010[Refereed]COMPUTER PHYSICS COMMUNICATIONS 181 (1) Jan 2010[Refereed]Error analysis for a matrix pencil of Hankel matrices with perturbed complex momentsJSIAM Letters 1 Dec 2009[Refereed]Application and Performance Evaluation of the Volumetric Parallel 3D-FFT to 3D-RISM on Massively Parallel ClusterIPSJ SIG Notes 2009 (3) Oct 2009バンド局所化による電子状態計算の高性能並列アルゴリズム日本応用数理学会年会講演予稿集 2009 Sep 2009A numerical method for nonlinear eigenvalue problems using contour integralsJSIAM Letters 1 Aug 2009[Refereed]A Block Krylov Subspace Method for the Contour Integral Method and Its Application to Molecular Orbital Computations情報処理学会論文誌. コンピューティングシステム 2 (2) Jul 2009[Refereed]A method for nonlinear eigenvalue problems based on contour integrationRIMS Kokyuroku 1638 Apr 2009A Method for Finding Zeros of Polynomial Equations using a Contour Integral Based EigensolverSNC'09: PROCEEDINGS OF THE 2009 INTERNATIONAL WORKSHOP ON SYMBOLIC-NUMERIC COMPUTATION 2009[Refereed]Block BiCGGR: a new Block Krylov subspace method for computing high accuracy solutionsJSIAM Letters 1 Jan 2009[Refereed]A performance evaluation of the preconditioning using double CutoffTransactions of the Japan Society for Industrial and Applied Mathematics 18 (4) Dec 2008[Refereed]グレブナ基底を用いない連立代数方程式の非線形固有値問題への変換法と非線形固有値問題の解法についてBulletin of the Japan Society for Symboric and Algebraic Computation 15 (2) Dec 2008Implementation and Performance Evaluation of Sparse Matrix Vector Multiplication for Mixed Precision Krylov Method on the Cell BE情報処理学会論文誌. コンピューティングシステム 1 (1) Jun 2008[Refereed]On Single Precision Preconditioners for Krylov Subspace Iterative MethodsLecture Notes in Computer Science (4818) Apr 2008[Refereed]CIRR: A Rayleigh-Ritz Type Method with Contour Integral for Generalized Eigenvalue ProblemsHokkaido Mathematical Journal 36 (4 (Special Issue)) Dec 2007[Refereed]A method for estimating a distribution of eigenvalues using the AMLS methodTransactions of the Japan Society for Industrial and Applied Mathematics 17 (4) Dec 2007[Refereed]Modified Multiple Explicitly Restarted Arnoldi Method with Hybrid GridRPC/MPI Implementation情報処理学会論文誌. コンピューティングシステム 48 (8) May 2007[Refereed]A master-worker type eigensolver for molecular orbital computationsAPPLIED PARALLEL COMPUTING 4699 (4699) 2007[Refereed]On an evaluation method of preconditioners for complex symmetric systems of linear equationsTransactions of the Japan Society for Industrial and Applied Mathematics 16 (4) Dec 2006[Refereed]A parallel method for large scale eigenvalue problems in a Grid environmentThe Computational Mechanics Conference 2005 (18) Nov 2005A Stabilization of the CGS Method by Avoiding Near-BreakdownProceedings of International Conference of Numerical Analysis and Applied Mathematics 2005 (ICNAAM 2005) Sep 2005[Refereed]A stabilization of the CGS method by restarting Lanczos processTransactions of the Japan Society for Industrial and Applied Mathematics 15 (2) Jun 2005[Refereed]A Krylov subspace method for shifted linear systems and its application to eigenvalue problemsTransactions of the Japan Society for Industrial and Applied Mathematics 14 (3) Sep 2004[Refereed]A method for avoiding breakdown in product-type iterative methods and its behavior for Toeplitz linear systemsICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004 2 (2) 2004[Refereed]A moment-based method for large scale eigenvalue problemsICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004 1 (3) 2004[Refereed]Awards & Honors
Apr 2011日本応用数理学会 日本応用数理学会 第7回 若手優秀講演賞Jan 2008情報処理学会 2008年ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2008)最優秀論文賞Books etc
数値線形代数の数理とHPC(Role:Contributor, 1.2 反復法, 1.2.1 定常反復法, 1.2.2 クリロフ部分空間反復法)SNC'09: PROCEEDINGS OF THE 2009 INTERNATIONAL WORKSHOP ON SYMBOLIC-NUMERIC COMPUTATION(Role:Contributor, A Method for Finding Zeros of Polynomial Equations using a Contour Integral Based Eigensolver)ICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004(Role:Contributor, A moment-based method for large scale eigenvalue problems)ICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004(Role:Contributor, A method for avoiding breakdown in product-type iterative methods and its behavior for Toeplitz linear systems)Research Grants & Projects
![]() |
氏名 | 小林 諒平 |
Name | Kobayashi Ryohei | |
Faculty | ||
Section | ||
Position | Assistant Professor | |
Theme | Reconfigurable Computing System, High-speed RTL Simulation | |
Related Links | ||
kobayashi![]() |
Research Interests
Academic & Professional Experience
Published Papers
FPGAに組み込まれたHBMの効率的な利用とその考察電子情報通信学会技術研究報告 (信学技報) 120 (168) Sep 2020再結合光子の輻射輸送大規模計算に向けたHBM-FPGA実装への考察情報科学技術フォーラム講演論文集 1 Sep 20202020 IEEE 31st International Conference on Application-specific Systems, Architectures and Processors (ASAP) Jul 2020[Refereed]2020 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Jul 2020[Refereed]宇宙幅射輸送コードARGOTのOpenACCによるGPU実装研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-175 (7) Jul 2020Stratix 10 FPGAを用いたray-tracing法による輻射輸送計算の高速化研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-175 (8) Jul 2020OpenCL対応FPGA間光リンク接続フレームワークCIRCUSとSMIの性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-175 (16) Jul 2020Design and Performance Evaluation of Inter-FPGA Communication using High Level Synthesis計算工学講演会論文集 Proceedings of the Conference on Computational Engineering and Science / 日本計算工学会 編 25 Jun 2020Multi-hybrid Accelerated Computing with GPU and Reconfigurable System計算工学講演会論文集 Proceedings of the Conference on Computational Engineering and Science / 日本計算工学会 編 25 Jun 2020GPU・FPGA複合演算加速による宇宙輻射輸送コードARGOTの性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-173 (8) Mar 2020スーパーコンピュータCygnus上におけるFPGA間パイプライン通信の性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2020-HPC-173 (24) Mar 2020Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops Jan 2020[Refereed]OpenCL対応GPU・FPGAデバイス間連携機構による宇宙輻射輸送コードの演算加速研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-172 (8) Dec 2019GPU-FPGA協調プログラミングを実現するコンパイラの開発研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-172 (11) Dec 2019再構成可能なハードウェアを用いた演算と通信を融合する手法の提案と性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-171 (6) Sep 20192019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Jul 2019[Refereed]2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Jul 2019[Refereed]OpenCL対応FPGA間通信機能によるGPU・FPGA複合型演算加速研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-170 (5) Jul 2019GPU・FPGA複合演算加速による輻射流体シミュレーションコードARGOTの実装研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-170 (22) Jul 2019Optimization on Astrophysical Radiative Transfer Code for FPGAs with OpenCLIPSJ Transactions on Advanced Computing System 12 (3) Jul 2019[Refereed]GPU-FPGA協調計算を記述するためのプログラミング環境に関する研究研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-169 (10) May 2019高位設計と低位設計の違いとFPGA演算性能の関係について情報処理学会第81回全国大会講演論文集 Mar 2019GPU・FPGA混載ノードにおけるヘテロ演算加速プログラム環境に関する研究研究報告ハイパフォーマンスコンピューティング(HPC) 2019-HPC-168 (10) Feb 2019異デバイス間でのPCIe通信を実現するOpenCL対応FPGAモジュールの提案と検証IEICE-RECONF2018-63 IEICE-118 (432) Jan 2019Proceedings of the HPC Asia 2019 Workshops Jan 2019[Refereed]OpenCLによるFPGA上の演算と通信を融合した並列処理システムの実装及び性能評価研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-167 (9) Dec 2018OpenCLとVerilog HDLの混合記述によるGPU-FPGAデバイス間連携研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-167 (11) Dec 2018FPGAによる宇宙輻射輸送シミュレーションの演算加速IEICE-RECONF2018-25 118 (215) Sep 2018並列FPGAシステムにおけるOpenCLを用いた宇宙輻射輸送コードの演算加速研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-165 (27) Jul 2018GPU-FPGA複合システムにおけるデバイス間連携機構研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-165 (26) Jul 2018HEART 2018 Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies Article No. 6 Jun 2018[Refereed]複数のFPGAによる分散ソーティングの実現に向けた予備評価Technical report of IEICE. EA 118 (63) May 2018IEICE Transactions on Information and Systems E101D (2) Feb 2018[Refereed]宇宙輻射輸送計算におけるHDL設計とOpenCL設計の比較情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2018-HPC-163 (24) Feb 2018ACM International Conference Proceeding Series Jan 2018[Refereed]OpenCLを用いたFPGAによる宇宙輻射輸送シミュレーションの演算加速情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-161 (12) Sep 2017OpenCLとVerilog HDLの混合記述によるFPGA間Ethernet接続情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) Jul 2017高位合成によるFPGAの高性能計算へ適用ハイパフォーマンスコンピューティングと計算科学シンポジウム論文集 May 2017[Refereed]IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS E100D (5) May 2017[Refereed]OpenCLとVerilog HDLの混合記述によるFPGAプログラミング情報処理学会研究報告ハイパフォーマンスコンピューティング(HPC) 2017-HPC-158 (16) Mar 2017ACM SIGARCH Computer Architecture News - HEART '16 44 (4) Sep 2016[Refereed]Proceedings - 2015 3rd International Symposium on Computing and Networking, CANDAR 2015 Mar 2016[Refereed]Frix: Feasible and Reconfigurable IBM PC Compatible SoC第78回全国大会講演論文集 2016 (1) Mar 2016世界最速のFPGAソーティングアクセラレータの初期検討第78回全国大会講演論文集 2016 (1) Mar 2016SSDの並列性を引き出すI/Oスケジューラ研究報告システムソフトウェアとオペレーティング・システム(OS) 2015-OS-135 (14) Nov 2015FPGAを用いた世界最速のソーティングハードウェアの実現に向けた試みIEICE-RECONF2015-12 115 (109) Jun 2015FPGAベースのソーティングアクセラレータの設計と実装IEICE-CPSY2015-5 115 (7) Apr 2015Ultra High-speed FPGA Accelerator for Sorting Application第77回全国大会講演論文集 2015 (1) Mar 20152015 IEEE 9TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANYCORE SYSTEMS-ON-CHIP (MCSOC) 2015[Refereed]2015 IEEE 9TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANYCORE SYSTEMS-ON-CHIP (MCSOC) 2015[Refereed]Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 9040 2015[Refereed]USB3.0接続の手軽で高速なFPGAアクセラレータの設計と実装IEICE-RECONF2014-78 114 (428) Jan 20153bOS: A flexible and lightweight embedded OS operated using only 3 buttons組込みシステムシンポジウム2014論文集 2014 Oct 2014[Refereed]FPGAの消費電力を削減するHDLコーディング手法の検討第76回全国大会講演論文集 2014 (1) Mar 2014Scalable Stencil-computation Accelerator by Employing Multiple Small FPGAsIPSJ Transactions on Advanced Computing System 6 (4) Oct 2013[Refereed]Development of Scalable Stencil-Computation Accelerator Based on Multiple Small FPGAs先進的計算基盤システムシンポジウム論文集 2013 May 2013[Refereed]Design of Synchronization Mechanism to Conquer the Clock Oscillator Variation for High Performance Stencil Computation Accelerator第75回全国大会講演論文集 2013 (1) Mar 2013メッシュ接続FPGAアレーを用いた高性能ステンシル計算機の設計と実装IEICE-RECONF2012-88 112 (377) Jan 2013メッシュ接続FPGAアレーにおける高性能ステンシル計算先進的計算基盤システムシンポジウム論文集 2012 May 2012[Refereed]メッシュ接続FPGAアレーにおけるステンシル計算の検討第74回全国大会講演論文集 2012 (1) Mar 20122012 THIRD INTERNATIONAL CONFERENCE ON NETWORKING AND COMPUTING (ICNC 2012) 2012[Refereed]Awards & Honors
Books etc
Interface 2017年2月号 緊急特集 本家ARMのIoTワールド入門(Role:Contributor, 計算力時代到来...スパコン技術研究コーナ ソート専用コンピュータ最前線)Interface 2016年12月号 IoT&スパコン!ラズパイ時代の自分用コンピュータ作り(Role:Contributor, 第6章 ビッグデータ時代にますます重要!ハードウェア・データ処理に挑戦)Interface 2016年12月号 IoT&スパコン!ラズパイ時代の自分用コンピュータ作り(Role:Contributor, 第6章 Appendix 2 基本演算の高速化が重要!ハードウェア並列ソート・アルゴリズム)Research Grants & Projects
FPGAを用いた超高速ハードウェアソーティングアルゴリズムの開発若手研究Research period: 2019 - 2020
![]() |
氏名 | 藤田 典久 |
Name | Fujita Norihisa | |
Faculty | ||
Section | ||
Position | Assistant Professor | |
Theme | Parallel processing, Interconnection network and Parallel application optimization using accelerators | |
Related Links | ||
fujita![]() |
Research Interests
Academic & Professional Experience
Published Papers
Optimization on Astrophysical Radiative Transfer Code for FPGAs with OpenCL情報処理学会論文誌トランザクション コンピューティングシステム(Web) 12 (3) Jul 2019高位設計と低位設計の違いとFPGA演算性能の関係について情報処理学会第81回全国大会講演論文集 Mar 2019異デバイス間でのPCIe通信を実現するOpenCL対応FPGAモジュールの提案と検証IEICE-RECONF2018-63 IEICE-118 (432) Jan 2019IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2019, Rio de Janeiro, Brazil, May 20-24, 2019 2019[Refereed]IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2019, Rio de Janeiro, Brazil, May 20-24, 2019 2019[Refereed]Proceedings of the 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2019, Nagasaki, Japan, June 6-7, 2019. 2019[Refereed]IJHPCA 33 (5) 2019[Refereed]FPGAによる宇宙輻射輸送シミュレーションの演算加速IEICE-RECONF2018-25 118 (215) Sep 2018複数のFPGAによる分散ソーティングの実現に向けた予備評価Technical report of IEICE. EA 118 (63) May 2018OpenCL-ready High Speed FPGA Network for Reconfigurable High Performance Computing.Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2018, Chiyoda, Tokyo, Japan, January 28-31, 2018 2018[Refereed]Accelerating Space Radiative Transfer on FPGA using OpenCL.Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2018, Toronto, ON, Canada, June 20-22, 2018 2018[Refereed]高位合成によるFPGAの高性能計算へ適用ハイパフォーマンスコンピューティングと計算科学シンポジウム論文集 May 2017[Refereed]密結合並列演算加速機構TCAによるGPU対応GASNetの実装と評価2016年ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2016) 論文集, 2016 Jun 2016[Refereed]High Performance Computing for Computational Science - VECPAR 2016 - 12th International Conference, Porto, Portugal, June 28-30, 2016, Revised Selected Papers 2016[Refereed]2016 International Conference on Computational Science and Computational Intelligence (CSCI) 2016[Refereed]Applying TCA Architecture to QUDA QCD Library for GPUs情報処理学会論文誌トランザクション コンピューティングシステム(Web) 8 (2) Jun 2015[Refereed]2015 IEEE International Conference on Cluster Computing, CLUSTER 2015, Chicago, IL, USA, September 8-11, 2015 2015[Refereed]2015 IEEE International Conference on Cluster Computing, CLUSTER 2015, Chicago, IL, USA, September 8-11, 2015 2015[Refereed]PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW) 2014[Refereed]EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT I 8805 2014[Refereed]2013 19TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2013) 2013[Refereed]Awards & Honors
Books etc
Research Grants & Projects
![]() |
氏名 | 塙 敏博 |
Name | Hanawa Toshihiro | |
Faculty | Information Technology Center, The University of Tokyo / University of Tsukuba | |
Section | ||
Position | Associate Professor / Visiting Associate Professor | |
Theme | High-performance Interconnect, Accelerated Computing, Large-scale Parallel Processing | |
Related Links | ||
![]() |
氏名 | 安永 守利 |
Name | Yasunaga Moritoshi | |
Faculty | Graduate School of Systems and Information Engineering | |
Section | ||
Position | Professor (Collaborative Fellow) | |
Theme | VLSI Engineering, Evolvable Hardware, Dependable Systems | |
Related Links | ||
![]() |
氏名 | 和田 耕一 |
Name | Wada Koichi | |
Faculty | Graduate School of Systems and Information Engineering | |
Section | ||
Position | Professor (Collaborative Fellow) | |
Theme | Parallel and Distributed Computing, Network Architecture for Clusters, Multimedia Processor Architecture | |
Related Links | ||
![]() |
氏名 | 櫻井 鉄也 |
Name | Sakurai Tetsuya | |
Faculty | Graduate School of Systems and Information Engineering | |
Section | ||
Position | Professor (Collaborative Fellow) | |
Theme | Numerical algorithms and simulation, Mathematical software for GRID computing | |
Related Links | ||
![]() |
氏名 | 山口 佳樹 |
Name | Yamaguchi Yoshiki | |
Faculty | Graduate School of Systems and Information Engineering | |
Section | ||
Position | Associate Professor (Collaborative Fellow) | |
Theme | Reconfigurable System, Energy-efficient computer system and architecture, Dependable computer system | |
Related Links | ||
![]() |
氏名 | 今倉 暁 |
Name | Imakura Akira | |
Faculty | Graduate School of Systems and Information Engineering | |
Section | ||
Position | Associate Professor (Collaborative Fellow) | |
Theme | Numerical linear algebra, Algorithms for solving linear systems (Krylov subspace methods and preconditioning techniques) | |
Related Links | ||