面向万亿次量级嵌入式应用的高效能计算模型与体系结构技术

61033008
2010
F0204.计算机系统结构与硬件技术
张春元
重点项目
教授
中国人民解放军国防科技大学
230万元
流处理器;万亿次;嵌入式计算;SDF模型;高效能
2011-01-01到2014-12-31
  • 中英文摘要
  • 结题摘要
  • 结题报告
  • 项目成果
  • 项目参与人
查看更多信息请先登录或注册
查看更多信息请先登录或注册
查看更多信息请先登录或注册
重置
序号 标题 类型 作者
1 On the GPU performance of cell-centered finite volume method over unstructured tetrahedral meshes 会议论文 Johannes Langguth|Nan Wu|Jun Chai|Xing Cai|
2 基于程序特征分析的流处理器VLIW压缩技术与解压实现 期刊论文 管茂林|何义|杨乾明|张春元|
3 Solving the cardiac model using Multi-core CPU and Many Integrated cores (MIC) 会议论文 Jing Yang|Jun Chai|Mei Wen|Nan Wu|Chunyuan Zhang|
4 Towards simulation of subcellular Calcium dynamics at nanometer resolution 期刊论文 Johan Hake|Nan Wu|Mei Wen|Xing Cai|Glenn T.li|Jing Yang|Huayou Su|Chunyuan Zhang|Xiangke Liao|
5 An Efficient Parallel Deblocking Filter Based on GPU: Implementation and Optimization 会议论文 Huayou Su, Chunyuan Zhang, Jun Chai, Qianming Yan|
6 Simulating cardiac electrophysiology in the era of GPU-cluster computing 期刊论文 Jun Chai|Mei Wen|Nan Wu|Dafei Huang|Jing Yang|Xing Cai|Chunyuan Zhang|Qianming Yang|
7 Extending BORPH for shared memory reconfigurable computers 会议论文 Xun Changqing|Wen Mei|Wu Nan|Zhang Chunyuan|So Hayden Kwok-Hay|
8 Automatic Stitching System for Images of Plane-like Scenes 会议论文 Dafei Huang|Mei Wen|Yungang Xue|Nan Wu|Ju Ren|Chunyuan Zhang|
9 High Efficient Sedimentary Basin Simulations on Hybrid CPU-GPU Clusters 期刊论文 Mei Wen|Huayou Su|Wenjie Wei|Nan Wu|Xing Cai|Chunyuan Zhang|
10 Improving Performance of GPU Specific OpenCL Program on CPUs 会议论文 Qiang Lan, Changqing Xun, Mei Wen, Huayou Su, Lif|
11 Accelerating thread-intensive and explicit memory management programs with dynamic partial reconfiguration 期刊论文 Yang Qianming|Wen Mei|Wu Nan|Zhang Chunyuan|
12 TISA: Reconfigurable System for Template-Based Stream Computing 期刊论文 Yang Qianming|Wu Nan|Wen Mei|Quan Wei|Zhang Chunyuan|
13 Parallelization Design of Irregular Algorithms of Video Processing on GPUs 会议论文 Huayou Su, Jun Chai, Mei Wen, Ju Ren, Chunyuan Zh|
14 Extending Borph for Shared Memory Reconfigurable Computers 会议论文 Xun Changqing, Wen Mei, Wu Nan, Zhang Chunyuan|
15 Parallelization design of irregular algorithms of video processing on GPUs 会议论文 Su Huayou|Chai Jun|Wen Mei|Ren Ju|Zhang Chunyuan|
16 Accelerating Thread-intensive and Explicit Memory Management Programs with Dynamic Partial Reconfiguration 期刊论文 Qianming Yang|Mei Wen|Nan Wu|Chunyuan Zhang|
17 <span style="font-family:;font-size:12pt;">A Computational Model of the Short-Cut Rule for 2D Shape Decomposition</span> 期刊论文 Lei Luo|Chunhua Shen|Xinwang Liu|Chunyuan Zhang|
18 A Parallel H.264 Encoder with CUDA: Mapping and Evaluation 会议论文 Nan Wu , Mei Wen , Huayou Su , Ju Ren , Chunyuan|
19 共享存储可重构计算机软硬件通信的优化实现 期刊论文 荀长庆|杨乾明|伍楠|文梅|张春元|
20 Automatic Stitching System for Images of Plane-like Scenes 会议论文 Dafei Huang, Mei Wen, Yungang Xue, Nan Wu, Ju Ren|
21 Fully distributed on-chip instruction memory design for stream architecture based on field-divided VLIW compression 会议论文 He Yi|Guan Maolin|Zhang Chunyuan|Tian Tian|Yang Qianming|
22 Resource-efficient utilization of CPU/GPU-based heterogeneous supercomputers for Bayesian phylogenetic inference 期刊论文 Jun Chai|Huayou Su|Mei Wen|Xing Cai|Nan Wu|Chunyuan Zhang|
23 Performance of Sediment Transport Simulations on NVIDIA&rsquo;s Kepler Architecture 会议论文 Huayou Su|Nan Wu|Mei Wen|Chunyuan Zhang|Xing Cai|
24 High-Efficient Parallel CAVLC Encoders on Heterogeneous Multicore Architectures 期刊论文 Su Huayou|Wen Mei|Ren Ju|Wu Nan|Chai Jun|Zhang Chunyuan|
25 High-Efficient Software Parallel CAVLC Encoder Based on Programmable Stream Processor 会议论文 Huayou Su|Chunyuan Zhang|Jun Chai|Mei Wen|Nan Wu|
26 Automatic mapping single-device OpenCL program to heterogeneous multi-device platform 会议论文 Dong Chen|Changqing Xun|Mei Wen|Chunyuan Zhang|
27 Fully Distributed on-chip Instruction Memory Design for Stream Architecture Based on Field-Divided VLIW Compression 会议论文 Yi He, Maolin Guan, Chunyuan Zhang, Tian Tian, Qi|
28 Using 1000+ GPUs and 10000+ CPUs for sedimentary basin simulations 会议论文 Wen Mei|Su Huayou|Wei Wenjie|Wu Nan|Cai Xing|Zhang Chunyuan|
29 High-Performance Implementation of Stream Model Based H.264 Video Coding on Parallel Processors 会议论文 Nan Wu|Mei Wen|Ju Ren|Huayou Su|Dafei Huang|
30 Utilizing Multiple Xeon Phi Coprocessors on One Compute Node [C] 会议论文 Xinnan Dong|Jun Chai|Jing Yang|Mei Wen|Nan Wu|Xing Cai|Chunyuan Zhang|Zhaoyun Chen|
31 流处理器中IO单元复用方法 专利 *管茂林; 荀长庆; 张春元; 杨乾明; 何义; 文梅; 伍楠; 任巨; 吴伟; 柴俊; 苏华友; 全巍
32 流体系结构指令存储器优化设计研究 期刊论文 管茂林|何义|杨乾明|张春元|伍楠|
33 An Adaptive Low-Overhead Mechanism for Dependable General-Purpose Many-Core Processors 会议论文 Wentao Jia|Chunyuan Zhang|Jian Fu|
34 Efficient fine grained shared buffer management for multiple OpenCL devices 期刊论文 Changqing Xun|Dong Chen|Qiang Lan|Chunyuan Zhang|
35 A Hybrid Task Mapping Algorithm for Heterogeneous MPSoCs 期刊论文 W. Quan|A. D. Pimentel|
36 基于CUDA的暗原色先验去雾算法并行实现与优化 会议论文 薛云刚|任巨|苏华友|文梅|张春元|
37 Maximum Variance Hashing via Column Generation 期刊论文 Lei Luo|Chao Zhang|Yongrui Qin|Chunyuan Zhang|
38 Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion 期刊论文 Lei Luo|Chunhua Shen|Chunyuan Zhang|Anton van den Hengel|
39 ACF: Networks-on-chip Deadlock Recovery with Accurate Detection and Elastic Credit 会议论文 Nan Wu|Yuran Qiao|Mei Wen|Chunyuan Zhang|
40 流体系结构指令存储器优化设计研究 期刊论文 管茂林, 何义, 杨乾明, 张春元, 伍楠|
41 High-Efficient Parallel CAVLC Encoders on Heterogeneous Multicore Architectures 期刊论文 Huayou Su, Mei Wen, Ju Ren, Nan Wu, Jun Chai, Chu|
42 On the GPU Performance of 3D Stencil Computations Implemented in OpenCL 会议论文 Huayou Su|Nan Wu|Mei Wen|Chunyuan Zhang|Xing Cai|
43 Device view redundancy: an adaptive low-overhead fault tolerance mechanism for many-core system 会议论文 Wentao Jia|Chunyuan Zhang|Jian Fu|
44 面向全分布式VLIW结构的部分互连研究 会议论文 施自龙|杨乾明|文梅|张春元|伍楠|乔寓然|
45 Balancing efficiency and accuracy for sediment transport simulations. 期刊论文 Wenjie Wei|Stuart R. Clark|Huayou Su|Mei Wen|Xing Cai|
46 TISA: Reconfigurable System for Template-Based Stream Computing 期刊论文 YANG Qianming, WU Nan, WEN Mei, QUAN Wei, ZHANG C|
47 An Energy-Efficient Processor Core for Massively Parallel Computing 会议论文 Qianming Yang|Nan Wu|Maolin Guan|Chunyuan Zhang|Jun Cai|
48 Enabling a Uniform OpenCL Device View for Heterogeneous Platforms 期刊论文 Dafei HUANG, Changqing XUN, Nan WU, Mei WEN, Chun|
49 ACF: Networks-on-chip Deadlock Recovery with Accurate Detection and Elastic Credit 会议论文 Nan Wu, Yuran Qiao, Mei Wen, Chunyuan Zhang|
50 Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion 期刊论文 Lei Luo|Chunhua Shen|Chunyuan Zhang|Anton van|
51 On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations 会议论文 Huayou Su|Nan Wu|Mei Wen|Chunyuan Zhang|Xing Cai|
52 Improving Performance of GPU Specific OpenCL Program on CPUs 会议论文 Qiang Lan|Changqing Xun|Mei Wen|Huayou Su|Lifang Liu|Chunyuan Zhang|
53 共享存储可重构计算机软硬件通信的优化实现 期刊论文 荀长庆|杨乾明|伍楠|文梅|张春元|
54 Architecting Dependable Many-Core Processors Using Core-Level Dynamic Redundancy 会议论文 Wentao Jia|Chunyuan Zhang|Jian Fu|Rui Li|
55 流计算和视频编码 专著 张春元|文梅|苏华友|伍楠|任巨|
56 Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUs 会议论文 Dafei Huang|Mei Wen|Changqing Xun|Dong Chen|Chunyuan Zhang|
57 A Parallel H.264 Encoder with CUDA: Mapping and Evaluation 会议论文 Nan Wu|Mei Wen|Huayou Su|Ju Ren|Chunyuan Zhang|
58 <span style="font-size:12.0pt;font-family:" color:black;"="">Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation</span> 期刊论文 Huayou Su|Mei Wen|Nan Wu|Ju Ren|Chunyuan Zhang|
查看更多信息请先登录或注册