1 |
On the GPU performance of cell-centered finite volume method over unstructured tetrahedral meshes
|
Conference paper |
Johannes Langguth|Nan Wu|Jun Chai|Xing Cai| |
2 |
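VLIW Compression Techniques and Decompression Implementation for Stream Processors Based on Program Characteristic Analysis
|
Journal article |
Guan Maolin|He Yi|Yang Qianming|Zhang Chunyuan| |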
3 |
Solving the cardiac model using Multi-core CPU and Many Integrated cores (MIC)
|
Conference paper |
Jing Yang|Jun Chai|Mei Wen|Nan Wu|Chunyuan Zhang| |
4 |
Towards simulation of subcellular Calcium dynamics at nanometer resolution
|
Journal article |
Johan Hake|Nan Wu|Mei Wen|Xing Cai|Glenn T.li|Jing Yang|Huayou Su|Chunyuan Zhang|Xiangke Liao| |
5 |
An Efficient Parallel Deblocking Filter Based on GPU: Implementation and Optimization
|
Conference paper |
Huayou Su, Chunyuan Zhang, Jun Chai, Qianming Yan| |
6 |
Simulating cardiac electrophysiology in the era of GPU-cluster computing
|
Journal article |
Jun Chai|Mei Wen|Nan Wu|Dafei Huang|Jing Yang|Xing Cai|Chunyuan Zhang|Qianming Yang| |
7 |
Extending BORPH for shared memory reconfigurable computers
|
Conference paper |
Xun Changqing|Wen Mei|Wu Nan|Zhang Chunyuan|So Hayden Kwok-Hay| |
8 |
Automatic Stitching System for Images of Plane-like Scenes
|
Conference paper |
Dafei Huang|Mei Wen|Yungang Xue|Nan Wu|Ju Ren|Chunyuan Zhang| |
9 |
High Efficient Sedimentary Basin Simulations on Hybrid CPU-GPU Clusters
|
Journal article |
Mei Wen|Huayou Su|Wenjie Wei|Nan Wu|Xing Cai|Chunyuan Zhang| |
10 |
Improving Performance of GPU Specific OpenCL Program on CPUs
|
Conference paper |
Qiang Lan, Changqing Xun, Mei Wen, Huayou Su, Lif| |
11 |
Accelerating thread-intensive and explicit memory management programs with dynamic partial reconfiguration
|
Journal article |
Yang Qianming|Wen Mei|Wu Nan|Zhang Chunyuan| |
12 |
TISA: Reconfigurable System for Template-Based Stream Computing
|
Journal article |
Yang Qianming|Wu Nan|Wen Mei|Quan Wei|Zhang Chunyuan| |
13 |
Parallelization Design of Irregular Algorithms of Video Processing on GPUs
|
Conference paper |
Huayou Su, Jun Chai, Mei Wen, Ju Ren, Chunyuan Zh| |
14 |
Extending Borph for Shared Memory Reconfigurable Computers
|
Conference paper |
Xun Changqing, Wen Mei, Wu Nan, Zhang Chunyuan| |
15 |
Parallelization design of irregular algorithms of video processing on GPUs
|
Conference paper |
Su Huayou|Chai Jun|Wen Mei|Ren Ju|Zhang Chunyuan| |
16 |
Accelerating Thread-intensive and Explicit Memory Management Programs with Dynamic Partial Reconfiguration
|
Journal article |
Qianming Yang|Mei Wen|Nan Wu|Chunyuan Zhang| |
17 |
A Computational Model of the Short-Cut Rule for 2D Shape Decomposition
|
Journal article |
Lei Luo|Chunhua Shen|Xinwang Liu|Chunyuan Zhang| |
18 |
A Parallel H.264 Encoder with CUDA: Mapping and Evaluation
|
Conference paper |
Nan Wu, Mei Wen, Huayou Su, Ju Ren, Chunyuan| |
19 |
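Optimized Implementation of Hardware-Software Communication on Shared-Memory Reconfigurable Computers
|
Journal article |
Xun Changqing|Yang Qianming|Wu Nan|Wen Mei|Zhang Chunyuan| |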
20 |
Automatic Stitching System for Images of Plane-like Scenes
|
Conference paper |
Dafei Huang, Mei Wen, Yungang Xue, Nan Wu, Ju Ren| |
21 |
Fully distributed on-chip instruction memory design for stream architecture based on field-divided VLIW compression
|
Conference paper |
He Yi|Guan Maolin|Zhang Chunyuan|Tian Tian|Yang Qianming| |
22 |
Resource-efficient utilization of CPU/GPU-based heterogeneous supercomputers for Bayesian phylogenetic inference
|
Journal article |
Jun Chai|Huayou Su|Mei Wen|Xing Cai|Nan Wu|Chunyuan Zhang| |
23 |
Performance of Sediment Transport Simulations on NVIDIA’s Kepler Architecture
|
Conference paper |
Huayou Su|Nan Wu|Mei Wen|Chunyuan Zhang|Xing Cai| |
24 |
High-Efficient Parallel CAVLC Encoders on Heterogeneous Multicore Architectures
|
Journal article |
Su Huayou|Wen Mei|Ren Ju|Wu Nan|Chai Jun|Zhang Chunyuan| |
25 |
High-Efficient Software Parallel CAVLC Encoder Based on Programmable Stream Processor
|
Conference paper |
Huayou Su|Chunyuan Zhang|Jun Chai|Mei Wen|Nan Wu| |
26 |
Automatic mapping single-device OpenCL program to heterogeneous multi-device platform
|
Conference paper |
Dong Chen|Changqing Xun|Mei Wen|Chunyuan Zhang| |
27 |
Fully Distributed on-chip Instruction Memory Design for Stream Architecture Based on Field-Divided VLIW Compression
|
Conference paper |
Yi He, Maolin Guan, Chunyuan Zhang, Tian Tian, Qi| |
28 |
Using 1000+ GPUs and 10000+ CPUs for sedimentary basin simulations
|
Conference paper |
Wen Mei|Su Huayou|Wei Wenjie|Wu Nan|Cai Xing|Zhang Chunyuan| |
29 |
High-Performance Implementation of Stream Model Based H.264 Video Coding on Parallel Processors
|
Conference paper |
Nan Wu|Mei Wen|Ju Ren|Huayou Su|Dafei Huang| |
30 |
Utilizing Multiple Xeon Phi Coprocessors on One Compute Node
|
Conference paper |
Xinnan Dong|Jun Chai|Jing Yang|Mei Wen|Nan Wu|Xing Cai|Chunyuan Zhang|Zhaoyun Chen| |
31 |
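Method for Multiplexing I/O Units in a Stream Processor
|
Patent |
*Guan Maolin; Xun Changqing; Zhang Chunyuan; Yang Qianming; He Yi; Wen Mei; Wu Nan; Ren Ju; Wu Wei; Chai Jun; Su Huayou; Quan Wei |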
32 |
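Research on Optimized Instruction Memory Design for Stream Architectures
|
Journal article |
Guan Maolin|He Yi|Yang Qianming|Zhang Chunyuan|Wu Nan| |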
33 |
An Adaptive Low-Overhead Mechanism for Dependable General-Purpose Many-Core Processors
|
Conference paper |
Wentao Jia|Chunyuan Zhang|Jian Fu| |
34 |
Efficient fine grained shared buffer management for multiple OpenCL devices
|
Journal article |
Changqing Xun|Dong Chen|Qiang Lan|Chunyuan Zhang| |
35 |
A Hybrid Task Mapping Algorithm for Heterogeneous MPSoCs
|
Journal article |
W. Quan|A. D. Pimentel| |
36 |
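Parallel Implementation and Optimization of the Dark Channel Prior Dehazing Algorithm Based on CUDA
|
Conference paper |
Xue Yungang|Ren Ju|Su Huayou|Wen Mei|Zhang Chunyuan| |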
37 |
Maximum Variance Hashing via Column Generation
|
Journal article |
Lei Luo|Chao Zhang|Yongrui Qin|Chunyuan Zhang| |
38 |
Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion
|
Journal article |
Lei Luo|Chunhua Shen|Chunyuan Zhang|Anton van den Hengel| |
39 |
ACF: Networks-on-chip Deadlock Recovery with Accurate Detection and Elastic Credit
|
Conference paper |
Nan Wu|Yuran Qiao|Mei Wen|Chunyuan Zhang| |
40 |
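Research on Optimized Instruction Memory Design for Stream Architectures
|
Journal article |
Guan Maolin, He Yi, Yang Qianming, Zhang Chunyuan, Wu Nan| |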
41 |
High-Efficient Parallel CAVLC Encoders on Heterogeneous Multicore Architectures
|
Journal article |
Huayou Su, Mei Wen, Ju Ren, Nan Wu, Jun Chai, Chu| |
42 |
On the GPU Performance of 3D Stencil Computations Implemented in OpenCL
|
Conference paper |
Huayou Su|Nan Wu|Mei Wen|Chunyuan Zhang|Xing Cai| |
43 |
Device view redundancy: an adaptive low-overhead fault tolerance mechanism for many-core system
|
Conference paper |
Wentao Jia|Chunyuan Zhang|Jian Fu| |
44 |
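Research on Partial Interconnection for Fully Distributed VLIW Architectures
|
Conference paper |
Shi Zilong|Yang Qianming|Wen Mei|Zhang Chunyuan|Wu Nan|Qiao Yuran| |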
45 |
Balancing efficiency and accuracy for sediment transport simulations
|
Journal article |
Wenjie Wei|Stuart R. Clark|Huayou Su|Mei Wen|Xing Cai| |
46 |
TISA: Reconfigurable System for Template-Based Stream Computing
|
Journal article |
YANG Qianming, WU Nan, WEN Mei, QUAN Wei, ZHANG C| |
47 |
An Energy-Efficient Processor Core for Massively Parallel Computing
|
Conference paper |
Qianming Yang|Nan Wu|Maolin Guan|Chunyuan Zhang|Jun Cai| |
48 |
Enabling a Uniform OpenCL Device View for Heterogeneous Platforms
|
Journal article |
Dafei HUANG, Changqing XUN, Nan WU, Mei WEN, Chun| |
49 |
ACF: Networks-on-chip Deadlock Recovery with Accurate Detection and Elastic Credit
|
Conference paper |
Nan Wu, Yuran Qiao, Mei Wen, Chunyuan Zhang| |
50 |
Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion
|
Journal article |
Lei Luo|Chunhua Shen|Chunyuan Zhang|Anton van| |
51 |
On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations
|
Conference paper |
Huayou Su|Nan Wu|Mei Wen|Chunyuan Zhang|Xing Cai| |
52 |
Improving Performance of GPU Specific OpenCL Program on CPUs
|
Conference paper |
Qiang Lan|Changqing Xun|Mei Wen|Huayou Su|Lifang Liu|Chunyuan Zhang| |
53 |
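Optimized Implementation of Hardware-Software Communication on Shared-Memory Reconfigurable Computers
|
Journal article |
Xun Changqing|Yang Qianming|Wu Nan|Wen Mei|Zhang Chunyuan| |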
54 |
Architecting Dependable Many-Core Processors Using Core-Level Dynamic Redundancy
|
Conference paper |
Wentao Jia|Chunyuan Zhang|Jian Fu|Rui Li| |
55 |
Stream Computing and Video Coding
|
Monograph |
Zhang Chunyuan|Wen Mei|Su Huayou|Wu Nan|Ren Ju| |
56 |
Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUs
|
Conference paper |
Dafei Huang|Mei Wen|Changqing Xun|Dong Chen|Chunyuan Zhang| |
57 |
A Parallel H.264 Encoder with CUDA: Mapping and Evaluation
|
Conference paper |
Nan Wu|Mei Wen|Huayou Su|Ju Ren|Chunyuan Zhang| |
58 |
Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation
|
Journal article |
Huayou Su|Mei Wen|Nan Wu|Ju Ren|Chunyuan Zhang| |