Accelerating IDCT Algorithm on Xeon Phi Coprocessor

The Inverse Discrete Cosine Transform (IDCT) is an important operation in image and video decompression, and how to accelerate it has been studied frequently. Intel recently introduced Xeon Phi coprocessors based on the Many Integrated Core (MIC) architecture. Xeon Phi is integrated with...

Journal Title: Advanced materials research 2013, Vol.756-759, p.3114-3120
Main Author: Qi, Jin
Other Authors: Yang, Can Qun; Chen, Cheng; Wu, Qiang; Tang, Tao
Format: Conference paper
Language: English
ID: ISSN: 1662-8985
Staff View
recordid: cdi_crossref_primary_10_4028_www_scientific_net_AMR_756_759_3114
title: Accelerating IDCT Algorithm on Xeon Phi Coprocessor
format: Conference Proceeding
creator:
  • Qi, Jin
  • Yang, Can Qun
  • Chen, Cheng
  • Wu, Qiang
  • Tang, Tao
ispartof: Advanced materials research, 2013, Vol.756-759, p.3114-3120
description: The Inverse Discrete Cosine Transform (IDCT) is an important operation in image and video decompression, and how to accelerate it has been studied frequently. Intel recently introduced Xeon Phi coprocessors based on the Many Integrated Core (MIC) architecture. Xeon Phi integrates 61 cores, each with a 512-bit SIMD extension, and thus provides very high performance. In this paper, we employ Knights Corner (a beta version of Xeon Phi) to accelerate the IDCT algorithm. By employing 512-bit SIMD instructions and data prefetching, our implementation achieves (1) an average speedup of 5.82 over the non-SIMD version, (2) an average performance benefit of 27.3% from the data prefetching optimization, and (3) an average speedup of 1.53 on one Knights Corner coprocessor over the implementation on one eight-core Intel Xeon E5-2670 CPU.
language: eng
source:
identifier: ISSN: 1662-8985
fulltext: no_fulltext
issn:
  • 1662-8985
  • 1662-8985


display
type: conference_proceeding
title: Accelerating IDCT Algorithm on Xeon Phi Coprocessor
creator: Qi, Jin ; Yang, Can Qun ; Chen, Cheng ; Wu, Qiang ; Tang, Tao
creatorcontrib: Qi, Jin ; Yang, Can Qun ; Chen, Cheng ; Wu, Qiang ; Tang, Tao
identifier:
  • ISSN: 1662-8985
  • EISSN: 1662-8985
  • DOI: 10.4028/www.scientific.net/AMR.756-759.3114
language: eng
ispartof: Advanced materials research, 2013, Vol.756-759, p.3114-3120
lds50: peer_reviewed
facets
frbrtype: 5
rsrctype: conference_proceedings
prefilter: conference_proceedings
language: eng
creationdate: 2013
toplevel: peer_reviewed
creatorcontrib:
  • Qi, Jin
  • Yang, Can Qun
  • Chen, Cheng
  • Wu, Qiang
  • Tang, Tao
collection: CrossRef

delivery
delcategory: Remote Search Resource
fulltext: no_fulltext

addata
au:
  • Qi, Jin
  • Yang, Can Qun
  • Chen, Cheng
  • Wu, Qiang
  • Tang, Tao
format: book
genre: proceeding
ristype: CONF
atitle: Accelerating IDCT Algorithm on Xeon Phi Coprocessor
btitle: Advanced materials research
date: 2013-09
risdate: 2013
volume: 756-759
spage: 3114
epage: 3120
pages: 3114-3120
issn: 1662-8985
eissn: 1662-8985
doi: 10.4028/www.scientific.net/AMR.756-759.3114