Parallel data reduction techniques for big datasets

Yıldırım, Ahmet Artu; Özdoğan, Cem; Watson, Dan

Link: http://hdl.handle.net/20.500.12416/5909

Date: 2013-10-31

Abstract:

Data reduction is perhaps the most critical component in retrieving information from big data (i.e., petascale-sized data) in many data-mining processes. The central aim of data reduction techniques is to save time and bandwidth, enabling the user to work with larger datasets even in minimal-resource environments such as desktop or small cluster systems. In this chapter, the authors examine why these reduction techniques are important in the analysis of big datasets. They then present several basic reduction techniques in detail, stressing the advantages and disadvantages of each. The authors also consider signal processing techniques for mining big data using the discrete wavelet transform, as well as server-side data reduction techniques. Lastly, they include a general discussion of parallel algorithms for data reduction, with special emphasis on parallel wavelet-based multi-resolution data reduction techniques on distributed-memory systems using MPI and on shared-memory architectures on GPUs, along with a demonstration of the improvement in performance and scalability for one case study.

Files in this item:

There are no files associated with this item.

This item appears in the following collection(s):

Fizik Bilim Dalı Yayın Koleksiyonu (Physics Division Publication Collection) [25]
Contains publications of the Physics Division.
