Principal Component Analysis And Randomness Tests For Big Data Analysis by Mieko TanakaPrincipal Component Analysis And Randomness Tests For Big Data Analysis by Mieko Tanaka

Principal Component Analysis And Randomness Tests For Big Data Analysis

byMieko Tanaka

Hardcover | March 11, 2019

Pricing and Purchase Info

$182.73 online 
$191.95 list price
Earn 914 plum® points

Prices and offers may vary in store


Pre-order online

Ships free on orders over $25

Not yet available in stores


This book presents the novel approach of analyzing large-sized rectangular-shaped numerical data (so-called big data). The essence of this approach is to grasp the "meaning" of the data instantly, without getting into the details of individual data. Unlike conventional approaches of principal component analysis, randomness tests, and visualization methods, the authors' approach has the benefits of universality and simplicity of data analysis, regardless of data types, structures, or specific field of science.

First, mathematical preparation is described. The RMT-PCA and the RMT-test utilize the cross-correlation matrix of time series,C=XXT, whereXrepresents a rectangular matrix ofNrows andLcolumns andXTrepresents the transverse matrix ofX. BecauseCis symmetric, namely,C=CT, it can be converted to a diagonal matrix of eigenvalues by a similarity transformation-1=SCSTusing an orthogonal matrixS. WhenNis significantly large, the histogram of the eigenvalue distribution can be compared to the theoretical formula derived in the context of the random matrix theory (RMT, in abbreviation).

Then the RMT-PCA applied to high-frequency stock prices in Japanese and American markets is dealt with. This approach proves its effectiveness in extracting "trendy" business sectors of the financial market over the prescribed time scale. In this case,Xconsists ofNstock- prices of lengthL, and the correlation matrixCis anNbyNsquare matrix, whose element at thei-th row andj-th column is the inner product of the price time series of the lengthLof thei-th stock and thej-th stock of the equal lengthL.

Next, the RMT-test is applied to measure randomness of various random number generators, including algorithmically generated random numbers and physically generated random numbers.

The book concludes by demonstrating two application of the RMT-test: (1) a comparison of hash functions, and (2) stock prediction by means of randomness.

Mieko Tanaka Professor Graduate School of Engineering, Tottori Koyama-cho Minami, Tottori 680-8550, Japan
Title:Principal Component Analysis And Randomness Tests For Big Data AnalysisFormat:HardcoverPublished:March 11, 2019Publisher:Springer-Verlag/Sci-Tech/TradeLanguage:English

The following ISBNs are associated with this title:

ISBN - 10:4431559043

ISBN - 13:9784431559047

Look for similar items by category:


Table of Contents

Chapter 1 Introduction.- Chapter 2 Big Data Analysis by Using Rectangular-Shaped data: Mathematical Tools.- Chapter 3 Application to Extract Trendy Sectors in Stock Markets (RMT-PCA).- Chapter 4 Application to Measure Randomness of Time Series (RMT-test)- 4-1 Pseudo-Random Generators and Physical Random Generators- 4-2 Relation Between the Randomness of Tick-wise Prices and the Future Performance of Individual Stocks- 4-3 Other Application of the RMT-test.- Chapter 5 Human Random Generation and Its Applications- 5-1 Experimental Conditions- 5-2 Selection of Indices.- Chapter 6 Visualization by Means of Self-Organized Maps (SOM).- Chapter 7 Conclusion and Future Perspectives.