Vendor : SAS Institute
E-mail this page
Related Content
Remember this itemFormat: PDF
Date:
15/02/2008
Overview
In data mining, principal component analysis is a popular dimension reduction technique. It also provides a good remedy for the multicollinearity problem, but its interpretation of input space is not as good. To overcome the interpretation problem, principal components (cluster components) are obtained through variable clustering, which was implemented with PROC VARCLUS. The procedure uses oblique principal components analysis and binary iterative splits for variable clustering, and it provides non-orthogonal principal components. Even if this procedure sacrifices the orthogonal property among principal components, it provides good interpretable principal components and well-explained cluster structures of variables. However, the PROC VARCLUS implementation is inefficient to deal with high-dimensional data. This paper introduces the two-stage, variable clustering technique for large data sets.
|
|
MY | Designed for Business Success.
Read about Dell’s highly-rated or CNET Editor’s Choice laptops
Rapid Application Development
Click here to download CodeGear RAD Studio 2009 for native Windows & .NET development
Download RAD Studio 2009 now
The complete solution for rapid application development from CodeGear
Protect Your Business Critical Systems
With award-winning disaster recovery solutions by NEC.
Free IT Salary Report 2009
Register and be the first to download this invaluable resource
Find out who you should be spending your IT budget with