Learn

Product Videos

Customer Stories

XM Basecamp

Training & Certification

XM Institute
Implement

XM Marketplace

Templates

Integrations

Free Account
Connect

Community

XM Advocates

X4 Summit
Support

Customer Success Hub

Product Documentation

XM Services

Product Roadmap
See the latest product releases on XM in Action
Watch Now
What is XM?

Customer Experience

Employee Experience

Product Experience

Brand Experience

Market Research

AI
Resources

Customer Stories

eBooks

Original Research

Analyst Reports

Blog

Product Demos

XM Institute
Events

X4 on Tour Sydney

XM Innovation Event

XM+

Events & Webinars
Partnerships

Become a Partner

Accenture + Qualtrics

Deloitte Digital + Qualtrics

EY + Qualtrics

Korn Ferry + Qualtrics

Marketplace
Join us in-person at
X4 on Tour Sydney
Learn more
Careers

Job Openings

Qualtrics Life

Sales

Engineering

Customer Success

Research Services
About

Contact Us

5 For The Fight

Diversity, Equity & Inclusion

Newsroom

Partnerships

Services

Brand Book
We're Hiring!
View Careers

Qualtrics Life
Read more

Products

Customer
Frontlines
OVERVIEW

PRODUCTS
Digital
Care
Location

People
Teams
OVERVIEW

PRODUCTS
Engage
Lifecycle
Analytics

Strategy
& Research
OVERVIEW

PRODUCTS
Research
UX
Brand

XM Platform
OVERVIEW

XM Marketplace
OVERVIEW

XM Services
OVERVIEW

Solutions

Role & Team

Contact Centre
OVERVIEW

Market Research
OVERVIEW

CX Professional
OVERVIEW

Human Resources
OVERVIEW

Digital
OVERVIEW

Product Management
OVERVIEW

Industry

Education
OVERVIEW

Healthcare
OVERVIEW

Technology
OVERVIEW

Retail & CPG
OVERVIEW

Financial Services
OVERVIEW

Government
OVERVIEW

B2B
OVERVIEW

Travel & Hospitality
OVERVIEW

Automotive
OVERVIEW

Popular Solutions
Brand Tracking

Media & Telco
OVERVIEW

Customers

Resources

Company

Try Qualtrics for free

Free Account

What Is Cluster Analysis? When Should You Use It For Your Results?

3 min read
Cluster analysis can be a powerful data-mining tool for any organisation that needs to identify discrete groups of customers, sales transactions, or other types of behaviours and things. For example, insurance providers use cluster analysis to detect fraudulent claims, and banks use it for credit scoring.

Cluster analysis, like reduced space analysis (factor analysis), is concerned with data matrices in which the variables have not been partitioned beforehand into criterion versus predictor subsets.

The objective of cluster analysis is to find similar groups of subjects, where “similarity” between each pair of subjects means some global measure over the whole set of characteristics. In this article we discuss various methods of clustering and the key role that distance plays as measures of the proximity of pairs of points.

Basic Questions in Cluster Analysis

The most common use of cluster analysis is classification. Subjects are separated into groups so that each subject is more similar to other subjects in its group than to subjects outside the group.

We will initially focus on clustering procedures that result in the assignment of each subject to one, and only one, class. Subjects within a class are usually assumed to be indistinguishable from one another. Thus, we assume that the underlying structure of the data involves an unordered set of discrete classes. In some cases we may also view these classes as hierarchical in nature, with some classes divided into subclasses. Clustering procedures can be viewed as “pre-classificatory” in the sense that the researcher has not used prior judgment to partition the subjects (rows of the data matrix). However, it is assumed that some of the objectives are heterogeneous; that is, that “clusters” exist.

This presupposition of different groups is based on commonalities within the set of independent variables. This assumption is different from the one made in the case of discriminant analysis or automatic interaction detection, where the dependent variable is used to formally define groups of objects and the distinction is not made on the basis of profile resemblance in the data matrix itself.

Thus, given that no information on group definition is formally evaluated in advance, the major problems of cluster analysis will be discussed as follows:

What measure of inter-subject similarity is to be used and how is each variable to be “weighted” in the construction of such a summary measure?
After inter-subject similarities are obtained, how are the classes to be formed?
After the classes have been formed, what summary measures of each cluster are appropriate in a descriptive sense; that is, how are the clusters to be defined?
Assuming that adequate descriptions of the clusters can be obtained, what inferences can be drawn regarding their statistical significance?

XM FOR

Customer Frontlines

PRODUCTS

Solutions

eBook

XM FOR

People Teams

PRODUCTS

Solutions

eBook

XM FOR

Strategy & Research

PRODUCTS

Solutions

Free Trial

The Experience Management Platform™

Platform Capabilities

eBook

XM Marketplace

Solution Type

Popular Solutions

XM Services

Advisory

Implementation

Support & Success

Research Services

Solutions for the Contact Centre

Popular Solutions

Educational Resources

Solutions for Market Research

Popular Solutions

Educational Resources

Solutions for CX Professional

Popular Solutions

Educational Resources

Solutions for Human Resources

Popular Solutions

Educational Resources

Solutions for Digital

Popular Solutions

Educational Resources

Solutions for Product Management

Popular Solutions

Educational Resources

Solutions for Education

Popular Solutions

Educational Resources

Solutions for Healthcare

Popular Solutions

Educational Resources

Solutions for Technology

Popular Solutions

Educational Resources

Solutions for Retail & CPG

Popular Solutions

Educational Resources

Solutions for Financial Services

Popular Solutions

Educational Resources

Solutions for Government

Popular Solutions

Educational Resources

Solutions for B2B

Popular Solutions

Educational Resources

Solutions for Travel & Hospitality

Popular Solutions

Educational Resources

Solutions for Automotive

Popular Solutions

Educational Resources

Solutions for Media & Telco

Popular Solutions

Educational Resources

Try Qualtrics for free

What Is Cluster Analysis? When Should You Use It For Your Results?

Basic Questions in Cluster Analysis

Related resources

Sentiment Analysis 20 min read

Report on Survey Findings 3 min read

Customer
Frontlines

People
Teams

Strategy
& Research

The Experience
Management Platform^™

Solutions for the
Contact Centre

Solutions for
Market Research

Solutions for
CX Professional

Solutions for
Human Resources

Solutions for
Digital

Solutions for
Product Management

Solutions for
Education

Solutions for
Healthcare

Solutions for
Technology

Solutions for
Retail & CPG

Solutions for
Financial Services

Solutions for
Government

Solutions for
B2B

Solutions for
Travel & Hospitality

Solutions for
Automotive

Solutions for
Media & Telco

Sentiment Analysis
20 min read

Report on Survey Findings
3 min read

Thematic Analysis
11 min read

Predictive Analytics
19 min read

Statistical significance calculator: Tool & complete guide
18 min read

Data Analysis
32 min read

Regression Analysis
19 min read