Akamai acquires LayerX, delivering end-to-end security and real-time AI usage control to any browser. Get details

Back Products Close

Cloud Computing

Cybersecurity

Content Delivery

See all products

Our Infrastructure

Global Services

Back Cloud Computing Close

Artificial intelligence (AI)

Akamai Inference Cloud

Storage

Object Storage

Block Storage

Backups

Databases

Managed Databases

compute

GPU

CPU

Kubernetes

App Platform

Accelerated Compute

Serverless

Akamai Functions

Networking

Cloud Firewall

DNS Manager

NodeBalancers

Private Networking

View cloud pricing

Explore plans and pricing that fit your needs — from small projects to global-scale deployments.

See pricing

Get started with Akamai Cloud

Sign up today and unlock cloud computing, edge, and AI tools built for your business.

Sign up

See all Cloud Computing

Back Cybersecurity Close

app and api security

API Security

App & API Protector

Firewall for AI

Client-Side Protection & Compliance

Bot & Agent Control

Account Protector

Content Protector

Bot Manager

AI Brand Presence

Segmentation

Akamai Guardicore Segmentation

zero trust security

Akamai Workforce Protector (formerly LayerX)

Secure Internet Access

Enterprise Application Access

Akamai MFA

Identity, Credential, and Access Management

infrastructure security

Edge DNS

Prolexic

IP Accelerator

DNS Posture Management

Brand Guardian

Get started with Security

Protect the applications that drive your business — every day, every time.

Contact Sales

See all Cybersecurity

Back Content Delivery Close

Application performance

Ion

API Acceleration

IP Accelerator

Media Delivery

Adaptive Media Delivery

Download Delivery

Edge Applications

EdgeWorkers

EdgeKV

Image & Video Manager

Media Services Live

Cloudlets

Cloud Wrapper

Global Traffic Management

Monitoring, reporting and testing

Data Stream

mPulse

CloudTest

Get started with Content Delivery

Trust the agility and scale of Akamai to help you flawlessly deliver extraordinary digital experiences.

Contact Sales

See all Content Delivery

Back Solutions Close

Cloud Computing

Serverless

Media

SaaS

Gaming

See all Cloud Computing

security

Frontier AI Security Risks

Akamai Application Protection Platform

Cybersecurity Compliance

Ransomware Protection

Secure Apps and APIs

DNS Delivery and Security

Zero Trust

DDoS Protection

Bot & Agent Control

Identity, Credential and Access Management

See all Cybersecurity

content delivery

App and API Performance

Media Delivery

See all Content Delivery

industry solutions

Media and Entertainment

Retail, Travel, and Hospitality

Financial Services

Healthcare and Life Sciences

Public Sector

Defense

Games

Online Sports Betting and iGaming

Service Providers

See all Industry Solutions

Back Pricing Close

Security and Delivery

Get started

Contact Sales

Free trials

Cloud pricing

GLOBAL PRICING

North America pricing

Europe pricing

Asia Pacific pricing

South America pricing

SPECIFIC LOCAL PRICING

Jakarta pricing

See all pricing

Cloud pricing

Try Akamai Cloud with US$100 in credits*

Deploy faster with global cloud infrastructure — no surprise bills, no lock-in, and transparent pricing across every data center.

Try now

*See Promotion Redemption Rules & Conditions

Back Developers Close

Cloud developers

Developer hub

Akamai GitHub repo

docs and guides

Cloud docs

Guides and tutorials

cloud marketplace

Developer apps

Get started with Akamai Cloud

Sign up today and unlock cloud computing, edge and AI tools built for your business.

Sign up

Back Resources Close

What’s new

Akamai blog

Events and workshops

Learning

White papers, ebooks, videos, product briefs

Customer stories

Training and certifications

Cybersecurity Research

Akamai Security Intelligence Group (SIG)

State of Internet (SOTI) reports

Partners

Partner with Akamai to innovate, scale, and grow your advantage

Channel Partners

Partner Portal

Partner Stories

Technology Partners

Technology Partners Directory

Log in

Back Log in Close

Cloud Manager
Manage your cloud computing services

Back Log in Close

Control Center
Manage your security and delivery services
- Docs
- Sales
- Support
- Under Attack ?
English
Back Language Close
- English
- Deutsch
- Español
- Français
- Italiano
- Português
- 中文
- 日本語
- 한국어

Create account

Under Attack?

Akamai Cloud

Akamai Security and Delivery

Connect with our Sales team to discuss your business needs and find the right solutions.

Contact Sales

What Is Locally Linear Embedding

Understanding Locally Linear Embedding

Locally linear embedding (LLE) is a method used in artificial intelligence (AI) to simplify large and complex datasets. This process, known as nonlinear dimensionality reduction, helps make data easier to work with while keeping ‌important patterns and relationships intact. Created by Sam T. Roweis and Lawrence K. Saul in 2000, the LLE algorithm is especially useful for understanding and visualizing data with many features or variables. It is widely used because it can capture complex, nonlinear structures in data and is a key tool in manifold learning, a field that studies how to represent high-dimensional data in lower dimensions.

The need for dimensionality reduction

In many fields, datasets often have hundreds or even thousands of features. This is called high-dimensional data. Working with such data structures can be slow and make it hard to spot patterns. Dimensionality reduction techniques, like locally linear embedding, simplify these datasets by reducing the number of features while keeping the most important information. This makes it easier for AI programs and models to analyze and use the data effectively. Dimensionality reduction also supports data analysis, enabling better data visualization and interpretation of complex relationships.

Key concepts and technologies

These terms and concepts are essential for understanding locally linear embedding better.

Datasets: Datasets are collections of data points. Each data point is a single observation, like one person’s height and weight in a health study.
High-dimensional data: This refers to datasets with many features or variables, like a spreadsheet with thousands of columns. These datasets are hard to analyze because they contain too much information to visualize or process easily.
Data shapes: Data shapes refer to the underlying organization or patterns that emerge when we examine datasets. For instance, a dataset of photos may have patterns based on how lighting changes across images or how objects are positioned.
Feature space: The feature space is the space in which data points are represented by their features. Each dimension corresponds to a specific feature, and the position of a data point in this space reflects its values for those features.
Manifolds: A manifold is a data shape that resembles a geometric space like an orb, a donut, or a winding path, but where the relationships between local data points seem flat or linear.
Dimensional embedding: This means taking data with many features and representing it in a smaller number of features while keeping its key patterns intact.
Local relationships: These describe how data points relate to their closest neighbors, often referred to as local neighborhoods. Preserving these relationships helps to maintain the local structure of the data.
Optimization: This is the process of finding the best solution to a problem. In LLE, optimization is used to figure out how to represent data points using their neighbors while minimizing the cost function.
Weight matrix: This is a table that shows how much each neighbor contributes to representing a data point. The weights add up to 1 for each data point, and each weight, often denoted as wij, represents the influence of a neighbor in reconstructing the original data.

Overview of locally linear embedding

Locally linear embedding reduces the number of features in a dataset while keeping the smaller patterns and connections intact. It assumes that the data has a simpler underlying shape that can be represented in fewer dimensions. Unlike principal component analysis (PCA), which looks at global structure in the data, LLE focuses on preserving local relationships between small groups of data points.

Key ideas in LLE:

Manifold structure: The data is assumed to lie on a smooth, curved surface within a high-dimensional space. These surfaces, also called nonlinear manifolds, hold the intrinsic structure of the data.
Local linearity: LLE focuses on small groups of neighboring data points. It captures the local geometry of the data manifold by examining the structure within each neighborhood. The relationships between a point and its nearest neighbors are approximately linear and are represented using mathematical methods, like least squares, to find out how well their neighbors can represent points.

How locally linear embedding works

LLE simplifies complex data in three steps. Here’s how it works, using a dataset of handwritten numbers or digits as an example.

Find nearby data points: For each data point, a locally linear embedding identifies its closest neighbors. These neighbors are small groups of points that are used to represent local relationships. Nearby points can be found in two main ways:

k-nearest neighbors: This method finds the closest points for each data point based on their distances from it, using a “k” parameter. For example, if k=5, the algorithm looks for the five closest points to represent the local neighborhood. In the handwritten digit dataset, this step would involve identifying the five most similar digit images for each sample based on pixel intensity patterns using metrics like Euclidean distance.
ε-ball: Instead of a fixed number of neighbors, this method includes all points within a certain distance (ε) from a data point. For instance, all digit images within a specific similarity range would be included in the neighborhood. A small ε might focus on digits with very similar strokes, while a larger ε might include more diverse handwriting styles.

Calculate how neighbors fit together: Each data point is written as a combination of its neighbors. The algorithm adjusts the reconstruction weights to find the best fit. In the handwritten digit dataset, this step involves figuring out how much each neighboring digit image contributes to approximating the current image. For example, the image of the digit “3” might be represented by a weighted combination of similar “3” images in the dataset. The weight matrix ensures these relationships are captured mathematically, and the weights are optimized iteratively to minimize the reconstruction error while maintaining sparse matrix representations to reduce computational overhead.

Create a simpler version of the data: Finally, locally linear embedding creates a simpler version of the dataset. It uses ‌weights to make sure the relationships between data points stay the same, even in fewer dimensions. For the handwritten digit dataset, this would result in a low-dimensional representation where similar digits (like different styles of “3”) are grouped close together. This embedding could then be visualized in 2D or 3D, making it easier to identify patterns, clusters, or outliers in the dataset. The global geometric framework ensures that while local relationships are preserved, the structure in the embedding space is meaningful.

Advantages of locally linear embedding

Focuses on small patterns: LLE is very good at keeping the small, local relationships between data points, making it ideal for analyzing complex data.
Works with nonlinear data: LLE can handle data where patterns aren’t straight lines or simple shapes, unlike methods like PCA.
Avoids big assumptions: By focusing on small groups of points, LLE doesn’t need to make guesses about the structure of the data.

Limitations of locally linear embedding

Takes time for big datasets: LLE uses complex calculations that can be slow when working with the very large datasets in big data analytics.
Needs the right settings: Choosing the number of neighbors or the distance for groups is tricky and can affect the results. LLE may perform poorly if the number of neighbors isn’t chosen appropriately.
Sensitive to messy data: LLE works best with clean data. Noisy or random points can disrupt its results.
Hard to scale: As datasets grow, the memory and time required for LLE increase quickly.

Locally linear embedding in AI models

Locally linear embedding is useful in many areas of AI technology.

Working with images: LLE reduces the size of image data, making tasks like recognizing faces or compressing images faster and more efficient. It is often shown using the “Swiss roll” example, which demonstrates how it handles curved patterns in data.
Simplifying text data: In language tasks, LLE helps break down text data into simpler numbers, making it easier to analyze.
Improving neural networks: By simplifying the input data, LLE makes neural networks more accurate and neural computation faster.
Finding patterns: LLE helps recognize patterns, like identifying handwriting or understanding speech.
Spotting unusual points: By focusing on local relationships, LLE can identify points that don’t fit with the rest of the data, such as anomalies or errors.

Comparisons with other methods

Locally linear embedding is just one way to reduce data dimensions. Here’s how it compares to other algorithms for dimensionality reduction:

t-SNE: This method is great for visualizing data but is harder to interpret compared to LLE. It focuses on preserving local similarities, making it well suited for visualization tasks like clustering.
Isomap: Isomap emphasizes global relationships, maintaining overall patterns in the data. It works well for datasets with simple structures but struggles with highly complex or noisy data. Isomap also utilizes the graph Laplacian to analyze the intrinsic geometry of data, and the graph ensures that local relationships and manifold structures are preserved during dimensionality reduction.
PCA: PCA is a linear method that captures straight-line relationships. It’s fast and effective for simple datasets but can’t handle nonlinear patterns like LLE can. Kernel PCA extends this by projecting data into a higher-dimensional space, allowing it to capture nonlinear patterns that standard PCA can’t.
Multidimensional scaling (MDS): MDS aims to preserve pairwise distances between points in a dataset. While it can be effective for specific tasks, it doesn’t prioritize local groupings of points as LLE does.

Variants and improvements of locally linear embedding

To make locally linear embedding better, researchers have created new versions:

Modified LLE (MLLE): MLLE adjusts the weight calculations to make them more robust, especially for noisy or imbalanced datasets.
Hessian LLE: This version captures the curvature of the data manifold, making it suitable for datasets with more complex shapes. Hessian LLE leverages tangent space information at each data point to better capture local geometry.
Sparse LLE: Sparse LLE enforces sparsity in the weight matrix, meaning only the most important relationships are kept. This improves efficiency and interpretability.
Laplacian eigenmaps: Laplacian eigenmaps focus on maintaining smooth transitions between points, adding regularization to improve stability and accuracy.

Understanding Locally Linear Embedding

Frequently Asked Questions

LLE works well with datasets that have complex or curved structures, like images or nonlinear patterns.

Yes, LLE can be slow and memory-intensive for large datasets due to its reliance on eigenvalue calculations and iterative optimization processes.

LLE is used in image recognition, text analysis, anomaly detection, and pattern recognition.

LLE is a type of manifold learning that assumes data lies on a low-dimensional manifold embedded in a high-dimensional space.

These are methods or processes used in machine learning to identify patterns or make decisions from data. LLE is an unsupervised machine learning algorithm because it works without labeled data.

Why customers choose Akamai

Akamai is the cybersecurity and cloud computing company that powers and protects business online. Our market-leading security solutions, superior threat intelligence, and global operations team provide defense in depth to safeguard enterprise data and applications everywhere. Akamai’s full-stack cloud computing solutions deliver performance and affordability on the world’s most distributed platform. Global enterprises trust Akamai to provide the industry-leading reliability, scale, and expertise they need to grow their business with confidence.

View cloud pricing

Get started with Akamai Cloud

Get started with Security

Get started with Content Delivery

Security and Delivery

Cloud pricing

Cloud pricing

Try Akamai Cloud with US$100 in credits*

Get started with Akamai Cloud

Partners

Akamai Cloud

Akamai Security and Delivery

What Is Locally Linear Embedding

Understanding Locally Linear Embedding

The need for dimensionality reduction

Key concepts and technologies

Overview of locally linear embedding

How locally linear embedding works

Advantages of locally linear embedding

Limitations of locally linear embedding

Locally linear embedding in AI models

Comparisons with other methods

Variants and improvements of locally linear embedding

Understanding Locally Linear Embedding

Frequently Asked Questions

What types of data are best suited for locally linear embedding?

Is locally linear embedding computationally expensive?

What are common applications of LLE?

How does LLE relate to manifold learning?

What are learning algorithms?

Why customers choose Akamai

Learn More

Akamai Cloud Computing

Cloud Computing at the Edge

Additional Resources

Distributed Cloud: Technology's Next Act

Power of Portability: 5 Business Benefits of Going Cloud Native

Related Pages

Related Blog Posts

Ready to get started or have questions?