RedEngine By UpEngine

RedEngine, community miner by UpEngine

RedEngine is an analytical engine built to mine Reddit communities for pain points, trends, and hidden signals. It takes large volumes of Reddit posts, processes them through an AI-powered pipeline, and transforms them into interactive, visual maps of meaning — making it possible to see what an entire community is talking about, struggling with, and recommending, all at a glance.

The core idea behind RedEngine is simple: Reddit is one of the richest sources of unfiltered human opinion on the internet. People vent about their frustrations, share what works, and tell stories about their experiences. But this information is buried across thousands of posts, scattered across time, and impossible to consume manually at scale. RedEngine solves this by turning that noise into structured, searchable, and visual intelligence.


The Problem RedEngine Solves

Anyone doing market research, product development, competitive analysis, or community management on Reddit faces the same wall: volume. A subreddit with 100,000 members can generate thousands of posts per month. Reading them one by one is impractical. Keyword searches miss context. And Reddit’s own search is notoriously unreliable.

The real value in Reddit data isn’t in any single post — it’s in the patterns. Which problems keep coming up? What tools are people recommending over and over? What frustrations are growing? What topics cluster together? These are the questions RedEngine is designed to answer.


How RedEngine Works — The Insight Engine

The heart of RedEngine is its Insight interface. This is where collected Reddit data becomes actionable intelligence. Insight is split into two views: Overview and Chart, each serving a distinct purpose.

Overview — The Dashboard

The Overview tab provides a high-level statistical snapshot of a subreddit’s data. It answers the foundational questions:

  • How much data do we have? Total posts collected, date range of collection, and how many posts have been processed through the AI calibration pipeline.
  • What are people talking about? A ranked leaderboard of the top entities mentioned across all posts — these include people, companies, products, tools, problems, solutions, concepts, and more. The AI doesn’t just find keywords; it extracts meaningful entities from context.
  • How are topics trending? A multi-line chart tracks the top entities over time, showing which topics are rising, falling, or staying steady. This is critical for spotting emerging pain points or fading trends.
  • What entities appear together? Co-occurrence analysis reveals which topics are linked in people’s minds. If “pricing” and “competitor X” keep appearing in the same posts, that’s a signal.
  • Where are people linking to? A platform leaderboard tracks the most common outbound links, revealing which external tools, platforms, and resources a community relies on.
  • When are people posting? A posts-over-time chart shows activity patterns and can highlight spikes tied to events, launches, or incidents.

The Overview also includes a search function — both exact keyword matching and semantic search powered by AI embeddings — so users can query the dataset in natural language and find conceptually relevant posts, not just string matches.

Chart — The Visual Intelligence Map

The Chart tab is where RedEngine truly differentiates itself. It transforms every post in a subreddit into a point on an interactive 2D scatter plot, where proximity equals similarity. Posts that talk about similar things appear near each other. Posts that are unrelated are far apart. The result is a literal map of a community’s conversation.

Multiple Lenses of Analysis

RedEngine doesn’t produce just one map. It generates four independent embedding spaces, each offering a different perspective on the same data:

  • AI Summary — How posts relate based on their overall content and meaning.
  • Pain — How posts relate based on the frustrations, problems, and pain points they express.
  • Advice — How posts relate based on the solutions, recommendations, and guidance they offer.
  • Narrative — How posts relate based on the stories, experiences, and journeys they describe.

Switching between these metrics reveals entirely different structures in the same dataset. A cluster of posts that look related in the Pain view might scatter in the Advice view, and vice versa. This multi-dimensional analysis is what makes RedEngine more than a simple topic modeler.

Clustering — Finding the Groups

RedEngine applies K-means clustering to automatically group similar posts together. Users can:

  • Let the engine auto-detect the optimal number of clusters using a Gap Statistic algorithm.
  • Manually specify how many clusters to create.
  • Ask the AI to name each cluster based on the posts it contains, generating a descriptive label and overview for each group.
  • Sort clusters by engagement (total posts, comments, or upvotes) to prioritize the most active discussions.
  • Hide, focus on, or delete entire clusters to refine the visualization.

This turns a cloud of dots into labeled territories — “People frustrated with onboarding,” “Users recommending tool X,” “Migration horror stories” — each instantly explorable.

Dynamic Visual Encoding

Every point on the chart carries more information than just position. RedEngine supports dynamic color intensity mapping, where the darkness or opacity of a point reflects a chosen engagement metric:

  • Comments — Darker points received more discussion.
  • Upvotes — Darker points were more popular.
  • Upvote Ratio — Darker points were more controversial (lower ratio).
  • Date — Darker points are more recent.

Combined with tag highlighting (pain, advice, narrative), users can visually filter the map to show only the pain-related posts with high engagement, or the most recent advice posts, without running a single query.

Search on the Map

Both exact and semantic search are available directly on the chart. When a user searches, matching posts are highlighted on the visualization, and the system estimates where the search query itself would appear on the map — showing conceptually where it fits in the conversation landscape. Users can then isolate search results with a single click, stripping away everything else.

Timeline Control

A timeline slider allows users to scrub through dates, highlighting only the posts from a specific day. This turns the static map into a temporal animation of how a community’s conversation evolved over time.

Post Deep Dive

Clicking any point opens a detailed modal showing the full post: title, author, engagement stats, the AI-generated summary, extracted entities, detected links, speech type classification, and a direct link back to Reddit. Every dot on the map is a gateway to the original conversation.

Undo/Redo

All operations — removing points, filtering results, deleting clusters — support full undo and redo via keyboard shortcuts, making exploration non-destructive and encouraging experimentation.


How Insight Can Be Used

Market Research & Product Discovery

Product teams can use RedEngine to analyze subreddits related to their industry and immediately see what problems people face most frequently. The Pain embedding view isolates frustration-related content, and clustering groups it into distinct problem categories. This reveals unmet needs, feature requests, and competitive gaps — backed by real user language, not survey responses.

Competitive Intelligence

By tracking entity trends over time, teams can monitor how often competitors, their products, or related tools are mentioned — and in what context. Co-occurrence analysis shows which competitors are being compared, and the Advice embedding view reveals what people recommend as alternatives.

Community Health Monitoring

Community managers can use the Overview dashboard to track posting patterns, identify trending concerns, and catch emerging issues before they escalate. The timeline feature on the Chart view shows how conversations shift over weeks or months.

Content Strategy & SEO

Content teams can identify the most discussed topics, the exact language people use to describe their problems, and which questions come up repeatedly. This directly informs blog posts, documentation, FAQ pages, and keyword targeting.

Startup Validation

Founders exploring a market can use RedEngine to validate whether a problem they want to solve actually exists at scale, how intensely people feel about it, and what solutions they’ve already tried. The Advice view shows the competitive landscape of existing solutions, while the Pain view quantifies demand.

Academic & Social Research

Researchers studying online communities, public discourse, or sentiment patterns can use the multi-metric embedding views and clustering to perform qualitative analysis at quantitative scale, identifying themes and patterns across thousands of posts without reading each one.

Trend Detection

The entity trend charts and temporal clustering make it possible to detect emerging topics — a new tool gaining traction, a policy change sparking backlash, a common complaint that suddenly spikes. RedEngine turns Reddit into an early warning system.


Technology Stack

  • Frontend: Next.js, React, TypeScript, Tailwind CSS, Chart.js
  • Backend: Node.js, Express.js
  • Database: PostgreSQL
  • AI: OpenAI (GPT-4o-mini for analysis, text-embedding-3-small for embeddings)
  • Dimensionality Reduction: UMAP
  • External Data: Reddit API (OAuth2)

Design Philosophy

RedEngine uses a dark-themed interface built around a minimal, high-contrast design system. The palette is anchored by a near-black background (#0B0B0F) with a red accent (#FF3B3B) that reinforces the brand identity. Typography is set in Inter for readability and JetBrains Mono for data. The UI prioritizes data density — surfaces are compact, controls are contextual, and the visualization takes center stage.


Current Status

RedEngine v0.0 is a functional prototype demonstrating the full pipeline from data ingestion through AI processing to interactive visualization. It is designed as a foundation for further development, with the architecture and AI prompt system built for extensibility.