Architecture Documentation

This document provides a comprehensive overview of the News Aggregator application architecture, detailing how the frontend and backend components interact to deliver the functionality.

System Overview

The News Aggregator is a full-stack JavaScript application that combines:

A React frontend for user interaction
An Express backend for API processing
LangChain integration for LLM orchestration
External API connectivity for news data retrieval

The application follows a modern client-server architecture where the frontend makes API calls to the backend, which in turn processes these requests by communicating with external services.

Architecture Diagram

graph TD
    subgraph "User Interface"
        User[User] -->|Enters search topic| UI[React Frontend]
        UI -->|Displays results| User
    end

    subgraph "Frontend Components"
        UI -->|State management| App[App.js]
        App -->|Search request| NewsSearch[NewsSearch.js]
        App -->|Renders articles| NewsList[NewsList.js]
        NewsList -->|Renders each article| NewsItem[NewsItem.js]
        NewsSearch -->|Submits topic| App
    end

    subgraph "API Communication"
        App -->|API call| NewsService[newsService.js]
        NewsService -->|HTTP Request| Backend[Express Backend]
    end

    subgraph "Backend Processing"
        Backend -->|Processes request| APIHandler[API Handler]
        APIHandler -->|Fetches articles| NewsAPI[News API]
        APIHandler -->|Summarizes content| LangChain[LangChain Integration]
        LangChain -->|Sends prompts| LLM[LLM Service]
        NewsAPI -->|Returns articles| APIHandler
        LLM -->|Returns summaries| APIHandler
        APIHandler -->|Returns processed data| Backend
    end

    Backend -->|JSON Response| NewsService
    NewsService -->|Updates state| App

Component Details

Frontend (React)

The frontend is a React application organized into the following key components:

App.js

Purpose: Main application component that orchestrates the overall UI
Responsibilities:
- Manages application state (articles, loading, error)
- Handles the search workflow
- Renders child components
Key State Variables:
- articles: Array of news articles with summaries
- loading: Boolean indicating if a search is in progress
- error: Error message if the search fails
- searchPerformed: Boolean tracking if a search has been executed

NewsSearch.js

Purpose: Search form component for user input
Responsibilities:
- Captures the user's topic input
- Validates form input
- Triggers search via callback to parent
Props:
- onSearch: Callback function to initiate search

NewsList.js

Purpose: Container for displaying news article results
Responsibilities:
- Renders a collection of NewsItem components
- Handles the layout of search results
Props:
- articles: Array of article objects to display

NewsItem.js

Purpose: Individual news article display
Responsibilities:
- Renders article details (title, source, summary, etc.)
- Displays article image if available
- Provides link to original article
Props:
- article: Object containing article data
Features:
- Date formatting
- Conditional image rendering
- External link handling

newsService.js

Purpose: Service module for API communication
Responsibilities:
- Makes HTTP requests to the backend API
- Handles API response formatting
- Manages API errors
Key Functions:
- searchNews(topic): Fetches news articles for a given topic
Environment Handling:
- Automatically detects development vs. production environment
- Uses appropriate API base URL based on environment

Backend (Express)

The backend is an Express.js server with the following components:

server.js

Purpose: Main server application
Responsibilities:
- Sets up Express middleware
- Defines API routes
- Manages connections to external services
- Handles Cloud Foundry environment integration
Key Middleware:
- CORS handling
- JSON parsing
- Static file serving
- Rate limiting

LLM Integration

Purpose: Integration with Large Language Models via LangChain
Responsibilities:
- Creates and manages LLM client
- Sends article data for summarization
- Processes LLM responses
Implementation Details:
- Uses ChatOpenAI from LangChain
- Configures model parameters (temperature, etc.)
- Handles prompt engineering for summarization

Environment Configuration

Purpose: Handle configuration for different environments
Responsibilities:
- Detect Cloud Foundry VCAP_SERVICES for production
- Use local .env variables for development
- Configure API keys and endpoints
Configuration Hierarchy:
1. Cloud Foundry VCAP_SERVICES (production)
2. Environment variables (multiple fallback options)
3. Default values where appropriate

External Services

The application interacts with two primary external services:

News API

Purpose: Source of news article data
Responsibilities:
- Provides searchable news articles
- Returns article metadata (title, description, source, etc.)
Integration Method:
- REST API calls with API key authentication
- Endpoint: https://newsapi.org/v2/everything

LLM Service (GenAI)

Purpose: Natural language processing for article summarization
Responsibilities:
- Generates concise article summaries
- Processes natural language instructions
Integration Method:
- LangChain.js framework
- Compatible with OpenAI API and similar LLM services
- Supports custom endpoints via service binding

Data Flow

sequenceDiagram
    participant User
    participant React as React Frontend
    participant Service as newsService.js
    participant Express as Express Backend
    participant NewsAPI as News API
    participant LLM as LLM Service

    User->>React: Enters search topic
    React->>Service: Calls searchNews(topic)
    Service->>Express: GET /api/news?topic=xyz
    Express->>NewsAPI: Fetch articles for topic
    NewsAPI-->>Express: Return article data

    loop For each article
        Express->>LLM: Request 25-word summary
        LLM-->>Express: Return summary
    end

    Express-->>Service: Return articles with summaries
    Service-->>React: Update state with articles
    React-->>User: Display search results

User Interaction:
- User enters a topic in the search form
- Frontend submits search request to newsService.js
API Request:
- newsService.js sends HTTP GET request to /api/news endpoint
- Request includes the search topic as a query parameter
Backend Processing:
- Express server receives the request
- Server queries the News API for relevant articles
- For each article, server sends content to LLM for summarization via LangChain
LLM Processing:
- LangChain orchestrates communication with the LLM service
- LLM generates concise summaries for each article
- System prompt instructs LLM to create exactly 25-word summaries
Response Handling:
- Backend collects all articles with summaries
- Returns formatted JSON response to frontend
UI Update:
- Frontend receives the response
- Updates React state with article data
- Renders NewsList and NewsItem components with the results

Deployment Architecture

The application is designed to be deployed on Tanzu Platform for Cloud Foundry:

graph TD
    subgraph "Cloud Foundry Environment"
        CF[Cloud Foundry Runtime] -->|Hosts| App[News Aggregator App]
        App -->|Binds to| GenAI[GenAI Service]
        App -->|Connects to| NewsAPI[News API]

        subgraph "Application Components"
            App -->|Contains| Static[Static React Build]
            App -->|Contains| Server[Express Server]
        end
    end

    subgraph "External Services"
        GenAI -->|Provides| LLM[LLM Capabilities]
        NewsAPI -->|Provides| Articles[News Articles]
    end

    User[End User] -->|Accesses| App

Cloud Foundry Runtime:
- Application packaged as a Node.js app
- Configuration via manifest.yml
- Memory allocation optimized for Node.js and React build
Service Binding:
- Automatic detection of VCAP_SERVICES
- Binding to GenAI service for LLM functionality
- Smart credential extraction with fallbacks
Environment Variables:
- API keys and endpoints configured via environment
- Fallback to local variables for development
- Secure handling of credentials

Security Considerations

API Keys:
- News API key stored securely in environment variables
- LLM API keys managed through service binding
- No hardcoded credentials in source code
Data Processing:
- All LLM processing done server-side
- No sensitive user data stored or processed
- CORS configured for appropriate access control
Error Handling:
- Proper error handling to prevent information leakage
- Graceful degradation when services are unavailable
- User-friendly error messages

Performance Considerations

Parallel Processing:
- Articles are summarized in parallel using Promise.all
- Reduces overall response time
- Handles failures of individual summary requests
Error Handling:
- Graceful degradation if LLM summarization fails
- Falls back to article description when summary unavailable
- Comprehensive error logging
Optimization Opportunities:
- Implement caching for frequent searches
- Add pagination for large result sets
- Optimize image loading with lazy loading

Development Workflow

Local Development:
- Frontend and backend run on separate ports (3000 and 3001)
- concurrently package runs both services for development
- Hot reloading for React components
Build Process:
- React app built to static files
- Express serves static files in production
- Optimized bundle size
Deployment:
- Single command deployment via cf push
- Service binding via cf bind-service
- Environment variable configuration via manifest.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Architecture Documentation

System Overview

Architecture Diagram

Component Details

Frontend (React)

App.js

NewsSearch.js

NewsList.js

NewsItem.js

newsService.js

Backend (Express)

server.js

LLM Integration

Environment Configuration

External Services

News API

LLM Service (GenAI)

Data Flow

Deployment Architecture

Security Considerations

Performance Considerations

Development Workflow

FilesExpand file tree

ARCHITECTURE.md

Latest commit

History

ARCHITECTURE.md

File metadata and controls

Architecture Documentation

System Overview

Architecture Diagram

Component Details

Frontend (React)

App.js

NewsSearch.js

NewsList.js

NewsItem.js

newsService.js

Backend (Express)

server.js

LLM Integration

Environment Configuration

External Services

News API

LLM Service (GenAI)

Data Flow

Deployment Architecture

Security Considerations

Performance Considerations

Development Workflow