Our Methodology

Overview

AI Crypto News uses a multi-stage pipeline to collect, process, and analyze cryptocurrency news. Our system runs continuously, fetching new articles every 5 minutes, classifying them every 10 minutes, and generating entity summaries every 30 minutes.

1. News Collection

We aggregate RSS feeds from 19+ cryptocurrency news publications. Our collector:

  • Fetches articles from all sources in parallel
  • Deduplicates content using URL matching
  • Enriches articles with Open Graph images when missing
  • Stores articles in Redis cache for fast retrieval

Update Frequency: Every 5 minutes

2. Sentiment Classification

Each article is analyzed by our AI model (Llama 3.2) to determine market sentiment:

Bullish

News that suggests positive price action, adoption growth, favorable regulations, successful upgrades, or institutional interest.

Bearish

News indicating potential negative impact: hacks, regulatory crackdowns, project failures, market manipulation, or negative macroeconomic factors.

Neutral

Informational content without clear market direction: educational articles, technical updates, or balanced market analysis.

Important

High-impact news regardless of sentiment: major announcements, breaking news, regulatory decisions, or significant market events.

3. Importance Scoring

Our AI assigns importance scores from 1-10 based on potential market impact:

  • 1-3 (Low): Minor updates, routine news, educational content
  • 4-6 (Medium): Noteworthy developments, partnership announcements, technical milestones
  • 7-9 (High): Significant market events, major protocol upgrades, regulatory news
  • 10 (Critical): Market-moving events, security incidents, major institutional moves

4. Entity Extraction

We automatically identify and tag entities mentioned in articles:

  • Cryptocurrencies: 84+ symbols (BTC, ETH, SOL, etc.) and full names
  • ETFs: Bitcoin and Ethereum ETF tickers (IBIT, FBTC, ARKB, etc.)
  • Key Terms: airdrop, listing, mainnet, halving, regulation, hack, etc.
  • Protocols & Exchanges: Major DeFi protocols and centralized exchanges

5. Entity Summaries

For major cryptocurrencies (BTC, ETH, SOL, XRP, BNB, ADA, DOGE, TRX, XLM, LINK), we generate AI-powered market analysis summaries:

  • Paragraph 1: Current developments and main news themes
  • Paragraph 2: Market sentiment and trading implications
  • Paragraph 3: Upcoming catalysts and market outlook

Update Frequency: Every 30 minutes, based on the latest 10 articles per entity

6. Caching Strategy

We use a multi-layer caching system for optimal performance:

  • In-Memory Cache: Fastest access for frequently requested data
  • Redis Cache: Persistent storage with configurable TTL
  • Stale-While-Revalidate: Serve cached data while refreshing in background

This ensures you always get fast responses while maintaining data freshness.

AI Model Details

We use locally-hosted Ollama with Llama 3.2 models for privacy and speed:

  • Classification: Llama 3.2 3B parameter model (higher accuracy)
  • Summarization: Llama 3.2 1B parameter model (faster generation)
  • Temperature: 0.3 (focused, consistent outputs)
  • Processing: All AI inference runs locally - no data sent to external APIs

Limitations

While our AI system strives for accuracy, please be aware of these limitations:

  • AI sentiment analysis may misinterpret sarcasm, nuance, or complex narratives
  • Importance scores are algorithmic estimates, not guarantees of market impact
  • Entity extraction may miss context-dependent references
  • News aggregation depends on source RSS feed availability and formatting
  • Market conditions can change faster than our update intervals

Always verify important information with original sources and conduct your own research.