CORE PIPELINE

Click any stage for details
INPUTEOJN NabavaFINA RGFISudski RegistarNarodne Novine82 Press SourcesReddit · Trends⬆ Click for details COLLECTScrapersREST APIsRSS FeedsHydra CollectorUnified Pipeline39 Python scripts RAW DATAPostgreSQL 16.8M190 tables10 schemas4.6 GBcivic · persona · legalplatform · dabi · support PROCESSDeduplicationEntity ResolutionNLP / NERBenford AnalysisAnomaly DetectionSentinel v11.00 AI / LLMOllama (local)Groq (free)DeepSeekEmbeddingRAG PipelineRTX 4000 Ada 20GB OUTPUTQdrant 801K vecMeili 489K docsNeo4j 73K nodesdemo.rinet.oneportal.rinet.oneDABI APIWeb Builder

📡 INPUT SOURCES

EOJN (nabava)69,351 records
FINA RGFI (financije)1.6M records
Sudski Registar64,292 entities
Narodne Novine10,705 zakona
Press (82 izvora)10,642 članaka
Reddit r/croatia8,234 posts
Google Trends HR1,240 trends
Proračun.hr11.5M stavki
37 cron jobs automatski prikupljaju podatke 24/7. Svaki izvor ima vlastiti collector sa schedule-om, retry logikom i dedup filterom.

⚙️ COLLECTORS

Python scripts39
Cron jobs37
Hydra CollectorACTIVE
Unified PipelineACTIVE
Brain BuilderACTIVE
Legal EmbedderACTIVE
Collectors koriste scraping (BeautifulSoup, Playwright), REST API pozive, RSS feed parsanje i WebSocket streaming. Svaki ima retry (3x), rate limiting i TruthGate validaciju.
→ Otvori Collectors Dashboard

🗄️ RAW DATA

PostgreSQL 1816.8M rows
Tables190
Database Size4.6 GB
Schemascivic (88), persona (35), support (18), projects (14), dabi (13), commander (8), platform (6), legal (3)
Svi sirovi podaci ulaze u PostgreSQL 18. Svaki schema ima jasnu domenu. Civic schema drži javne podatke (nabava, proračun, entiteti), persona schema drži DABI osobnost i blog.
→ Otvori Data Dashboard

🔬 PROCESSING

DeduplicationSHA-256 hash
Entity ResolutionOIB + fuzzy match
NLP / NERspaCy + custom
Benford AnalysisFirst digit law
Anomaly DetectionStatistical + ML
Civic ScoreMulti-factor index
Sentinel v11.00 orkestrira sve procesiranje. DataGate (4 workera) radi dedup, HealthChecker (60s interval) nadzire, AnomalyDetector (6h) traži nepravilnosti u nabavi i proračunu.

🧠 AI / LLM

Ollama (local)qwen2.5:7b, llama3.2, nomic-embed
Groqllama-3.3-70b (FREE)
DeepSeekdeepseek-chat ($0.14/1K)
GPURTX 4000 Ada 20GB VRAM
RAG PipelineQdrant + nomic-embed
WaterfallGroq → DeepSeek → Ollama → GPT → Claude
LLM waterfall: besplatni provideri prvi (Groq, Ollama), plaćeni kao fallback. DABI orchestrator automatski routea upite na pravi model prema tipu zadatka.
→ Otvori AI Dashboard

🚀 OUTPUT

Qdrant Vectors801K
Meilisearch Docs489K
Neo4j Graph73K nodes
demo.rinet.oneCivic Platform
portal.rinet.oneTech Portal
me.dabi.digitalDABI Persona
DABI API/api/v1/dabi/*
Web Buildermc.rinet.one/builder
Svo procesirano znanje dostupno je kroz multiple kanale: RAG search (Qdrant), full-text search (Meili), graph queries (Neo4j), i DABI AI chat koji kombinira sve.

PIPELINE STATUS

Sentinel Watchdog ACTIVE
Brain Builder ACTIVE
Hydra Collector ACTIVE
Legal Embedder ACTIVE
Unified Pipeline ACTIVE

DATA FLOW

Input sources: 10+
Collectors: 39 scripts
Cron jobs: 37
Processing: Sentinel + NLP
AI models: 3 local + 2 cloud

LIVE NUMBERS

PG rows:
Vectors:
Search docs:
Services:
Docker:
RINET AI OS — MISSION CONTROL