Arsitektur Sistem Rekomendasi
Content-Based Filtering dengan SBERT dan Cosine Similarity
π Diagram Alur Sistem
File GPX
Preprocessing
Cleaning, Lowercase
SBERT Embedding
384 dimensi
Cosine Similarity
QΒ·D / (||Q||Γ||D||)
Top-N Ranking
Rekomendasi
1. Data Collection & Fusion
- β Ekstraksi fitur dari GPX (jarak, elevasi, grade, durasi Naismith)
- β Deskripsi manual (vegetasi, sumber air, panorama)
- β Koordinat rute untuk visualisasi peta
2. Text Preprocessing
- β Data Cleaning (regex: URL, karakter non-ASCII)
- β Case Folding (lowercase)
- β Stopword Removal Selektif (pertahankan negasi & kata sifat)
- β No Stemming (SBERT sensitif konteks)
3. SBERT Embedding
-
β
Model:
paraphrase-multilingual-MiniLM-L12-v2 - β Dimensi embedding: 384
- β Support bahasa Indonesia
4. Cosine Similarity
Sim(Q, D) = (Q Β· D) / (||Q|| Γ ||D||)
- Q = Query embedding (user search)
- D = Document embedding (route)
- Range = -1 to 1 (1 = identical)
π οΈ Technology Stack
π
Laravel 11
Backend Framework
π
Python 3.11
ML Processing
π€
SBERT
Sentence Transformers
πΊοΈ
Leaflet.js
Map Visualization