Thank you for taking the time to visit my portfolio website! Here I've laid out some of my favourite past projects, as well as what I'm currently working on and discovering!
I'm a Data Scientist with 3+ years of experience, specializing in statistical analysis and model development!
I graduated with a B.S. in Mathematics with a focus in both Data Analysis and Business Intelligence from the University of South Florida.
Within Data Science, I've found myself most interested in the applications of neurosymbolic AI and the development of language models.
Outside of my professional background, I love cooking, taking a walk with music, and constantly challenging myself in the gym; I am currently located in New York City.
Feel free to check out my GitHub here, or connect with me on LinkedIn!
Table and Summaries of Projects
Below you can find some of my personal projects that I've completed off the clock; while each project is unique in tools and skills, my goal is always the same: challenge myself!
Each project has a report with results and figures, linked in both the table and the summary header!
Utilising the Spotify API and real-life user data to develop a playlist recommender model aimed at increasing user engagement (i.e., decreasing customer churn).
A PyTorch-based reinforcement learning model that learns to perform specified actions in an open-world video game (Pokémon Platinum), with the assistance of a computer vision model.
In this project, I set out to build an AI system capable of playing Pokémon Platinum, an open-world, story-driven video game, using computer vision and reinforcement learning with human feedback (RLHF)!
The project consists of two main models: an annotation model fine-tuned from YOLOv9s for object detection, and a gameplay model built and trained from scratch with RLHF to make strategic decisions within the game environment.
Technical Stack
2. Dataset and Preprocessing
Dataset preprocessing workflow
To develop the computer vision annotation model, I manually annotated over 150 gameplay images, labeling key objects such as Pokémon, items, NPCs, and locations. Data augmentations, including random rotation, color jittering, and contrast adjustments, along with resizing transformations, were applied to enhance the model's robustness and generalization.
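As a rough illustration of these augmentations, a torchvision-style pipeline might look like the sketch below; the specific rotation, jitter, and resize values are placeholders rather than the project's actual settings (and for detection data, bounding boxes would need to be transformed consistently as well).

import torchvision.transforms as T

# Illustrative augmentation pipeline (placeholder parameters, not the project's exact values)
augment = T.Compose([
    T.Resize((640, 640)),                                          # resize gameplay frames to a fixed input size
    T.RandomRotation(degrees=10),                                  # random rotation
    T.ColorJitter(brightness=0.2, contrast=0.3, saturation=0.2),   # color jitter and contrast adjustment
    T.ToTensor(),                                                  # convert a PIL image to a CHW float tensor
])

# augmented = augment(frame)  # where `frame` is a PIL image of a gameplay screenshot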
Once the annotation model was trained, it was used to automatically annotate an additional 450 gameplay states. Each state was paired with a set of possible actions in JSON format (automated via the action_prep.py script), drawn from nine options: A, B, X, Y, Up, Down, Left, Right, and None.
These annotated state-action pairs were then used to pretrain the gameplay model, preparing it for the main reinforcement learning phase.
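For context, one such state-action pair might be stored as the hypothetical JSON below; the field names are illustrative, since the exact schema produced by the action_prep.py script isn't shown here.

import json

# Hypothetical schema for one annotated state-action pair; the real
# fields produced by action_prep.py may differ.
state_action_pair = {
    "state_image": "frames/state_0001.png",
    "annotations": [
        {"label": "NPC", "bbox": [120, 64, 180, 128]},
        {"label": "item", "bbox": [32, 200, 64, 232]},
    ],
    "possible_actions": ["A", "B", "X", "Y", "Up", "Down", "Left", "Right", "None"],
    "chosen_action": "Up",
}

with open("state_0001.json", "w") as f:
    json.dump(state_action_pair, f, indent=2)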
3. Project Structure
This project was divided into several key components to develop a robust AI model capable of navigating and interacting within the game environment. Each component played a critical role in building an effective and adaptive gameplay agent:
1. Development and training of a computer vision annotation model to identify key game elements.
2. Construction of a PyTorch-based gameplay model with hyperparameter tuning for optimal performance.
3. Pretraining of the gameplay model on annotated data, followed by reinforcement learning with human feedback to refine its interactions.
4. Evaluation of the model's performance in achieving in-game objectives.
3.1 COMPUTER VISION MODEL
The computer vision model, based on YOLOv9, detects key objects like Pokémon, items, and NPCs to enable situational awareness for the gameplay model; a brief sketch of the fine-tuning and auto-annotation steps follows the list below.
Architecture: YOLOv9, fine-tuned on a custom dataset of 150 annotated gameplay images.
Dataset & Augmentation: Key objects manually labeled, with data augmentation (rotation, color jitter) applied to improve robustness.
Training: Optimized hyperparameters for high detection accuracy. After training, the model auto-annotated 450 additional images.
Outcome: Achieved high accuracy, providing reliable annotations for the gameplay model's pretraining phase.
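A minimal sketch of the fine-tuning and auto-annotation steps described above, assuming the Ultralytics YOLO API and a hypothetical dataset config (pokemon_platinum.yaml); the paths and hyperparameters are illustrative.

from ultralytics import YOLO

# Fine-tune YOLOv9s on the manually annotated gameplay images
# (dataset config path and hyperparameters are placeholders).
model = YOLO("yolov9s.pt")
model.train(data="pokemon_platinum.yaml", epochs=100, imgsz=640, batch=16)

# Use the fine-tuned model to auto-annotate additional gameplay frames
results = model.predict(source="frames/unlabeled/", conf=0.5, save_txt=True)
for r in results:
    print(r.path, r.boxes.cls.tolist())  # detected class ids per frame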
3.2 GAMEPLAY MODEL
The gameplay model combines convolutional feature extraction with an LSTM over sequences of game states:

import torch
import torch.nn as nn
import torch.nn.functional as F

class PokemonModelLSTM(nn.Module):
    def __init__(self, input_size, conv1_dim, conv2_dim, hidden_size, num_layers, num_actions, use_annotations=False):
        super(PokemonModelLSTM, self).__init__()
        # Store hyperparameters as attributes for easy access
        self.input_size = input_size
        self.hidden_size = hidden_size
        self.num_layers = num_layers
        self.num_actions = num_actions
        self.use_annotations = use_annotations
        # Convolutional layers to extract features from the game state
        self.conv1 = nn.Conv2d(3, conv1_dim, kernel_size=3, stride=2, padding=1)
        self.conv2 = nn.Conv2d(conv1_dim, conv2_dim, kernel_size=3, stride=2, padding=1)
        # LSTM to capture temporal dependencies; input_size will be adjusted dynamically in the forward pass
        self.lstm = nn.LSTM(input_size=input_size, hidden_size=hidden_size, num_layers=num_layers, batch_first=True)
        # Fully connected layer to map the LSTM output to the number of actions
        self.fc = nn.Linear(hidden_size, num_actions)

    def forward(self, x, annotations=None):
        # Input x is expected to be of shape (batch_size, seq_len, C, H, W)
        batch_size, seq_len, C, H, W = x.size()
        # Reshape input for the convolutional layers
        x = x.view(batch_size * seq_len, C, H, W)
        # Apply convolutional layers with ReLU activations
        x = F.relu(self.conv1(x))
        x = F.relu(self.conv2(x))
        # Flatten the output for the LSTM layer
        x = x.view(batch_size, seq_len, -1)  # Shape: (batch_size, seq_len, conv2_dim * H' * W')
        # Concatenate annotation features if they are provided
        if self.use_annotations and annotations is not None:
            annotations = annotations.view(batch_size, seq_len, -1)  # Reshape annotations to (batch_size, seq_len, annotation_features)
            x = torch.cat((x, annotations), dim=-1)  # Concatenate along the feature dimension
        # Dynamically reinitialize the LSTM if needed to match the actual input size (kept on the same device as the input)
        lstm_input_size = x.size(-1)
        if lstm_input_size != self.lstm.input_size:
            self.lstm = nn.LSTM(input_size=lstm_input_size, hidden_size=self.hidden_size, num_layers=self.num_layers, batch_first=True).to(x.device)
        # Apply the LSTM
        x, _ = self.lstm(x)  # x: (batch_size, seq_len, hidden_size)
        # Fully connected layer: map the last time step's LSTM output to the action space
        x = self.fc(x[:, -1, :])
        return x
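For illustration, the gameplay model might be instantiated and run on a dummy batch as follows; the hyperparameter values and frame size are placeholders rather than the tuned values used in the project.

import torch

# Placeholder hyperparameters for illustration only
model = PokemonModelLSTM(
    input_size=16 * 64 * 64,  # conv2_dim * H' * W' for 256x256 frames downsampled twice
    conv1_dim=8,
    conv2_dim=16,
    hidden_size=128,
    num_layers=2,
    num_actions=9,            # A, B, X, Y, Up, Down, Left, Right, None
)

# Dummy batch: 4 sequences of 5 RGB frames at 256x256
frames = torch.randn(4, 5, 3, 256, 256)
logits = model(frames)        # shape: (4, 9), one score per possible action
action = logits.argmax(dim=-1)
print(action)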
3.3 Model Pretraining and Reinforcement Learning Training
Prior to reinforcement learning, the model was trained on foundational knowledge through gameplay state-action pairs in order to optimize the learning process.
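This pretraining step amounts to supervised learning on the annotated state-action pairs; a minimal sketch, assuming a hypothetical DataLoader yielding frame sequences and action-index labels:

import torch
import torch.nn as nn

# Hypothetical pretraining loop over annotated (frame sequence, action index) pairs
def pretrain(model, dataloader, epochs=10, lr=1e-4):
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    criterion = nn.CrossEntropyLoss()
    for epoch in range(epochs):
        total_loss = 0.0
        for frames, actions in dataloader:    # frames: (B, seq_len, C, H, W), actions: (B,)
            logits = model(frames)            # (B, num_actions)
            loss = criterion(logits, actions)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
            total_loss += loss.item()
        print(f"epoch {epoch}: loss {total_loss / len(dataloader):.4f}")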
Reinforcement Learning Training Loop
*Note that this is for exploitation, not exploration.
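The original training loop isn't reproduced here, but a minimal sketch of an exploitation-style loop with a human-feedback reward term might look like the following; the environment interface, reward values, and helper names are hypothetical.

import torch
import torch.nn.functional as F

# Hypothetical environment / feedback interface; the real project's helpers may differ.
def train_step(model, optimizer, env, seq_len=5, gamma=0.99):
    frames, log_probs, rewards = [], [], []
    state = env.reset()                            # initial gameplay frame, shape (3, H, W)
    for _ in range(seq_len):
        frames.append(state)
        seq = torch.stack(frames).unsqueeze(0)     # frames seen so far: (1, t, C, H, W)
        logits = model(seq)
        probs = F.softmax(logits, dim=-1)
        action = probs.argmax(dim=-1)              # exploitation: always pick the highest-probability action
        log_probs.append(torch.log(probs[0, action]))
        state, env_reward, done = env.step(action.item())
        human_reward = env.get_human_feedback()    # e.g. +1 / -1 entered by the human observer
        rewards.append(env_reward + human_reward)
        if done:
            break
    # Discounted return for each step, then a simple policy-gradient style update
    returns, G = [], 0.0
    for r in reversed(rewards):
        G = r + gamma * G
        returns.insert(0, G)
    loss = -torch.stack([lp * R for lp, R in zip(log_probs, returns)]).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return sum(rewards)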
3.4 EVALUATION
The evaluation phase focused on assessing the model's performance in object detection and gameplay decision-making accuracy; a small sketch of how the gameplay metrics could be computed follows the list below.
Computer Vision Model Metrics:
Mean Average Precision (mAP): Used to measure detection accuracy for Pokémon, items, and NPCs.
Precision and Recall: Evaluated to ensure a balance between correctly identifying objects and avoiding false positives.
Gameplay Model Metrics:
Accuracy of Actions: Percentage of correct actions taken by the model during gameplay.
Reward Score: Cumulative reward achieved per episode to assess decision-making quality in achieving game objectives.
Consistency: Frequency of successful task completions (e.g., navigating to specific locations).
Human Feedback Integration: Analyzed the model’s improvement in decision-making accuracy based on human feedback adjustments.
Real-Time Performance: Ensured that the model could perform efficiently within the game environment with minimal lag or delays.
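As a rough sketch, assuming a hypothetical per-episode log of chosen actions, reference actions, and rewards, the gameplay metrics above could be computed like this:

# Hypothetical episode log: list of (chosen_action, reference_action, reward) tuples
def gameplay_metrics(episode_log):
    chosen = [c for c, _, _ in episode_log]
    reference = [r for _, r, _ in episode_log]
    rewards = [rw for _, _, rw in episode_log]
    action_accuracy = sum(c == r for c, r in zip(chosen, reference)) / len(episode_log)
    cumulative_reward = sum(rewards)
    return {"action_accuracy": action_accuracy, "cumulative_reward": cumulative_reward}

# Example usage with a toy three-step episode
log = [("Up", "Up", 1.0), ("A", "B", -0.5), ("Right", "Right", 2.0)]
print(gameplay_metrics(log))  # {'action_accuracy': 0.666..., 'cumulative_reward': 2.5}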
4. Challenges & Solutions
Limited Dataset for Training Computer Vision Model
Manually annotated a small set of images (150) and applied data augmentation techniques (rotation, color jitter, contrast adjustments).
Achieved 70%+ accuracy on key object detection (Pokémon, NPCs), verified on test data with diverse augmentations.
Managing Memory for Contextual Gameplay
Implemented an extended LSTM module to retain game context (map location, Pokémon health, etc.).
Improved task performance by 25% in complex scenarios requiring memory, such as revisiting NPCs or avoiding areas with low health.
Difficulty in Model Tuning for Real-Time Gameplay
Applied hyperparameter tuning and used human feedback to optimize the LSTM model's parameters.
Reached 90% success in task completion (e.g., navigating to locations) across test runs.
Handling Diverse Game Scenarios
Used reinforcement learning with human feedback to train the model across varied scenarios.
Enhanced model’s flexibility, achieving a 15% reduction in failed actions during unpredictable events or rare situations.
Action Space Complexity in Reinforcement Learning
Simplified the action space to directional and core commands (A, B, X, Y) for efficient learning.
Reduced training time by 30%, with a 20% improvement in episode convergence rates.
5. Results
The results showcase the model’s effectiveness across object detection and gameplay decision-making, with improvements observed through human feedback integration and real-time performance.
Computer Vision Model:
Mean Average Precision (mAP): Achieved 70.5% for detecting Pokémon, items, and NPCs accurately.
Precision and Recall: The model achieved a balanced precision and recall of approximately 80%, effectively reducing false positives while accurately identifying in-game objects.
Gameplay Model:
Reward Score: Averaged a cumulative reward score of 160 per episode, demonstrating effective decision-making aligned with game objectives.
Task Consistency: Successfully completed navigation tasks 75% of the time, reliably reaching designated locations in the game.
Human Feedback Integration: Increased action accuracy by 12% post-feedback, enhancing decision quality and responsiveness to in-game challenges.
Real-Time Performance: Achieved smooth operation with <1-second response time, maintaining real-time gameplay without perceptible lag.
6. Conclusion & Future Improvements
Conclusion
As a personal project, this work demonstrated the feasibility of combining computer vision and reinforcement learning to create an AI model that can autonomously navigate and interact within a complex open-world game.
With the incorporation of human feedback, the model's performance was refined, making it adaptable to real-time challenges.
Future Improvements
Fine-Tune Reward Structure: Enhance the reward system in the reinforcement learning model to incentivize more complex in-game strategies and behaviors.
Explore Multi-Agent Scenarios: Develop the model to handle multi-agent environments or interactions with multiple NPCs for a richer gaming experience.
Enhance Model Memory: Implement a memory module (e.g., extended LSTM or transformer-based) to track map, location, Pokémon status, and recent actions, enabling strategy based on game history.
Knowledge Graph Movie Recommender GNN
Movie data knowledge graph in Neo4j! (Blue: Year, Orange: Genre, Purple: Title, Red: Popularity)