Home
  • About Me
  • Search
Navigation bar avatar
✕

    Jason Ballantyne's Portfolio


    Cyber Security Data Engineer | MSc Computer Science

    Text Analytics for Big Data

    Discovering meaning in big data through text analytics

    Posted on February 26, 2022

    Post thumbnail
    Post thumbnail
    Discovering meaning in big data and finding important regularities through fundamental techniques and areas where text analytics is deployed. [Read More]
    Tags: data-science nlp nltk natural-language-processing r python3 pandas numpy seaborn sklearn

    Covid-19 Cleaning and Preparation

    Focus on data understanding, cleaning and preparation for the Covid-19 pandemic dataset

    Posted on February 15, 2022

    Post thumbnail
    Post thumbnail
    This project focuses on data understanding, cleaning and preparation for the Covid-19 pandemic dataset. The data comes from the Centers for Disease Control and Prevention. [Read More]
    Tags: data-science pandas numpy seaborn data-analysis matplotlib data-visualization

    Implementing Naïve Bayes from Scratch

    Creating a class that implements Gaussian Naïve Bayes from scratch.

    Posted on February 1, 2022

    Post thumbnail
    Post thumbnail
    Creating a python class that implements Gaussian Naive Bayes from scratch, then test the performance and accuracy of my implementation against the GaussianNB implementation in scikit-learn. [Read More]
    Tags: data-science naive-bayes python3 object-oriented-programming sklearn

    Irish Crime Analysis

    An analysis and report of crime in Ireland from 2003 to 2019 in R

    Posted on January 30, 2022

    Post thumbnail
    Post thumbnail
    An analysis and report of crime in Ireland from 2003 to 2019 in R. A report of the key functionality of dplyr and a confidence interval function, complete with methods to print, summarise and plot the function in R. [Read More]
    Tags: data-science r ggplot2 dplyr tidyverse data-visualization s4class reshape2 confidence-interval

    Naïve Bayes handling missing values

    Implementing a Gaussian Naïve Bayes function in python that handles missing values

    Posted on January 25, 2022

    Post thumbnail
    Post thumbnail
    Implementing a Gaussian Naive Bayes function in python that handles missing values, both explicity in the classification algorithm and using imputation methods. [Read More]
    Tags: data-science naive-bayes python3 object-oriented-programming sklearn
    • ← Newer Posts
    • Older Posts →
    • Email me
    • GitHub
    • LinkedIn

    Jason Ballantyne  •  2024

    Powered by Beautiful Jekyll