Skip to content
#

tf-idf-vectorizer

Here are 129 public repositories matching this topic...

Tunable full text search engine in JavaScript that: (1) works natively on web apps like Express.js; (2) easy to customize (via BM25) to specific types of documents (e.g. tweets, scientifc journals); (3) is deployable on either the client-side or the server side.

  • Updated Jan 26, 2019
  • JavaScript

NLP based Classification Model that predicts a person's personality type as one of the 16 Myers Briggs personality types. Extremely challenging project dealing with correlation between human psychology and casual writing styles and handling heavily imbalanced classes. Check the app here - https://mb-predictor-motetuzs5q-uc.a.run.app/

  • Updated Feb 15, 2023
  • Jupyter Notebook

Skincare recommendation android application that uses dataset from Kaggle and scrapped data from cosmetics websites to work a Tf-IDF vectorizer for content based filtering, and KNN and Decision trees for collaborative based filtering. The notebook also contains other approaches for POC including SVD. Backend APIs are based on Flask, Android appl…

  • Updated Oct 22, 2021
  • Jupyter Notebook

A simple Django-based resume ranker website where recruiters post their jobs and candidates applies for their desired vacancies. The system gets the document similarity between the job description and the candidate resumes, generates similarity scores using the KNN model, and rank or shortlist the candidate resumes.

  • Updated Jan 26, 2024
  • HTML

Sistema de recomendación de películas basado en contenido. Utilizando TF-IDF y la similitud del coseno. La data fue extraída, transformada y analizada para el entrenamiento del modelo. Disponibilizandolo junto con la data limpia para futuras consultas, a través del despliegue con FastAPI y Render.

  • Updated Sep 26, 2023
  • Jupyter Notebook

Scrapped tweets using twitter API (for keyword ‘Netflix’) on an AWS EC2 instance, ingested data into S3 via kinesis firehose. Used Spark ML on databricks to build a pipeline for sentiment classification model and Athena & QuickSight to build a dashboard

  • Updated May 2, 2022
  • Jupyter Notebook

The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.

  • Updated Mar 29, 2023
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the tf-idf-vectorizer topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tf-idf-vectorizer topic, visit your repo's landing page and select "manage topics."

Learn more