Count the frequency of letters or words in a text file and show a graph.

Last update: Apr 09, 2022

Overview

Word Counter

By EBUS Coding Club

Requirements

Python 3.9 or higher
matplotlib

Usage

Download the source code and unzip the downloaded file.
Run pip install -r requirements.txt in the source code directory to install the required packages.
Create a text file named input.txt in the same directory as main.py and fill it with the text you want to analyze.
Run the script in an IDE of your choice or from a terminal with python main.py.

Objective

Given a text file, count the frequency (number of occurrences) of either letters or words, and show a bar graph to visualize the results. Do not include whitespace or punctuation in the results, with the exception of apostrophes that are inside words.
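
One possible way to approach the objective is sketched below. It is not the club's main.py. The function names (count_words, count_letters, plot_counts) and the top_n parameter are hypothetical. It also assumes that counting is case-insensitive, that digits count as part of a word, and that only the straight apostrophe (') is kept inside words.

```python
import re
from collections import Counter

# A word is a run of letters/digits, optionally joined by single apostrophes
# that sit between such runs (so "don't" is kept, but quote marks are not).
WORD_RE = re.compile(r"[^\W_]+(?:'[^\W_]+)*")


def count_words(text):
    """Count words case-insensitively, ignoring whitespace and punctuation."""
    return Counter(match.group(0) for match in WORD_RE.finditer(text.lower()))


def count_letters(text):
    """Count alphabetic characters case-insensitively."""
    return Counter(ch for ch in text.lower() if ch.isalpha())


def plot_counts(counts, top_n=20):
    """Show a bar graph of the top_n most common items (hypothetical helper)."""
    # Imported here so the counting functions work without matplotlib installed.
    import matplotlib.pyplot as plt

    if not counts:
        return  # nothing to plot
    labels, values = zip(*counts.most_common(top_n))
    plt.bar(labels, values)
    plt.xlabel("Item")
    plt.ylabel("Frequency")
    plt.tight_layout()
    plt.show()


# Example use, assuming input.txt exists next to the script:
#     with open("input.txt", encoding="utf-8") as f:
#         plot_counts(count_words(f.read()))
```

Counter does the tallying and most_common returns the items sorted by frequency, which keeps the graph readable when the text contains many distinct words.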

Next Steps

Add command line arguments for input file path and other options
Add timers for significant steps to diagnose performance
Optimize speed and memory usage
Anything else you can think of to improve the script
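
The first two items could start from something like the sketch below, which uses only the standard library. The option names (path, --mode, --top) and the helpers parse_args and timed are assumptions made for illustration and are not part of the existing script.

```python
import argparse
import time
from contextlib import contextmanager


def parse_args(argv=None):
    """Parse hypothetical command line options for the script."""
    parser = argparse.ArgumentParser(
        description="Count letter or word frequencies in a text file."
    )
    parser.add_argument("path", nargs="?", default="input.txt",
                        help="text file to analyze (default: input.txt)")
    parser.add_argument("--mode", choices=["letters", "words"], default="words",
                        help="what to count")
    parser.add_argument("--top", type=int, default=20,
                        help="number of bars to show in the graph")
    return parser.parse_args(argv)


@contextmanager
def timed(label, results):
    """Record the elapsed seconds of the enclosed block under results[label]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        results[label] = time.perf_counter() - start


# Example use:
#     args = parse_args()
#     timings = {}
#     with timed("read", timings):
#         with open(args.path, encoding="utf-8") as f:
#             text = f.read()
#     print(timings)
```

Wrapping each significant step (reading, counting, plotting) in its own timed block shows where the time goes before any optimization is attempted.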

License

MIT License
