Vector-Semantics-Evaluation

This repository contains code for evaluating the alignment between lexical semantics and vector semantics using the SimLex-999 dataset and the Brown corpus from NLTK. The evaluation is conducted using various vector semantic methods, including TF-iDF and Word2Vec, and the results are compared against manually determined lexical semantics.

Introduction

Lexical semantics and vector semantics are two approaches to understanding the meaning of words in natural language processing. This project aims to evaluate how well vector semantic methods capture the semantics of words compared to manually determined lexical semantics.

Datasets

SimLex-999: Golden standard dataset for lexical semantics evaluation.
Brown corpus: Large corpus from NLTK used for training vector semantic methods.

Usage

To run the evaluation code:

Just download the SimLex-999 file and give path in the ipynb file.
Run the provided Python ipynb file, specifying the desired settings for vector semantic methods for both TF-IDF and Word2Vec.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
SimLex-999.txt		SimLex-999.txt
Vector_Semantics_Evaluation.ipynb		Vector_Semantics_Evaluation.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vector-Semantics-Evaluation

Introduction

Datasets

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

charan-d55/Vector-Semantics-Evaluation

Folders and files

Latest commit

History

Repository files navigation

Vector-Semantics-Evaluation

Introduction

Datasets

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages