ChineseTextClassifier by ami66

Chinese short text classifier for sentiment analysis

Created 6 years ago

369 stars

Top 76.9% on SourcePulse

Project Summary

This repository provides a Chinese text classifier for short product reviews, primarily for sentiment analysis. It offers a range of models achieving over 90% accuracy, targeting developers and researchers working with Chinese e-commerce data.

How It Works

The classifier implements several deep learning architectures, including Transformer, word2vec combined with TextCNN, FastText, and recurrent networks (LSTM/GRU) with Attention mechanisms. This multi-model approach allows for flexibility and comparison, with word embeddings pre-trained on a large dataset to capture semantic meaning.

Quick Start & Requirements

Install via pip install tensorflow==2.0.
Requires Python 3.
Dataset: 100,000京东 (JD.com) product reviews (data/goods_zh.txt), labeled as 0 (negative) or 1 (positive).

Highlighted Details

Achieves >90% accuracy across implemented models.
Models include Transformer, word2vec+TextCNN, FastText, word2vec+LSTM/GRU, word2vec+LSTM/GRU+Attention, and word2vec+Bi_LSTM+Attention.
Future improvements planned for GloVe, GPT, BERT, and ERNIE.

Maintenance & Community

Project maintained by ami66.
WeChat public account ID: datanlp for more ML/DL project knowledge.

Licensing & Compatibility

License not specified in the README.
Compatibility for commercial use or closed-source linking is undetermined.

Limitations & Caveats

The project is built on TensorFlow 2.0, which is an older version. The README does not specify the license, which may impact commercial use.

Health Check

Last Commit

4 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

1 stars in the last 30 days

Explore Similar Projects

Macadam by yongzhuo

NLP tool for text classification, sequence labeling, and relation extraction

Created 5 years ago

Updated 2 years ago

SentimentAnalysis by barissayil

Sentiment analysis via fine-tuned transformer

Created 6 years ago

Updated 2 years ago

pytorch-transformers-classification by ThilinaRajapakse

Deprecated starter for Transformer-based text classification tasks

Created 6 years ago

Updated 5 years ago

awesome-sentiment-analysis by laugustyniak

Sentiment analysis resources: frameworks, libraries, software, papers

Created 9 years ago

Updated 1 month ago

BDCI_Car_2018 by yilifzf

Solution for sentiment analysis and topic recognition

Created 7 years ago

Updated 7 years ago

awesome-sentiment-analysis by xiamx

Sentiment analysis resource list

Created 9 years ago

Updated 7 years ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect),

Luis Capelo

Luis Capelo(Cofounder of Lightning AI), and

4 more.

sentiment-discovery by NVIDIA

Language modeling and sentiment classification in PyTorch (deprecated, see Megatron-LM)

Created 8 years ago

Updated 5 years ago

Sentiment-Analysis-Twitter by ayushoriginal

NLP research paper for Twitter sentiment analysis

Created 9 years ago

Updated 5 years ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity),

Shyamal Anadkat

Shyamal Anadkat(Research Scientist at OpenAI), and

11 more.

generating-reviews-discovering-sentiment by openai

Language model code for generating reviews and discovering sentiment

Created 9 years ago

Updated 2 years ago

finBERT by ProsusAI

Financial sentiment analysis via fine-tuned BERT

Created 6 years ago

Updated 3 years ago

Starred by

Travis Fischer

Travis Fischer(Founder of Agentic),

Luis Capelo

Luis Capelo(Cofounder of Lightning AI), and

1 more.

vaderSentiment by cjhutto

Sentiment analysis tool attuned to social media texts

Created 11 years ago

Updated 1 year ago

Starred by

Zack Li

Zack Li(Cofounder of Nexa AI),

Andrew Kane

Andrew Kane(Author of pgvector), and

5 more.

text_classification by brightmart

Text classification models using deep learning

Created 8 years ago

Updated 2 years ago

Feedback? Help us improve.