Swinburne
Browse

Text Classification via Semantic Representation and Similarity

Download (2.43 MB)
thesis
posted on 2024-07-12, 21:19 authored by Xinyi Cai
Text classification focuses on categorizing raw texts to specific predefined groups. Text classification consists of intent classification, topic classification, sentiment classification, and language identification due to different classification goals. There are several emerging challenges for text classification applied in real scenarios, including out-of-scope classification, zero-shot classification, few-shot classification, etc. Addressing these challenges requires advancements in model architectures, training techniques, data augmentation strategies, and evaluation metrics. In this thesis, we provide three lines of works to improve the performance of text classification on transformer-based models figuring out one or more challenging problems via semantic representation and similarity.

History

Thesis type

  • Thesis (PhD)

Thesis note

Thesis submitted for the Degree of Doctor of Philosophy, Swinburne University of Technology, 2023.

Copyright statement

Copyright © 2023 Xinyi Cai.

Supervisors

Jinjun Chen

Language

eng

Usage metrics

    Theses

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC