1 code implementation • 10 Apr 2024 • Jianzhi Liu, Hexiang Gu, Tianyu Zheng, Liuyu Xiang, Huijia Wu, Jie Fu, Zhaofeng He
We propose a new metric to assess personality generation capability based on this evaluation method.
1 code implementation • 20 Feb 2024 • Hao Zhao, Zihan Qiu, Huijia Wu, Zili Wang, Zhaofeng He, Jie Fu
The Mixture of Experts (MoE) approach for language models has proven effective at augmenting model capacity by dynamically routing each input token to a specific subset of experts for processing.
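For intuition about how such routing works, here is a minimal top-k MoE layer in PyTorch. All names (`MoELayer`, `num_experts`, `top_k`) and sizes are illustrative assumptions, not the paper's implementation:

```python
# Minimal sketch of top-k token routing in a Mixture-of-Experts layer.
# Names and sizes are illustrative, not from the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    def __init__(self, d_model=64, num_experts=4, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.gate = nn.Linear(d_model, num_experts)          # router
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model),
                          nn.ReLU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x):                                    # x: (tokens, d_model)
        scores = self.gate(x)                                # (tokens, num_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)       # each token picks top-k experts
        weights = F.softmax(weights, dim=-1)                 # normalize over chosen experts
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e                        # tokens routed to expert e at slot k
                if mask.any():
                    out[mask] += weights[mask, k, None] * expert(x[mask])
        return out

x = torch.randn(10, 64)      # 10 tokens
print(MoELayer()(x).shape)   # torch.Size([10, 64])
```

Only the selected experts run on each token, which is what lets MoE grow parameter count without a proportional increase in per-token compute.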
no code implementations • 3 Jan 2017 • Huijia Wu, Jiajun Zhang, Cheng-qing Zong
To simplify the stacked architecture, we propose a framework called the shortcut block, which marries the gating mechanism with shortcuts while discarding the self-connected part of the LSTM cell.
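One way to read the gated-shortcut idea is sketched below: a learned gate mixes a feed-forward transformation with an identity shortcut, with no recurrent self-connection. The names and sizes are illustrative assumptions, not the paper's code:

```python
# Hedged sketch of a gated shortcut: a sigmoid gate interpolates between
# the layer's transformation and the unchanged input (the shortcut),
# without any recurrent self-connection. Names/sizes are assumptions.
import torch
import torch.nn as nn

class ShortcutBlock(nn.Module):
    def __init__(self, d=128):
        super().__init__()
        self.transform = nn.Linear(d, d)
        self.gate = nn.Linear(d, d)

    def forward(self, x):
        g = torch.sigmoid(self.gate(x))      # gate values in (0, 1)
        h = torch.tanh(self.transform(x))    # candidate output
        return g * h + (1.0 - g) * x         # gated mix of transform and shortcut

print(ShortcutBlock()(torch.randn(5, 128)).shape)  # torch.Size([5, 128])
```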
no code implementations • COLING 2016 • Huijia Wu, Jiajun Zhang, Cheng-qing Zong
In this paper, we empirically explore the effects of various kinds of skip connections in stacked bidirectional LSTMs for sequential tagging.
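For intuition, here is a minimal sketch of one such variant: a residual-style skip across each stacked BiLSTM layer. The depth, sizes, and specific connection pattern are assumptions, since the paper compares several kinds of skip connections:

```python
# Minimal sketch of a skip connection across stacked BiLSTM layers:
# each layer's output is added to its input. Depth and sizes are assumed.
import torch
import torch.nn as nn

class StackedBiLSTM(nn.Module):
    def __init__(self, d=100, layers=3):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.LSTM(d, d // 2, bidirectional=True, batch_first=True)
            for _ in range(layers)
        )

    def forward(self, x):          # x: (batch, seq, d)
        for lstm in self.layers:
            out, _ = lstm(x)       # bidirectional output has width d
            x = x + out            # skip connection across the layer
        return x

x = torch.randn(2, 7, 100)         # batch of 2, length-7 sequences
print(StackedBiLSTM()(x).shape)    # torch.Size([2, 7, 100])
```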
no code implementations • 10 Oct 2016 • Huijia Wu, Jiajun Zhang, Cheng-qing Zong
These observations motivate us to build a supertagger with a dynamic window approach, which can be treated as an attention mechanism over local contexts.
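One plausible reading of attention over local contexts is sketched below: each position attends only to neighbours within a window of w tokens on either side. The window size and dimensions are assumed for illustration and are not the paper's dynamic-window mechanism itself:

```python
# Hedged sketch of attention restricted to a local context window:
# positions outside +/- w tokens are masked out before the softmax.
# Window size and dimensions are illustrative assumptions.
import torch
import torch.nn.functional as F

def local_window_attention(h, w=2):
    """h: (seq, d) hidden states; returns (seq, d) context vectors."""
    seq, d = h.shape
    scores = h @ h.t() / d ** 0.5                    # (seq, seq) similarities
    pos = torch.arange(seq)
    mask = (pos[:, None] - pos[None, :]).abs() > w   # True outside the window
    scores = scores.masked_fill(mask, float('-inf'))
    return F.softmax(scores, dim=-1) @ h             # weighted local contexts

h = torch.randn(9, 32)
print(local_window_attention(h).shape)               # torch.Size([9, 32])
```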