大语言模型(LLMs A-Z)

A

AnythingLLM

 

 

 

 

 

B

百川百小应

Baichuan4 Playground

百度DeepSeek R1

Brisk Teaching

 

 

C

ChatGPT***

ChatPDF

ChatPPT

ChatTTS

橙篇

Claude.ai***

C

Common Crawl

Copilot

COZE扣子

Curipod

 

 

D

DeepL Write

DeepSeek** (深度求索)

Diffit

豆包**

Dream Machine (Luma AI)

 

E

Eduaide

 

 

 

 

 

F

Fliki

 

 

 

 

 

G

Gamma

Gemini**

Genspark

GitHub

Gencraft

Google AI Studio**

G

GPT-Zero

GPTZero.me

Grok (xAI)

Groq

 

 

H

海螺AI (MiniMax)*

Hugging Face

 

 

 

 

I

Ideagram

 

 

 

 

 

J

即梦AI

字源

 

 

 

 

K

可灵AI

Kimi (月之暗面)

 

 

 

 

L

Liner

Llama 2 and 3

 

 

 

 

M

MagicSchool**

MaxAI***

Meta.ai

Midjourney

Mindshow

Mistral

M

Manus AI

Mizou

秘塔AI搜索

Monica

 

 

N

Natural Readers* (Text to speech)

NotebookLM*

 

 

 

 

O

Ollama

OpenAI

OpenAI Playground

OpenAI Prompt engineering Official Guide

 

 

P

Perplexity

PIKA

POE**

PromptBank

 

 

Q

QuillBot

 

 

 

 

 

R

Runway

 

 

 

 

 

S

Sonauto

Sider***

Sora**

Suno**

Synthesia (Turn text to video)

 

T

Teachinglab

Teachology

天工AI

通义**

通义万相*(文生图、文生视频)

TTSMaker

T

TTSMP3

Twee.com**

There is an AI for that

 

 

 

U

Udio

Unipus AIGC**

 

 

 

 

V

Veo

Viva

 

 

 

 

W

Writefull Paraphraser

万知

文心一言

 

 

 

X

讯飞星火

讯飞配音(语音合成、虚拟人制作)

 

 

 

 

Y

有言3D*

 

 

 

 

 

Z

ZeroGPT

智谱清言**

AI(思维导图)

智增LLM API

 

 

 

360 AI 浏览器

360智脑

 

 

 

 

Note: The number of asterisks indicates our level of recommendation. Three asterisks represent the highest level of recommendation.

 

语料天涯(Corpora A-Z)

 

A

Academic Phrasebank

AHRC Expert Seminars: Word Frequency & Keyword Extraction

AntConc

AntWordProfiler

 

 

B

Babel parallel corpus

BASE: British Academic Spoken English Corpus

BAWE: British Academic Written English Corpus

北大CCL语料库

 

 

B

北外语料库团队官方网站

北语BCC语料库

BFSU CQPweb

BFSU PowerConc

BNC at English-Corpora.org

BNC2014

B

BNC freq. lists

BNC Simple Search

BNC Web Index

Brown Corpus_1

Brown Corpus_2

Brown for Psycholinguistics

B

Business English Corpus (compiled at UIBE)

 

 

 

 

 

C

CCTFC胡显耀外译汉小说语料库

Centre for Corpus Approaches to Social Science

传媒语言语料库在线分词标注系统

CLARIN

CLAWS trial

CLICS (Database of Cross-Linguistic Colexifications)

C

Cobuild Grammar Patterns 1: Verbs

COCA

CoRD: Corpus Resource Database

Corpora journal

Corpora List

CLOB2009

C

Computer-Assisted Language Comparison

 

 

 

 

 

C

CORE: Corpus of Online Registers of English

CECPC (BFSU Chinese-English Parallel Corpus) by Prof. Kefei Wang

The CLiC Dickens Project

Coh-Metrix

Corpus-analysis.com

Corpus Finder

C

Corpus Linguistics and Linguistics Theory journal

Corpus-based Linguistics Links (David Lee & Martin Weisser)

Corpus of TED Speeches

CROWN2009

CROWN2021

Crown and CLOB

D

Developing Linguistic Corpora: A Guide to Good Practice

Digital Scholarship in the Humanities journal

Discourse Analyzer

dtSearch

 

 

E

ELRA (The European Language Resources Association)

European Corpus of Academic Talk (EuroCoAT)

 

 

 

 

F

FireAnt: A freeware social media and data analysis toolkit

FrameNet

FrazeIt

 

 

 

G

Genealogies of Knowledge

German learner corpus

Global WordNet Associaton

Global Web-Based English (GloWbE)

Google books corpus

Project Gutenberg

H

哈工大社会计算与信息检索研究中心

汉语新闻

红楼梦Parallel Corpus

HowNet知网

HSK动态作文语料库

 

I

ICAME Journal

International Comparable Corpus (ICC)

ICE download

Idiomsearch

IJCL journal

iWrite learner corpus (error-annotated)

I

iWriteBaby corpus

 

 

 

 

 

J

Just-the-word

 

 

 

 

 

K

KH Coder

Kibbitzers (by Tim Johns)

Kristopher Kyle Text Analysis Tools

 

 

 

L

LancsBox

Lancaster CQPweb

LCMC (Lancaster Corpus of Mandarin Chinese)

Learner Corpus Association

Level tests

Lexical Syllabus, Dave Willis

L

LexiBank

 

 

 

 

 

L

Lextutor

Linguistic Data Consortium (LDC)

Linggle

Livac汉语变异

LIWC (Linguistic Inquiry and Word Count)

 

M

MICASE

MICASE Kibbitzers DDL cases

MICUSP

MRC Psycholinguistic Database

 

 

N

NESSIE Corpus 1st release (Native English Speakers Similarly or Identically-prompted Essays)

NESSIE Corpus 2nd release

Noun Phrase search

NOW Corpus

 

 

O

OLAC语言开发典藏社群:Open Language Archives Community

OPUS: The open parallel corpus

Oxford Text Archive

 

 

 

P

Pattern Dictionary of English Prepositions

Pattern Dictionary of English Verbs

Paul Nation Range Vocabulary

Pear Stories

PolyU Language Bank

Phrase in English

Q

全球汉语中介语语料库

全球华语语料库GCC

QuillBot

 

 

 

R

Russian Error-Annotated Learner English Corpus (REALEC)

 

 

 

 

 

S

SEW: A Wikipedia corpus with 200M sense annotations

数据堂

SKELL at Sketch Engine

Sketch Engine

 

 

S

StringNet3.0

StringNet4.0

 

 

 

 

T

台湾中研院汉语语料库

TalkBank

TEC browser

TED英汉平行演讲语料库 (en->zh)

TED英汉平行演讲语料库 (zh->en)

The TECCL Chinese Learners English Corpus

T

Textinspector

TIME at English-Corpora

T-Lab

Tmxmall

ToRCH2009 汉语语料库

ToRCH2014汉语语料库

T

ToRCH2019汉语语料库

TreeTagger Windows interface

 

 

 

 

U

UCREL Corpus Application Server at Lancaster University

UCLA Corpus of Written Chinese

United Nations Parallel Corpus

Universal Dependencies

USAS Semantic tagger

 

V

VOICE corpus

VerbNet

VersaText

Visual Language Research Corpus (VLRC)

Vocabulary Size Test (Paul Nation)

 

Voyant Tools

W

WebCorp

Wmatrix

Word neighbors

WordSmith

Word frequency lists

WordNet

W

Writefull

 

 

 

 

 

Z

ZCTC浙大翻译汉语语料库

中国传媒大学语料库

中文语言资源联盟

诸子百家ctext.org

 

 

 



语料库语言学家及团队(Corpus Linguists and Research Groups)

B

BFSU Corpus Research Group

Douglas Biber

 

 

 

 

C

CASS Centre

 

 

 

 

 

G

Dirk Geeraerts

Stefan Th. Gries

 

 

 

 

M

Michaela Mahlberg

Tony McEnery

 

 

 

 

Q

QLVL, KU Leuven

 

 

 

 

 

S

SHISU Institute of Corpus Studies and Applications

 

 

 

 

 

 

This page is collated and maintained by Dr. Jiajin Xu, with updates by Hui Kang and Likai Yin, from the Corpus Research Group, Beijing Foreign Studies University.

If you encounter any dead links or would like to suggest new resources to be included, please email bfsucrg@sina.com.

Last edited: 8 March, 2025