Package 'rwstats'

Title: Chinese Character Frequency in Real World
Description: It contains Chinese character frequency data based on news data from 2017 to 2019.
Authors: Yitao Ma [aut, cre], Xinjin Zhang [aut], Bin Cui [aut]
Maintainer: Yitao Ma <[email protected]>
License: CC0
Version: 0.1
Built: 2025-03-01 05:51:35 UTC
Source: https://github.com/schirp/rwstats

Help Index


Chinese Word Frequency with five Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

fiveChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

fiveChar

Chinese Word Frequency with four Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

fourChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

fourChar

Chinese Word Frequency with one Character

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

oneChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

oneChar

Chinese Word Frequency with three Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

threeChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

threeChar

Chinese Word Frequency with two Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

twoChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

twoChar

Occurrence and frequency of the next word

Description

This is a function that lists the next word appearing frequency by a given chinese character based on real world chinese character frequency Statistics table.

Usage

wordMiner.next(candidateWord, topN = 10)

Arguments

candidateWord

A single Chinese character

topN

The number of raws of output dataframe

Value

A dataframe containing the next word and its occurrence and frequency

Examples

## Not run: 
wordMiner.next(strsplit(levels(twoChar$character)[988], "")[[1]][1])
wordMiner.next(strsplit(levels(twoChar$character)[988], "")[[1]][2], 20)

## End(Not run)

Occurrence and frequency of the previous word

Description

This is a function that lists the previous word appearing frequency by a given chinese character based on real world chinese character frequency Statistics table.

Usage

wordMiner.previous(candidateWord, topN = 10)

Arguments

candidateWord

A single Chinese character

topN

The number of raws of output dataframe

Value

A dataframe containing the previous word and its occurrence and frequency

Examples

## Not run: 
wordMiner.previous(strsplit(levels(twoChar$character)[988], "")[[1]][1])
wordMiner.previous(strsplit(levels(twoChar$character)[988], "")[[1]][2], 20)

## End(Not run)