大数据很热,用大数据挖个单词表试试

最近在学习SEO(searching engine optimization)的时候受到启发,想到能用Google大数据挖掘出英语中最常见的单词。

其实方法很简单,去Google找stop word list。什么叫stop words呢?Stop word就是搜索引擎在搜索算法中忽略掉的词。为什么要忽略掉这个词呢?是因为这些词太太太常见了,以致于搜索引擎需要禁止自己的爬虫抓取这些词以节约缓存和增加搜索速度。这些常用搜索词源自大数据分析的结果。

举个例子,用百度搜索一些带”的“的词,百度就会自动省略”的“,比如你搜索”的字开头诗句“,搜搜结果会自动忽略到”的“(有点悲催)。因为“的”字太常见了。

同样道理,你用google搜索英文关键词,也会省略一些很常用的英文单词。而反过来,这些被google程序算法排出的stop word的list,自然就是我们这些学英语的人最最需要学习的单词了,而且,由于源自大数据,这可能是世界上最最精确的英语常见单词了。

当然这些单词呢,也有不足,那就是他们主要会以介词为主。不过,无论如何,这是特别好的一个单词表啦,毕竟全英语世界的人都在不自然地搜它们!!

下面是我找到的stop word list,六百个词

仅供自我学习和大家学习交流之用。

a | hence | see

able | her | seeing

about | here | seem

above | hereafter | seemed

abroad | hereby | seeming

according | herein | seems

accordingly | here's | seen

across | hereupon | self

actually | hers | selves

adj | herself | sensible

after | he's | sent

afterwards | hi | serious

again | him | seriously

against | himself | seven

ago | his | several

ahead | hither | shall

ain't | hopefully | shan't

all | how | she

allow | howbeit | she'd

allows | however | she'll

almost | hundred | she's

alone | i | should

along | i'd | shouldn't

alongside | ie | since

already | if | six

also | ignored | so

although | i'll | some

always | i'm | somebody

am | immediate | someday

amid | in | somehow

amidst | inasmuch | someone

among | inc | something

amongst | inc. | sometime

an | indeed | sometimes

and | indicate | somewhat

another | indicated | somewhere

any | indicates | soon

anybody | inner | sorry

anyhow | inside | specified

anyone | insofar | specify

anything | instead | specifying

anyway | into | still

anyways | inward | sub

anywhere | is | such

apart | isn't | sup

appear | it | sure

appreciate | it'd | t

appropriate | it'll | take

are | its | taken

aren't | it's | taking

around | itself | tell

as | i've | tends

a's | j | th

aside | just | than

ask | k | thank

asking | keep | thanks

associated | keeps | thanx

at | kept | that

available | know | that'll

away | known | thats

awfully | knows | that's

b | l | that've

back | last | the

backward | lately | their

backwards | later | theirs

be | latter | them

became | latterly | themselves

because | least | then

become | less | thence

becomes | lest | there

becoming | let | thereafter

been | let's | thereby

before | like | there'd

beforehand | liked | therefore

begin | likely | therein

behind | likewise | there'll

being | little | there're

believe | look | theres

below | looking | there's

beside | looks | thereupon

besides | low | there've

best | lower | these

better | ltd | they

between | m | they'd

beyond | made | they'll

both | mainly | they're

brief | make | they've

but | makes | thing

by | many | things

c | may | think

came | maybe | third

can | mayn't | thirty

cannot | me | this

cant | mean | thorough

can't | meantime | thoroughly

caption | meanwhile | those

cause | merely | though

causes | might | three

certain | mightn't | through

certainly | mine | throughout

changes | minus | thru

clearly | miss | thus

c'mon | more | till

co | moreover | to

co. | most | together

com | mostly | too

come | mr | took

comes | mrs | toward

concerning | much | towards

consequently | must | tried

consider | mustn't | tries

considering | my | truly

contain | myself | try

containing | n | trying

contains | name | t's

corresponding | namely | twice

could | nd | two

couldn't | near | u

course | nearly | un

c's | necessary | under

currently | need | underneath

d | needn't | undoing

dare | needs | unfortunately

daren't | neither | unless

definitely | never | unlike

described | neverf | unlikely

despite | neverless | until

did | nevertheless | unto

didn't | new | up

different | next | upon

directly | nine | upwards

do | ninety | us

does | no | use

doesn't | nobody | used

doing | non | useful

done | none | uses

don't | nonetheless | using

down | noone | usually

downwards | no-one | v

during | nor | value

e | normally | various

each | not | versus

edu | nothing | very

eg | notwithstanding | via

eight | novel | viz

eighty | now | vs

either | nowhere | w

else | o | want

elsewhere | obviously | wants

end | of | was

ending | off | wasn't

enough | often | way

entirely | oh | we

especially | ok | we'd

et | okay | welcome

etc | old | well

even | on | we'll

ever | once | went

evermore | one | were

every | ones | we're

everybody | one's | weren't

everyone | only | we've

everything | onto | what

everywhere | opposite | whatever

ex | or | what'll

exactly | other | what's

example | others | what've

except | otherwise | when

f | ought | whence

fairly | oughtn't | whenever

far | our | where

farther | ours | whereafter

few | ourselves | whereas

fewer | out | whereby

fifth | outside | wherein

first | over | where's

five | overall | whereupon

followed | own | wherever

following | p | whether

follows | particular | which

for | particularly | whichever

forever | past | while

former | per | whilst

formerly | perhaps | whither

forth | placed | who

forward | please | who'd

found | plus | whoever

four | possible | whole

from | presumably | who'll

further | probably | whom

furthermore | provided | whomever

g | provides | who's

get | q | whose

gets | que | why

getting | quite | will

given | qv | willing

gives | r | wish

go | rather | with

goes | rd | within

going | re | without

gone | really | wonder

got | reasonably | won't

gotten | recent | would

greetings | recently | wouldn't

h | regarding | x

had | regardless | y

hadn't | regards | yes

half | relatively | yet

happens | respectively | you

hardly | right | you'd

has | round | you'll

hasn't | s | your

have | said | you're

haven't | same | yours

having | saw | yourself

he | say | yourselves

he'd | saying | you've

he'll | says | z

hello | second | zero

help | secondly |

好好学习,天天向上,尽管不认为背单词是一件特别好的英语学习方式,但是我知道常用单词肯定是必须学习的嘛~

你可能感兴趣的:(大数据很热,用大数据挖个单词表试试)