最近在学习SEO(searching engine optimization)的时候受到启发,想到能用Google大数据挖掘出英语中最常见的单词。
其实方法很简单,去Google找stop word list。什么叫stop words呢?Stop word就是搜索引擎在搜索算法中忽略掉的词。为什么要忽略掉这个词呢?是因为这些词太太太常见了,以致于搜索引擎需要禁止自己的爬虫抓取这些词以节约缓存和增加搜索速度。这些常用搜索词源自大数据分析的结果。
举个例子,用百度搜索一些带”的“的词,百度就会自动省略”的“,比如你搜索”的字开头诗句“,搜搜结果会自动忽略到”的“(有点悲催)。因为“的”字太常见了。
同样道理,你用google搜索英文关键词,也会省略一些很常用的英文单词。而反过来,这些被google程序算法排出的stop word的list,自然就是我们这些学英语的人最最需要学习的单词了,而且,由于源自大数据,这可能是世界上最最精确的英语常见单词了。
当然这些单词呢,也有不足,那就是他们主要会以介词为主。不过,无论如何,这是特别好的一个单词表啦,毕竟全英语世界的人都在不自然地搜它们!!
下面是我找到的stop word list,六百个词
仅供自我学习和大家学习交流之用。
a | hence | see
able | her | seeing
about | here | seem
above | hereafter | seemed
abroad | hereby | seeming
according | herein | seems
accordingly | here's | seen
across | hereupon | self
actually | hers | selves
adj | herself | sensible
after | he's | sent
afterwards | hi | serious
again | him | seriously
against | himself | seven
ago | his | several
ahead | hither | shall
ain't | hopefully | shan't
all | how | she
allow | howbeit | she'd
allows | however | she'll
almost | hundred | she's
alone | i | should
along | i'd | shouldn't
alongside | ie | since
already | if | six
also | ignored | so
although | i'll | some
always | i'm | somebody
am | immediate | someday
amid | in | somehow
amidst | inasmuch | someone
among | inc | something
amongst | inc. | sometime
an | indeed | sometimes
and | indicate | somewhat
another | indicated | somewhere
any | indicates | soon
anybody | inner | sorry
anyhow | inside | specified
anyone | insofar | specify
anything | instead | specifying
anyway | into | still
anyways | inward | sub
anywhere | is | such
apart | isn't | sup
appear | it | sure
appreciate | it'd | t
appropriate | it'll | take
are | its | taken
aren't | it's | taking
around | itself | tell
as | i've | tends
a's | j | th
aside | just | than
ask | k | thank
asking | keep | thanks
associated | keeps | thanx
at | kept | that
available | know | that'll
away | known | thats
awfully | knows | that's
b | l | that've
back | last | the
backward | lately | their
backwards | later | theirs
be | latter | them
became | latterly | themselves
because | least | then
become | less | thence
becomes | lest | there
becoming | let | thereafter
been | let's | thereby
before | like | there'd
beforehand | liked | therefore
begin | likely | therein
behind | likewise | there'll
being | little | there're
believe | look | theres
below | looking | there's
beside | looks | thereupon
besides | low | there've
best | lower | these
better | ltd | they
between | m | they'd
beyond | made | they'll
both | mainly | they're
brief | make | they've
but | makes | thing
by | many | things
c | may | think
came | maybe | third
can | mayn't | thirty
cannot | me | this
cant | mean | thorough
can't | meantime | thoroughly
caption | meanwhile | those
cause | merely | though
causes | might | three
certain | mightn't | through
certainly | mine | throughout
changes | minus | thru
clearly | miss | thus
c'mon | more | till
co | moreover | to
co. | most | together
com | mostly | too
come | mr | took
comes | mrs | toward
concerning | much | towards
consequently | must | tried
consider | mustn't | tries
considering | my | truly
contain | myself | try
containing | n | trying
contains | name | t's
corresponding | namely | twice
could | nd | two
couldn't | near | u
course | nearly | un
c's | necessary | under
currently | need | underneath
d | needn't | undoing
dare | needs | unfortunately
daren't | neither | unless
definitely | never | unlike
described | neverf | unlikely
despite | neverless | until
did | nevertheless | unto
didn't | new | up
different | next | upon
directly | nine | upwards
do | ninety | us
does | no | use
doesn't | nobody | used
doing | non | useful
done | none | uses
don't | nonetheless | using
down | noone | usually
downwards | no-one | v
during | nor | value
e | normally | various
each | not | versus
edu | nothing | very
eg | notwithstanding | via
eight | novel | viz
eighty | now | vs
either | nowhere | w
else | o | want
elsewhere | obviously | wants
end | of | was
ending | off | wasn't
enough | often | way
entirely | oh | we
especially | ok | we'd
et | okay | welcome
etc | old | well
even | on | we'll
ever | once | went
evermore | one | were
every | ones | we're
everybody | one's | weren't
everyone | only | we've
everything | onto | what
everywhere | opposite | whatever
ex | or | what'll
exactly | other | what's
example | others | what've
except | otherwise | when
f | ought | whence
fairly | oughtn't | whenever
far | our | where
farther | ours | whereafter
few | ourselves | whereas
fewer | out | whereby
fifth | outside | wherein
first | over | where's
five | overall | whereupon
followed | own | wherever
following | p | whether
follows | particular | which
for | particularly | whichever
forever | past | while
former | per | whilst
formerly | perhaps | whither
forth | placed | who
forward | please | who'd
found | plus | whoever
four | possible | whole
from | presumably | who'll
further | probably | whom
furthermore | provided | whomever
g | provides | who's
get | q | whose
gets | que | why
getting | quite | will
given | qv | willing
gives | r | wish
go | rather | with
goes | rd | within
going | re | without
gone | really | wonder
got | reasonably | won't
gotten | recent | would
greetings | recently | wouldn't
h | regarding | x
had | regardless | y
hadn't | regards | yes
half | relatively | yet
happens | respectively | you
hardly | right | you'd
has | round | you'll
hasn't | s | your
have | said | you're
haven't | same | yours
having | saw | yourself
he | say | yourselves
he'd | saying | you've
he'll | says | z
hello | second | zero
help | secondly |
好好学习,天天向上,尽管不认为背单词是一件特别好的英语学习方式,但是我知道常用单词肯定是必须学习的嘛~