Most common words in spoken English

I've seen lists of the most common words in English, compiled from bodies of text.



Normally "the" is ranked first, "and" and "to" are quite high, etc.



But this is only written English.



I wonder what the most common words are in spoken English. I feel like "hello", "how are you", etc. are much, much more common in spoken English than in written English.










word-usage






asked 5 hours ago









theonlygusti












  • I'm thinking that it would be, uh ....

    – Hot Licks
    4 hours ago






  • You know, uh, like, I think, yeah.

    – TaliesinMerlin
    4 hours ago











  • You might also think about the fact that in speech, there often aren't any words. Rather, there are fixed or semifixed phrases that get strung together, with contractions allover the place and words runtogether like shoulda and wanna. and like that. "Word" is a concept with sharp edges and may not be the tool of choice for actual fluid speech.

    – John Lawler
    3 hours ago











  • Greetings are probably more common in speech than in writing overall, but unless you’re working as an usher in the US, they aren’t going to be anywhere near as frequent as articles, certain prepositions, copulas or coordinators and subordinators – not by a long shot.

    – Janus Bahs Jacquet
    3 hours ago

















1 Answer














"The" is likely still the top word.



It's impossible to find this out for certain, since not all spoken language is recorded and corpuses tend to capture established usage. Furthermore, filler ("like", "oh", "um", "you know") may be underrepresented in corpuses, and spoken usage in general can vary widely depending on the context. Still, there are two good sources, one for American and one for British usage.



Method 1: Corpus search on the spoken subcorpus of COCA.



Result: "The". It isn't close, folks. (The "5000 words" list presents these words in an accessible format but doesn't separate written and spoken English.)



Limitation: COCA's spoken corpus comes from TV, radio, and sources that privilege standard American dialects in a professional register.



Method 2: Consult Geoffrey Leech, Paul Rayson, and Andrew Wilson, Word Frequencies in Written and Spoken English: Based on the British National Corpus (2001).



Result: "The" is the most frequent word in spoken English by raw count, though they cleverly find that "oh" and "yeah" are the words most distinctive of conversational (versus task-oriented) speech, and the most common interjections/discourse particles.



Limitations: British English. Spoken corpus was ~10 million words.
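
For anyone who wants to try this kind of count on their own transcripts, here is a minimal sketch of a raw frequency count. It is not how COCA or the BNC authors computed their lists: the tokenizer is a simple regular expression, the function name `word_frequencies` is hypothetical, and the transcript is an invented toy example rather than corpus data.

```python
import re
from collections import Counter


def word_frequencies(transcript: str) -> Counter:
    """Count lowercase word tokens, keeping contractions like "don't" whole."""
    # Assumption: a word is a run of letters, optionally followed by
    # an apostrophe and more letters. Real corpora use richer tokenizers.
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", transcript.lower())
    return Counter(tokens)


# Invented toy transcript, for illustration only (not real corpus data).
toy_transcript = (
    "Oh, hello! Yeah, the weather is, um, the same as the last time. "
    "You know, the thing is, I don't think the bus is coming."
)

counts = word_frequencies(toy_transcript)

# Print the three most frequent tokens with their raw counts.
for word, n in counts.most_common(3):
    print(word, n)
```

Even in a short greeting-heavy snippet like this, "the" comes out ahead of "hello", "oh" and "yeah". Note that a word-by-word count treats "you know" as two separate tokens, so fixed phrases would need a different unit of counting.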






        answered 4 hours ago









        TaliesinMerlin


























