How do dictionaries source attestation












4















Some dictionaries source attestation and try to go for the earliest quotes they can find. How do they find them? Without electronic indexing this must have been impossibly difficult.



The reason I'm asking is that many answers on Stackexchange often conclude earliest attestation and imply earliest use, instead of the logical used at the latest in ... That's a slightly different issue to frame the question, not to make it too broad. To be fair, no inference is made because of this in many cases.










share|improve this question



























    4















    Some dictionaries source attestation and try to go for the earliest quotes they can find. How do they find them? Without electronic indexing this must have been impossibly difficult.



    The reason I'm asking is that many answers on Stackexchange often conclude earliest attestation and imply earliest use, instead of the logical used at the latest in ... That's a slightly different issue to frame the question, not to make it too broad. To be fair, no inference is made because of this in many cases.










    share|improve this question

























      4












      4








      4


      1






      Some dictionaries source attestation and try to go for the earliest quotes they can find. How do they find them? Without electronic indexing this must have been impossibly difficult.



      The reason I'm asking is that many answers on Stackexchange often conclude earliest attestation and imply earliest use, instead of the logical used at the latest in ... That's a slightly different issue to frame the question, not to make it too broad. To be fair, no inference is made because of this in many cases.










      share|improve this question














      Some dictionaries source attestation and try to go for the earliest quotes they can find. How do they find them? Without electronic indexing this must have been impossibly difficult.



      The reason I'm asking is that many answers on Stackexchange often conclude earliest attestation and imply earliest use, instead of the logical used at the latest in ... That's a slightly different issue to frame the question, not to make it too broad. To be fair, no inference is made because of this in many cases.







      lexicography






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked 5 hours ago









      vectoryvectory

      35411




      35411






















          2 Answers
          2






          active

          oldest

          votes


















          3














          I agree with your assumption that the date of the earliest recorded usage of a word does not necessarily correspond to the earliest usage of a word, since words may have been in circulation in spoken language before they were first used in publications, and many old publications have simply not survived. I touched upon this issue in my answer to the question How many of Shakespeare's words in his plays were new? on Literature Stack Exchange: a number of words that are first attested in Shakespeare's plays may have circulated earlier, either in the spoken language or in writings that have not survived (or both).



          Before there were digital corpora and digital texts, lexicographers had to read physical books to find usage examples. The editors of the first edition of the Oxford English Dictionary (which started their work in 1857 or 1858) asked thousands of volunteer readers to submit usage examples, in what was an early example of crowdsourcing. (See the crowdsourcing timeline, which says that 800 volunteers contributed to the first fascicle alone.)



          See also the description of the OED's Reading Programme. James A. H. Murray set this up in 1879; its focus was not specifically on finding the earliest examples of words and phrases. In addition, the first edition "relied heavily on a small number of authors (notably, of course, Shakespeare) for its coverage of Early Modern English (1500-1700)". This selection was possibly too narrow to find the earliest surviving examples of words and phrases. For this reason, the current Historical Reading Programme looks at a broader spectrum of texts:




          Today, readers systematically survey a much broader spectrum of texts from this and other periods. A separate Historical Reading Programme has been created to serve this function. Readers participating in this programme supply material specifically for the revision of the OED; usually these are earlier examples of words or phrases that are already included in the Dictionary. These readers check their findings against the Oxford English Dictionary to ensure that only significant items are filed, such as earlier examples of words and meanings or terms that have not yet been registered in the database.




          In short, before the digital era, finding early usage examples required a lot of reading.






          share|improve this answer

































            2














            Typically, you don't ever really know for certain that you have the earliest example. Or even the earliest written example. It's just the best so far.



            (As a person who frequently writes answers to etymology questions on ELU, I try to make this clear. "According to the OED", "according to my own research", "dates at least back to X" are all things I say, but I sometimes get sloppy and don't do this all the time.)



            There are some exceptions. We can be very certain that "cromulent", for example, was coined in 1996 (or '95 depending on when the episode of the Simpsons was written).





            How etymological research is done has varied through time. In the case of the "New English Dictionary" (the first edition of the Oxford English Dictionary), work started on it in 1857. Then:




            [I]n January 1859, the Society issued their 'Proposal for the publication of a New English Dictionary,' in which the characteristics of the proposed work were explained, and an appeal made to the English and American public to assist in collecting the raw materials for the work, these materials consisting of quotations illustrating the use of English words by all writers of all ages and in all senses, each quotation being made on a uniform plan on a half-sheet of notepaper, that they might in due course be arranged and classified alphabetically and by meanings. This Appeal met with a generous response: some hundreds of volunteers began to read books, make quotations, and send in their slips to 'sub-editors,' who volunteered each to take charge of a letter or part of one, and by whom the slips were in tum further arranged, classified, and to some extent used as the basis of definitions and skeleton schemes of the meanings of words in preparation for the Dictionary.
            An Appeal to the English-Speaking and English-Reading Public to Read Books and Make Extracts for The Philological Society's New English Dictionary




            The "Reading Programme" is still used by the OED, although the methodology is different. The books are still read all the same but here's what happens next according to a freelance researcher for the OED:




            I then consult OED Online to determine whether the word or phrase is in the Dictionary: if it is not, I submit it as a ‘not-in’, and if it is, I decide whether its form or context is important enough to warrant its submission. If it does qualify, I enter the information into tagged fields in an electronic file that has been set up in a standard format. When I have finished the reading, I submit the file to Oxford or New York, where the records are incorporated into OED‘s working database for consideration by the editors, along with thousands of paper citation slips, as they proceed through the current revision. Yes, some of my finds are still submitted as paper slips—a reminder of OED‘s long heritage—but, electronic or paper, I can hardly imagine a better job.




            The quotations were collected in a machine readable format for the first time in 1989. The 1990 UK Reading Programme captured material electronically. (Note that the second edition of the Oxford English Dictionary came out in 1989.)



            In addition to this, the OED now utilizes several online databases of texts, such as Early English Books Online, Eighteenth Century Collections Online, and some newspaper databases.



            If you do your own research with databases (many people use the free Google Books), it's often easy to beat pages that haven't been updated for the third edition of the OED. Updates to the OED3 started in 2000 and continue to this day: it's a huge dictionary and updating takes time.



            See also:




            • OED: Researching the Language






            share|improve this answer
























            • I'm asking specifically because gbooks is not satisfying, not the least because OCR fails for old books. I take it that "you don't ever really know for certain" counts for lost copies, as well as otherwise unread/not-scanned books. "hundreds of volunteers" pale in comparison to the amount of available books.

              – vectory
              17 mins ago











            Your Answer








            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "312"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            noCode: true, onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














            draft saved

            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2flinguistics.stackexchange.com%2fquestions%2f30661%2fhow-do-dictionaries-source-attestation%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            2 Answers
            2






            active

            oldest

            votes








            2 Answers
            2






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            3














            I agree with your assumption that the date of the earliest recorded usage of a word does not necessarily correspond to the earliest usage of a word, since words may have been in circulation in spoken language before they were first used in publications, and many old publications have simply not survived. I touched upon this issue in my answer to the question How many of Shakespeare's words in his plays were new? on Literature Stack Exchange: a number of words that are first attested in Shakespeare's plays may have circulated earlier, either in the spoken language or in writings that have not survived (or both).



            Before there were digital corpora and digital texts, lexicographers had to read physical books to find usage examples. The editors of the first edition of the Oxford English Dictionary (which started their work in 1857 or 1858) asked thousands of volunteer readers to submit usage examples, in what was an early example of crowdsourcing. (See the crowdsourcing timeline, which says that 800 volunteers contributed to the first fascicle alone.)



            See also the description of the OED's Reading Programme. James A. H. Murray set this up in 1879; its focus was not specifically on finding the earliest examples of words and phrases. In addition, the first edition "relied heavily on a small number of authors (notably, of course, Shakespeare) for its coverage of Early Modern English (1500-1700)". This selection was possibly too narrow to find the earliest surviving examples of words and phrases. For this reason, the current Historical Reading Programme looks at a broader spectrum of texts:




            Today, readers systematically survey a much broader spectrum of texts from this and other periods. A separate Historical Reading Programme has been created to serve this function. Readers participating in this programme supply material specifically for the revision of the OED; usually these are earlier examples of words or phrases that are already included in the Dictionary. These readers check their findings against the Oxford English Dictionary to ensure that only significant items are filed, such as earlier examples of words and meanings or terms that have not yet been registered in the database.




            In short, before the digital era, finding early usage examples required a lot of reading.






            share|improve this answer






























              3














              I agree with your assumption that the date of the earliest recorded usage of a word does not necessarily correspond to the earliest usage of a word, since words may have been in circulation in spoken language before they were first used in publications, and many old publications have simply not survived. I touched upon this issue in my answer to the question How many of Shakespeare's words in his plays were new? on Literature Stack Exchange: a number of words that are first attested in Shakespeare's plays may have circulated earlier, either in the spoken language or in writings that have not survived (or both).



              Before there were digital corpora and digital texts, lexicographers had to read physical books to find usage examples. The editors of the first edition of the Oxford English Dictionary (which started their work in 1857 or 1858) asked thousands of volunteer readers to submit usage examples, in what was an early example of crowdsourcing. (See the crowdsourcing timeline, which says that 800 volunteers contributed to the first fascicle alone.)



              See also the description of the OED's Reading Programme. James A. H. Murray set this up in 1879; its focus was not specifically on finding the earliest examples of words and phrases. In addition, the first edition "relied heavily on a small number of authors (notably, of course, Shakespeare) for its coverage of Early Modern English (1500-1700)". This selection was possibly too narrow to find the earliest surviving examples of words and phrases. For this reason, the current Historical Reading Programme looks at a broader spectrum of texts:




              Today, readers systematically survey a much broader spectrum of texts from this and other periods. A separate Historical Reading Programme has been created to serve this function. Readers participating in this programme supply material specifically for the revision of the OED; usually these are earlier examples of words or phrases that are already included in the Dictionary. These readers check their findings against the Oxford English Dictionary to ensure that only significant items are filed, such as earlier examples of words and meanings or terms that have not yet been registered in the database.




              In short, before the digital era, finding early usage examples required a lot of reading.






              share|improve this answer




























                3












                3








                3







                I agree with your assumption that the date of the earliest recorded usage of a word does not necessarily correspond to the earliest usage of a word, since words may have been in circulation in spoken language before they were first used in publications, and many old publications have simply not survived. I touched upon this issue in my answer to the question How many of Shakespeare's words in his plays were new? on Literature Stack Exchange: a number of words that are first attested in Shakespeare's plays may have circulated earlier, either in the spoken language or in writings that have not survived (or both).



                Before there were digital corpora and digital texts, lexicographers had to read physical books to find usage examples. The editors of the first edition of the Oxford English Dictionary (which started their work in 1857 or 1858) asked thousands of volunteer readers to submit usage examples, in what was an early example of crowdsourcing. (See the crowdsourcing timeline, which says that 800 volunteers contributed to the first fascicle alone.)



                See also the description of the OED's Reading Programme. James A. H. Murray set this up in 1879; its focus was not specifically on finding the earliest examples of words and phrases. In addition, the first edition "relied heavily on a small number of authors (notably, of course, Shakespeare) for its coverage of Early Modern English (1500-1700)". This selection was possibly too narrow to find the earliest surviving examples of words and phrases. For this reason, the current Historical Reading Programme looks at a broader spectrum of texts:




                Today, readers systematically survey a much broader spectrum of texts from this and other periods. A separate Historical Reading Programme has been created to serve this function. Readers participating in this programme supply material specifically for the revision of the OED; usually these are earlier examples of words or phrases that are already included in the Dictionary. These readers check their findings against the Oxford English Dictionary to ensure that only significant items are filed, such as earlier examples of words and meanings or terms that have not yet been registered in the database.




                In short, before the digital era, finding early usage examples required a lot of reading.






                share|improve this answer















                I agree with your assumption that the date of the earliest recorded usage of a word does not necessarily correspond to the earliest usage of a word, since words may have been in circulation in spoken language before they were first used in publications, and many old publications have simply not survived. I touched upon this issue in my answer to the question How many of Shakespeare's words in his plays were new? on Literature Stack Exchange: a number of words that are first attested in Shakespeare's plays may have circulated earlier, either in the spoken language or in writings that have not survived (or both).



                Before there were digital corpora and digital texts, lexicographers had to read physical books to find usage examples. The editors of the first edition of the Oxford English Dictionary (which started their work in 1857 or 1858) asked thousands of volunteer readers to submit usage examples, in what was an early example of crowdsourcing. (See the crowdsourcing timeline, which says that 800 volunteers contributed to the first fascicle alone.)



                See also the description of the OED's Reading Programme. James A. H. Murray set this up in 1879; its focus was not specifically on finding the earliest examples of words and phrases. In addition, the first edition "relied heavily on a small number of authors (notably, of course, Shakespeare) for its coverage of Early Modern English (1500-1700)". This selection was possibly too narrow to find the earliest surviving examples of words and phrases. For this reason, the current Historical Reading Programme looks at a broader spectrum of texts:




                Today, readers systematically survey a much broader spectrum of texts from this and other periods. A separate Historical Reading Programme has been created to serve this function. Readers participating in this programme supply material specifically for the revision of the OED; usually these are earlier examples of words or phrases that are already included in the Dictionary. These readers check their findings against the Oxford English Dictionary to ensure that only significant items are filed, such as earlier examples of words and meanings or terms that have not yet been registered in the database.




                In short, before the digital era, finding early usage examples required a lot of reading.







                share|improve this answer














                share|improve this answer



                share|improve this answer








                edited 58 mins ago

























                answered 4 hours ago









                Christophe StrobbeChristophe Strobbe

                3831212




                3831212























                    2














                    Typically, you don't ever really know for certain that you have the earliest example. Or even the earliest written example. It's just the best so far.



                    (As a person who frequently writes answers to etymology questions on ELU, I try to make this clear. "According to the OED", "according to my own research", "dates at least back to X" are all things I say, but I sometimes get sloppy and don't do this all the time.)



                    There are some exceptions. We can be very certain that "cromulent", for example, was coined in 1996 (or '95 depending on when the episode of the Simpsons was written).





                    How etymological research is done has varied through time. In the case of the "New English Dictionary" (the first edition of the Oxford English Dictionary), work started on it in 1857. Then:




                    [I]n January 1859, the Society issued their 'Proposal for the publication of a New English Dictionary,' in which the characteristics of the proposed work were explained, and an appeal made to the English and American public to assist in collecting the raw materials for the work, these materials consisting of quotations illustrating the use of English words by all writers of all ages and in all senses, each quotation being made on a uniform plan on a half-sheet of notepaper, that they might in due course be arranged and classified alphabetically and by meanings. This Appeal met with a generous response: some hundreds of volunteers began to read books, make quotations, and send in their slips to 'sub-editors,' who volunteered each to take charge of a letter or part of one, and by whom the slips were in tum further arranged, classified, and to some extent used as the basis of definitions and skeleton schemes of the meanings of words in preparation for the Dictionary.
                    An Appeal to the English-Speaking and English-Reading Public to Read Books and Make Extracts for The Philological Society's New English Dictionary




                    The "Reading Programme" is still used by the OED, although the methodology is different. The books are still read all the same but here's what happens next according to a freelance researcher for the OED:




                    I then consult OED Online to determine whether the word or phrase is in the Dictionary: if it is not, I submit it as a ‘not-in’, and if it is, I decide whether its form or context is important enough to warrant its submission. If it does qualify, I enter the information into tagged fields in an electronic file that has been set up in a standard format. When I have finished the reading, I submit the file to Oxford or New York, where the records are incorporated into OED‘s working database for consideration by the editors, along with thousands of paper citation slips, as they proceed through the current revision. Yes, some of my finds are still submitted as paper slips—a reminder of OED‘s long heritage—but, electronic or paper, I can hardly imagine a better job.




                    The quotations were collected in a machine readable format for the first time in 1989. The 1990 UK Reading Programme captured material electronically. (Note that the second edition of the Oxford English Dictionary came out in 1989.)



                    In addition to this, the OED now utilizes several online databases of texts, such as Early English Books Online, Eighteenth Century Collections Online, and some newspaper databases.



                    If you do your own research with databases (many people use the free Google Books), it's often easy to beat pages that haven't been updated for the third edition of the OED. Updates to the OED3 started in 2000 and continue to this day: it's a huge dictionary and updating takes time.



                    See also:




                    • OED: Researching the Language






                    share|improve this answer
























                    • I'm asking specifically because gbooks is not satisfying, not the least because OCR fails for old books. I take it that "you don't ever really know for certain" counts for lost copies, as well as otherwise unread/not-scanned books. "hundreds of volunteers" pale in comparison to the amount of available books.

                      – vectory
                      17 mins ago
















                    2














                    Typically, you don't ever really know for certain that you have the earliest example. Or even the earliest written example. It's just the best so far.



                    (As a person who frequently writes answers to etymology questions on ELU, I try to make this clear. "According to the OED", "according to my own research", "dates at least back to X" are all things I say, but I sometimes get sloppy and don't do this all the time.)



                    There are some exceptions. We can be very certain that "cromulent", for example, was coined in 1996 (or '95 depending on when the episode of the Simpsons was written).





                    How etymological research is done has varied through time. In the case of the "New English Dictionary" (the first edition of the Oxford English Dictionary), work started on it in 1857. Then:




                    [I]n January 1859, the Society issued their 'Proposal for the publication of a New English Dictionary,' in which the characteristics of the proposed work were explained, and an appeal made to the English and American public to assist in collecting the raw materials for the work, these materials consisting of quotations illustrating the use of English words by all writers of all ages and in all senses, each quotation being made on a uniform plan on a half-sheet of notepaper, that they might in due course be arranged and classified alphabetically and by meanings. This Appeal met with a generous response: some hundreds of volunteers began to read books, make quotations, and send in their slips to 'sub-editors,' who volunteered each to take charge of a letter or part of one, and by whom the slips were in tum further arranged, classified, and to some extent used as the basis of definitions and skeleton schemes of the meanings of words in preparation for the Dictionary.
                    An Appeal to the English-Speaking and English-Reading Public to Read Books and Make Extracts for The Philological Society's New English Dictionary




                    The "Reading Programme" is still used by the OED, although the methodology is different. The books are still read all the same but here's what happens next according to a freelance researcher for the OED:




                    I then consult OED Online to determine whether the word or phrase is in the Dictionary: if it is not, I submit it as a ‘not-in’, and if it is, I decide whether its form or context is important enough to warrant its submission. If it does qualify, I enter the information into tagged fields in an electronic file that has been set up in a standard format. When I have finished the reading, I submit the file to Oxford or New York, where the records are incorporated into OED‘s working database for consideration by the editors, along with thousands of paper citation slips, as they proceed through the current revision. Yes, some of my finds are still submitted as paper slips—a reminder of OED‘s long heritage—but, electronic or paper, I can hardly imagine a better job.




                    The quotations were collected in a machine readable format for the first time in 1989. The 1990 UK Reading Programme captured material electronically. (Note that the second edition of the Oxford English Dictionary came out in 1989.)



                    In addition to this, the OED now utilizes several online databases of texts, such as Early English Books Online, Eighteenth Century Collections Online, and some newspaper databases.



                    If you do your own research with databases (many people use the free Google Books), it's often easy to beat pages that haven't been updated for the third edition of the OED. Updates to the OED3 started in 2000 and continue to this day: it's a huge dictionary and updating takes time.



                    See also:




                    • OED: Researching the Language






                    share|improve this answer
























                    • I'm asking specifically because gbooks is not satisfying, not the least because OCR fails for old books. I take it that "you don't ever really know for certain" counts for lost copies, as well as otherwise unread/not-scanned books. "hundreds of volunteers" pale in comparison to the amount of available books.

                      – vectory
                      17 mins ago














                    2












                    2








                    2







                    Typically, you don't ever really know for certain that you have the earliest example. Or even the earliest written example. It's just the best so far.



                    (As a person who frequently writes answers to etymology questions on ELU, I try to make this clear. "According to the OED", "according to my own research", "dates at least back to X" are all things I say, but I sometimes get sloppy and don't do this all the time.)



                    There are some exceptions. We can be very certain that "cromulent", for example, was coined in 1996 (or '95 depending on when the episode of the Simpsons was written).





                    How etymological research is done has varied through time. In the case of the "New English Dictionary" (the first edition of the Oxford English Dictionary), work started on it in 1857. Then:




                    [I]n January 1859, the Society issued their 'Proposal for the publication of a New English Dictionary,' in which the characteristics of the proposed work were explained, and an appeal made to the English and American public to assist in collecting the raw materials for the work, these materials consisting of quotations illustrating the use of English words by all writers of all ages and in all senses, each quotation being made on a uniform plan on a half-sheet of notepaper, that they might in due course be arranged and classified alphabetically and by meanings. This Appeal met with a generous response: some hundreds of volunteers began to read books, make quotations, and send in their slips to 'sub-editors,' who volunteered each to take charge of a letter or part of one, and by whom the slips were in tum further arranged, classified, and to some extent used as the basis of definitions and skeleton schemes of the meanings of words in preparation for the Dictionary.
                    An Appeal to the English-Speaking and English-Reading Public to Read Books and Make Extracts for The Philological Society's New English Dictionary




                    The "Reading Programme" is still used by the OED, although the methodology is different. The books are still read all the same but here's what happens next according to a freelance researcher for the OED:




                    I then consult OED Online to determine whether the word or phrase is in the Dictionary: if it is not, I submit it as a ‘not-in’, and if it is, I decide whether its form or context is important enough to warrant its submission. If it does qualify, I enter the information into tagged fields in an electronic file that has been set up in a standard format. When I have finished the reading, I submit the file to Oxford or New York, where the records are incorporated into OED‘s working database for consideration by the editors, along with thousands of paper citation slips, as they proceed through the current revision. Yes, some of my finds are still submitted as paper slips—a reminder of OED‘s long heritage—but, electronic or paper, I can hardly imagine a better job.




                    The quotations were collected in a machine readable format for the first time in 1989. The 1990 UK Reading Programme captured material electronically. (Note that the second edition of the Oxford English Dictionary came out in 1989.)



                    In addition to this, the OED now utilizes several online databases of texts, such as Early English Books Online, Eighteenth Century Collections Online, and some newspaper databases.



                    If you do your own research with databases (many people use the free Google Books), it's often easy to beat pages that haven't been updated for the third edition of the OED. Updates to the OED3 started in 2000 and continue to this day: it's a huge dictionary and updating takes time.



                    See also:




                    • OED: Researching the Language






                    share|improve this answer













                    Typically, you don't ever really know for certain that you have the earliest example. Or even the earliest written example. It's just the best so far.



                    (As a person who frequently writes answers to etymology questions on ELU, I try to make this clear. "According to the OED", "according to my own research", "dates at least back to X" are all things I say, but I sometimes get sloppy and don't do this all the time.)



                    There are some exceptions. We can be very certain that "cromulent", for example, was coined in 1996 (or '95 depending on when the episode of the Simpsons was written).





                    How etymological research is done has varied through time. In the case of the "New English Dictionary" (the first edition of the Oxford English Dictionary), work started on it in 1857. Then:




                    [I]n January 1859, the Society issued their 'Proposal for the publication of a New English Dictionary,' in which the characteristics of the proposed work were explained, and an appeal made to the English and American public to assist in collecting the raw materials for the work, these materials consisting of quotations illustrating the use of English words by all writers of all ages and in all senses, each quotation being made on a uniform plan on a half-sheet of notepaper, that they might in due course be arranged and classified alphabetically and by meanings. This Appeal met with a generous response: some hundreds of volunteers began to read books, make quotations, and send in their slips to 'sub-editors,' who volunteered each to take charge of a letter or part of one, and by whom the slips were in tum further arranged, classified, and to some extent used as the basis of definitions and skeleton schemes of the meanings of words in preparation for the Dictionary.
                    An Appeal to the English-Speaking and English-Reading Public to Read Books and Make Extracts for The Philological Society's New English Dictionary




                    The "Reading Programme" is still used by the OED, although the methodology is different. The books are still read all the same but here's what happens next according to a freelance researcher for the OED:




                    I then consult OED Online to determine whether the word or phrase is in the Dictionary: if it is not, I submit it as a ‘not-in’, and if it is, I decide whether its form or context is important enough to warrant its submission. If it does qualify, I enter the information into tagged fields in an electronic file that has been set up in a standard format. When I have finished the reading, I submit the file to Oxford or New York, where the records are incorporated into OED‘s working database for consideration by the editors, along with thousands of paper citation slips, as they proceed through the current revision. Yes, some of my finds are still submitted as paper slips—a reminder of OED‘s long heritage—but, electronic or paper, I can hardly imagine a better job.




                    The quotations were collected in a machine readable format for the first time in 1989. The 1990 UK Reading Programme captured material electronically. (Note that the second edition of the Oxford English Dictionary came out in 1989.)



                    In addition to this, the OED now utilizes several online databases of texts, such as Early English Books Online, Eighteenth Century Collections Online, and some newspaper databases.



                    If you do your own research with databases (many people use the free Google Books), it's often easy to beat pages that haven't been updated for the third edition of the OED. Updates to the OED3 started in 2000 and continue to this day: it's a huge dictionary and updating takes time.



                    See also:




                    • OED: Researching the Language







                    share|improve this answer












                    share|improve this answer



                    share|improve this answer










                    answered 2 hours ago









                    LaurelLaurel

                    2915




                    2915













                    • I'm asking specifically because gbooks is not satisfying, not the least because OCR fails for old books. I take it that "you don't ever really know for certain" counts for lost copies, as well as otherwise unread/not-scanned books. "hundreds of volunteers" pale in comparison to the amount of available books.

                      – vectory
                      17 mins ago



















                    • I'm asking specifically because gbooks is not satisfying, not the least because OCR fails for old books. I take it that "you don't ever really know for certain" counts for lost copies, as well as otherwise unread/not-scanned books. "hundreds of volunteers" pale in comparison to the amount of available books.

                      – vectory
                      17 mins ago

















                    I'm asking specifically because gbooks is not satisfying, not the least because OCR fails for old books. I take it that "you don't ever really know for certain" counts for lost copies, as well as otherwise unread/not-scanned books. "hundreds of volunteers" pale in comparison to the amount of available books.

                    – vectory
                    17 mins ago





                    I'm asking specifically because gbooks is not satisfying, not the least because OCR fails for old books. I take it that "you don't ever really know for certain" counts for lost copies, as well as otherwise unread/not-scanned books. "hundreds of volunteers" pale in comparison to the amount of available books.

                    – vectory
                    17 mins ago


















                    draft saved

                    draft discarded




















































                    Thanks for contributing an answer to Linguistics Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2flinguistics.stackexchange.com%2fquestions%2f30661%2fhow-do-dictionaries-source-attestation%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Callistus I

                    Tabula Rosettana

                    How to label and detect the document text images