Getting the upper and lower quartiles in data with an even number of observations, or where the quartile...












0














I want to draw a box plot, which requires that I know the median, the lower and upper quartiles, and the minimum and maximum values of my data.



I understand that the quartiles are simply the value on certainly "percentile" of the cumulative frequency of the data.



So lower quartile = the value of the observation on the 25th percentile of the data.
Now my question (for AQA GCSE prep) is - what if taking 25% of my data ends up in a decimal number, let's say, $3.5$. And my data consists of classes in a grouped frequency table. And two of my classes are:



$ class 1$ || $ 2 <= h < 3.5$



$ class 2$ || $ 3.5 <= h < 5$



So when I take 25% of 3.5 falls in between two classes. Which value should I choose as the lower quartile? Should it be $class 1$, or $class 2$? Should my rounding of 3.5 be the same as regular rounding is done, i.e. just rounding up to 4 (hence selecting $class 2$)? Or should I round choose $class 1$ for some reason?










share|cite|improve this question














bumped to the homepage by Community yesterday


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.




















    0














    I want to draw a box plot, which requires that I know the median, the lower and upper quartiles, and the minimum and maximum values of my data.



    I understand that the quartiles are simply the value on certainly "percentile" of the cumulative frequency of the data.



    So lower quartile = the value of the observation on the 25th percentile of the data.
    Now my question (for AQA GCSE prep) is - what if taking 25% of my data ends up in a decimal number, let's say, $3.5$. And my data consists of classes in a grouped frequency table. And two of my classes are:



    $ class 1$ || $ 2 <= h < 3.5$



    $ class 2$ || $ 3.5 <= h < 5$



    So when I take 25% of 3.5 falls in between two classes. Which value should I choose as the lower quartile? Should it be $class 1$, or $class 2$? Should my rounding of 3.5 be the same as regular rounding is done, i.e. just rounding up to 4 (hence selecting $class 2$)? Or should I round choose $class 1$ for some reason?










    share|cite|improve this question














    bumped to the homepage by Community yesterday


    This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.


















      0












      0








      0







      I want to draw a box plot, which requires that I know the median, the lower and upper quartiles, and the minimum and maximum values of my data.



      I understand that the quartiles are simply the value on certainly "percentile" of the cumulative frequency of the data.



      So lower quartile = the value of the observation on the 25th percentile of the data.
      Now my question (for AQA GCSE prep) is - what if taking 25% of my data ends up in a decimal number, let's say, $3.5$. And my data consists of classes in a grouped frequency table. And two of my classes are:



      $ class 1$ || $ 2 <= h < 3.5$



      $ class 2$ || $ 3.5 <= h < 5$



      So when I take 25% of 3.5 falls in between two classes. Which value should I choose as the lower quartile? Should it be $class 1$, or $class 2$? Should my rounding of 3.5 be the same as regular rounding is done, i.e. just rounding up to 4 (hence selecting $class 2$)? Or should I round choose $class 1$ for some reason?










      share|cite|improve this question













      I want to draw a box plot, which requires that I know the median, the lower and upper quartiles, and the minimum and maximum values of my data.



      I understand that the quartiles are simply the value on certainly "percentile" of the cumulative frequency of the data.



      So lower quartile = the value of the observation on the 25th percentile of the data.
      Now my question (for AQA GCSE prep) is - what if taking 25% of my data ends up in a decimal number, let's say, $3.5$. And my data consists of classes in a grouped frequency table. And two of my classes are:



      $ class 1$ || $ 2 <= h < 3.5$



      $ class 2$ || $ 3.5 <= h < 5$



      So when I take 25% of 3.5 falls in between two classes. Which value should I choose as the lower quartile? Should it be $class 1$, or $class 2$? Should my rounding of 3.5 be the same as regular rounding is done, i.e. just rounding up to 4 (hence selecting $class 2$)? Or should I round choose $class 1$ for some reason?







      statistics






      share|cite|improve this question













      share|cite|improve this question











      share|cite|improve this question




      share|cite|improve this question










      asked May 2 '14 at 9:46









      user961627

      187211




      187211





      bumped to the homepage by Community yesterday


      This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.







      bumped to the homepage by Community yesterday


      This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
























          3 Answers
          3






          active

          oldest

          votes


















          0














          If your data consist of the frequency of observations within each class, then the best that you can day is say that the quartile is somewhere within the class. To find out which class the quartile belongs to, you need to first figure out the index of the quartile. For example, if you have 100 observations in your dataset, then the median would be the average of the 50th and the 51st observations of your dataset. You would then need to find which class contains both the 50th and 51st observations by examining the cumulative frequencies. Perhaps the cumulative frequency for class 2 is 45 and the cumulative frequency for class 3 is 55. Then you would know that the 50th and 51st observations were in class 3, so the median would be in class 3.






          share|cite|improve this answer





























            0














            If taking the $25$th percentile of the data gives you the value $3.5$,
            then the value of the first quartile is $3.5$.
            We would usually expect that the number of observations in each
            of your frequency classes is a whole number, but
            there is nothing wrong with having a quartile or median value that
            is not a whole number, so further rounding is neither needed nor desired.






            share|cite|improve this answer





























              0














              Intro textbooks tend to use one of two different methods for boxplot/five number summary construction, in my experience.



              The first method is to apply your percentile to the total number of observations; for example, the first quartile of 14 data points ordered least to greatest is the data value located in position $14*(0.25)= 3.5;$ that is, your fourth quartile is the value in the "3.5th place."



              Well, there's no such thing as the "three point fifth place," so the first method has you round up to the nearest larger integer; this means you round this "place value calculation" up when you have a nonzero decimal in your computation (even if normal rounding rules would have you round down). In this case, 3.5 rounds up to 4, so the first quartile is whatever data value is in 4th place in the list of 14 data values ordered least to greatest. This method, if defined this way in your text, is good to use for finding any percentile: for instance, the 80th percentile is $(.8)(14) = 11.2,$ or the 12th place, and the median or 50th percentile is $(.5)(14) = 7,$ which lacks a nonzero fractional part, indicating that the median is to be taken as the average of the 7th place and its "next door neighbor," the data value in 8th place.



              The second method is slightly different; after finding median, you find the first quartile by taking the median of the values to the left of your median placement in your ordered list and you find the third quartile by taking the median of the data place values beyond the place value of your median. In this case, the boxplot for your 14-point data set would not change, but if it had, say, 13 elements instead, the median would be in place value $(0.5)(13) = 6.5$ or the 7th place, the first quartile is the median, then, of the first six values in the ordered list, meaning it is the average of the 3rd and 4th values in the ordered list (in the previous paragraph, the quartile would have been the value in place number $13*.25 = 3.25 to 4$th place).



              Double-check your text to see which applies to you, but it is likely one of these two ways unless I am expressing something in error above.






              share|cite|improve this answer





















                Your Answer





                StackExchange.ifUsing("editor", function () {
                return StackExchange.using("mathjaxEditing", function () {
                StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
                StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
                });
                });
                }, "mathjax-editing");

                StackExchange.ready(function() {
                var channelOptions = {
                tags: "".split(" "),
                id: "69"
                };
                initTagRenderer("".split(" "), "".split(" "), channelOptions);

                StackExchange.using("externalEditor", function() {
                // Have to fire editor after snippets, if snippets enabled
                if (StackExchange.settings.snippets.snippetsEnabled) {
                StackExchange.using("snippets", function() {
                createEditor();
                });
                }
                else {
                createEditor();
                }
                });

                function createEditor() {
                StackExchange.prepareEditor({
                heartbeatType: 'answer',
                autoActivateHeartbeat: false,
                convertImagesToLinks: true,
                noModals: true,
                showLowRepImageUploadWarning: true,
                reputationToPostImages: 10,
                bindNavPrevention: true,
                postfix: "",
                imageUploader: {
                brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
                contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
                allowUrls: true
                },
                noCode: true, onDemand: true,
                discardSelector: ".discard-answer"
                ,immediatelyShowMarkdownHelp:true
                });


                }
                });














                draft saved

                draft discarded


















                StackExchange.ready(
                function () {
                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f778140%2fgetting-the-upper-and-lower-quartiles-in-data-with-an-even-number-of-observation%23new-answer', 'question_page');
                }
                );

                Post as a guest















                Required, but never shown

























                3 Answers
                3






                active

                oldest

                votes








                3 Answers
                3






                active

                oldest

                votes









                active

                oldest

                votes






                active

                oldest

                votes









                0














                If your data consist of the frequency of observations within each class, then the best that you can day is say that the quartile is somewhere within the class. To find out which class the quartile belongs to, you need to first figure out the index of the quartile. For example, if you have 100 observations in your dataset, then the median would be the average of the 50th and the 51st observations of your dataset. You would then need to find which class contains both the 50th and 51st observations by examining the cumulative frequencies. Perhaps the cumulative frequency for class 2 is 45 and the cumulative frequency for class 3 is 55. Then you would know that the 50th and 51st observations were in class 3, so the median would be in class 3.






                share|cite|improve this answer


























                  0














                  If your data consist of the frequency of observations within each class, then the best that you can day is say that the quartile is somewhere within the class. To find out which class the quartile belongs to, you need to first figure out the index of the quartile. For example, if you have 100 observations in your dataset, then the median would be the average of the 50th and the 51st observations of your dataset. You would then need to find which class contains both the 50th and 51st observations by examining the cumulative frequencies. Perhaps the cumulative frequency for class 2 is 45 and the cumulative frequency for class 3 is 55. Then you would know that the 50th and 51st observations were in class 3, so the median would be in class 3.






                  share|cite|improve this answer
























                    0












                    0








                    0






                    If your data consist of the frequency of observations within each class, then the best that you can day is say that the quartile is somewhere within the class. To find out which class the quartile belongs to, you need to first figure out the index of the quartile. For example, if you have 100 observations in your dataset, then the median would be the average of the 50th and the 51st observations of your dataset. You would then need to find which class contains both the 50th and 51st observations by examining the cumulative frequencies. Perhaps the cumulative frequency for class 2 is 45 and the cumulative frequency for class 3 is 55. Then you would know that the 50th and 51st observations were in class 3, so the median would be in class 3.






                    share|cite|improve this answer












                    If your data consist of the frequency of observations within each class, then the best that you can day is say that the quartile is somewhere within the class. To find out which class the quartile belongs to, you need to first figure out the index of the quartile. For example, if you have 100 observations in your dataset, then the median would be the average of the 50th and the 51st observations of your dataset. You would then need to find which class contains both the 50th and 51st observations by examining the cumulative frequencies. Perhaps the cumulative frequency for class 2 is 45 and the cumulative frequency for class 3 is 55. Then you would know that the 50th and 51st observations were in class 3, so the median would be in class 3.







                    share|cite|improve this answer












                    share|cite|improve this answer



                    share|cite|improve this answer










                    answered May 3 '14 at 5:23









                    jsk

                    51326




                    51326























                        0














                        If taking the $25$th percentile of the data gives you the value $3.5$,
                        then the value of the first quartile is $3.5$.
                        We would usually expect that the number of observations in each
                        of your frequency classes is a whole number, but
                        there is nothing wrong with having a quartile or median value that
                        is not a whole number, so further rounding is neither needed nor desired.






                        share|cite|improve this answer


























                          0














                          If taking the $25$th percentile of the data gives you the value $3.5$,
                          then the value of the first quartile is $3.5$.
                          We would usually expect that the number of observations in each
                          of your frequency classes is a whole number, but
                          there is nothing wrong with having a quartile or median value that
                          is not a whole number, so further rounding is neither needed nor desired.






                          share|cite|improve this answer
























                            0












                            0








                            0






                            If taking the $25$th percentile of the data gives you the value $3.5$,
                            then the value of the first quartile is $3.5$.
                            We would usually expect that the number of observations in each
                            of your frequency classes is a whole number, but
                            there is nothing wrong with having a quartile or median value that
                            is not a whole number, so further rounding is neither needed nor desired.






                            share|cite|improve this answer












                            If taking the $25$th percentile of the data gives you the value $3.5$,
                            then the value of the first quartile is $3.5$.
                            We would usually expect that the number of observations in each
                            of your frequency classes is a whole number, but
                            there is nothing wrong with having a quartile or median value that
                            is not a whole number, so further rounding is neither needed nor desired.







                            share|cite|improve this answer












                            share|cite|improve this answer



                            share|cite|improve this answer










                            answered Jul 16 '16 at 3:00









                            David K

                            52.7k340115




                            52.7k340115























                                0














                                Intro textbooks tend to use one of two different methods for boxplot/five number summary construction, in my experience.



                                The first method is to apply your percentile to the total number of observations; for example, the first quartile of 14 data points ordered least to greatest is the data value located in position $14*(0.25)= 3.5;$ that is, your fourth quartile is the value in the "3.5th place."



                                Well, there's no such thing as the "three point fifth place," so the first method has you round up to the nearest larger integer; this means you round this "place value calculation" up when you have a nonzero decimal in your computation (even if normal rounding rules would have you round down). In this case, 3.5 rounds up to 4, so the first quartile is whatever data value is in 4th place in the list of 14 data values ordered least to greatest. This method, if defined this way in your text, is good to use for finding any percentile: for instance, the 80th percentile is $(.8)(14) = 11.2,$ or the 12th place, and the median or 50th percentile is $(.5)(14) = 7,$ which lacks a nonzero fractional part, indicating that the median is to be taken as the average of the 7th place and its "next door neighbor," the data value in 8th place.



                                The second method is slightly different; after finding median, you find the first quartile by taking the median of the values to the left of your median placement in your ordered list and you find the third quartile by taking the median of the data place values beyond the place value of your median. In this case, the boxplot for your 14-point data set would not change, but if it had, say, 13 elements instead, the median would be in place value $(0.5)(13) = 6.5$ or the 7th place, the first quartile is the median, then, of the first six values in the ordered list, meaning it is the average of the 3rd and 4th values in the ordered list (in the previous paragraph, the quartile would have been the value in place number $13*.25 = 3.25 to 4$th place).



                                Double-check your text to see which applies to you, but it is likely one of these two ways unless I am expressing something in error above.






                                share|cite|improve this answer


























                                  0














                                  Intro textbooks tend to use one of two different methods for boxplot/five number summary construction, in my experience.



                                  The first method is to apply your percentile to the total number of observations; for example, the first quartile of 14 data points ordered least to greatest is the data value located in position $14*(0.25)= 3.5;$ that is, your fourth quartile is the value in the "3.5th place."



                                  Well, there's no such thing as the "three point fifth place," so the first method has you round up to the nearest larger integer; this means you round this "place value calculation" up when you have a nonzero decimal in your computation (even if normal rounding rules would have you round down). In this case, 3.5 rounds up to 4, so the first quartile is whatever data value is in 4th place in the list of 14 data values ordered least to greatest. This method, if defined this way in your text, is good to use for finding any percentile: for instance, the 80th percentile is $(.8)(14) = 11.2,$ or the 12th place, and the median or 50th percentile is $(.5)(14) = 7,$ which lacks a nonzero fractional part, indicating that the median is to be taken as the average of the 7th place and its "next door neighbor," the data value in 8th place.



                                  The second method is slightly different; after finding median, you find the first quartile by taking the median of the values to the left of your median placement in your ordered list and you find the third quartile by taking the median of the data place values beyond the place value of your median. In this case, the boxplot for your 14-point data set would not change, but if it had, say, 13 elements instead, the median would be in place value $(0.5)(13) = 6.5$ or the 7th place, the first quartile is the median, then, of the first six values in the ordered list, meaning it is the average of the 3rd and 4th values in the ordered list (in the previous paragraph, the quartile would have been the value in place number $13*.25 = 3.25 to 4$th place).



                                  Double-check your text to see which applies to you, but it is likely one of these two ways unless I am expressing something in error above.






                                  share|cite|improve this answer
























                                    0












                                    0








                                    0






                                    Intro textbooks tend to use one of two different methods for boxplot/five number summary construction, in my experience.



                                    The first method is to apply your percentile to the total number of observations; for example, the first quartile of 14 data points ordered least to greatest is the data value located in position $14*(0.25)= 3.5;$ that is, your fourth quartile is the value in the "3.5th place."



                                    Well, there's no such thing as the "three point fifth place," so the first method has you round up to the nearest larger integer; this means you round this "place value calculation" up when you have a nonzero decimal in your computation (even if normal rounding rules would have you round down). In this case, 3.5 rounds up to 4, so the first quartile is whatever data value is in 4th place in the list of 14 data values ordered least to greatest. This method, if defined this way in your text, is good to use for finding any percentile: for instance, the 80th percentile is $(.8)(14) = 11.2,$ or the 12th place, and the median or 50th percentile is $(.5)(14) = 7,$ which lacks a nonzero fractional part, indicating that the median is to be taken as the average of the 7th place and its "next door neighbor," the data value in 8th place.



                                    The second method is slightly different; after finding median, you find the first quartile by taking the median of the values to the left of your median placement in your ordered list and you find the third quartile by taking the median of the data place values beyond the place value of your median. In this case, the boxplot for your 14-point data set would not change, but if it had, say, 13 elements instead, the median would be in place value $(0.5)(13) = 6.5$ or the 7th place, the first quartile is the median, then, of the first six values in the ordered list, meaning it is the average of the 3rd and 4th values in the ordered list (in the previous paragraph, the quartile would have been the value in place number $13*.25 = 3.25 to 4$th place).



                                    Double-check your text to see which applies to you, but it is likely one of these two ways unless I am expressing something in error above.






                                    share|cite|improve this answer












                                    Intro textbooks tend to use one of two different methods for boxplot/five number summary construction, in my experience.



                                    The first method is to apply your percentile to the total number of observations; for example, the first quartile of 14 data points ordered least to greatest is the data value located in position $14*(0.25)= 3.5;$ that is, your fourth quartile is the value in the "3.5th place."



                                    Well, there's no such thing as the "three point fifth place," so the first method has you round up to the nearest larger integer; this means you round this "place value calculation" up when you have a nonzero decimal in your computation (even if normal rounding rules would have you round down). In this case, 3.5 rounds up to 4, so the first quartile is whatever data value is in 4th place in the list of 14 data values ordered least to greatest. This method, if defined this way in your text, is good to use for finding any percentile: for instance, the 80th percentile is $(.8)(14) = 11.2,$ or the 12th place, and the median or 50th percentile is $(.5)(14) = 7,$ which lacks a nonzero fractional part, indicating that the median is to be taken as the average of the 7th place and its "next door neighbor," the data value in 8th place.



                                    The second method is slightly different; after finding median, you find the first quartile by taking the median of the values to the left of your median placement in your ordered list and you find the third quartile by taking the median of the data place values beyond the place value of your median. In this case, the boxplot for your 14-point data set would not change, but if it had, say, 13 elements instead, the median would be in place value $(0.5)(13) = 6.5$ or the 7th place, the first quartile is the median, then, of the first six values in the ordered list, meaning it is the average of the 3rd and 4th values in the ordered list (in the previous paragraph, the quartile would have been the value in place number $13*.25 = 3.25 to 4$th place).



                                    Double-check your text to see which applies to you, but it is likely one of these two ways unless I am expressing something in error above.







                                    share|cite|improve this answer












                                    share|cite|improve this answer



                                    share|cite|improve this answer










                                    answered Feb 23 '17 at 22:53









                                    Thomas Rasberry

                                    531211




                                    531211






























                                        draft saved

                                        draft discarded




















































                                        Thanks for contributing an answer to Mathematics Stack Exchange!


                                        • Please be sure to answer the question. Provide details and share your research!

                                        But avoid



                                        • Asking for help, clarification, or responding to other answers.

                                        • Making statements based on opinion; back them up with references or personal experience.


                                        Use MathJax to format equations. MathJax reference.


                                        To learn more, see our tips on writing great answers.





                                        Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


                                        Please pay close attention to the following guidance:


                                        • Please be sure to answer the question. Provide details and share your research!

                                        But avoid



                                        • Asking for help, clarification, or responding to other answers.

                                        • Making statements based on opinion; back them up with references or personal experience.


                                        To learn more, see our tips on writing great answers.




                                        draft saved


                                        draft discarded














                                        StackExchange.ready(
                                        function () {
                                        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f778140%2fgetting-the-upper-and-lower-quartiles-in-data-with-an-even-number-of-observation%23new-answer', 'question_page');
                                        }
                                        );

                                        Post as a guest















                                        Required, but never shown





















































                                        Required, but never shown














                                        Required, but never shown












                                        Required, but never shown







                                        Required, but never shown

































                                        Required, but never shown














                                        Required, but never shown












                                        Required, but never shown







                                        Required, but never shown







                                        Popular posts from this blog

                                        Mario Kart Wii

                                        What does “Dominus providebit” mean?

                                        Antonio Litta Visconti Arese