plot multiple strings from a dataframe












1












$begingroup$


I am trying to read and plot several files which looks like as below when I open with python using pandas read.table. The columns to plot are "dev" vs timestamp.



The timestamp should be compose from the columns; mm dd,time and yyyy.



    mm dd time yyyy dev st fault typ
0 Jul 5 2:48:29 2018 aaa STANDBY HRW_FAULT neg
1 Jul 5 2:48:29 2018 aaa SOFT SWF_FAULT ack
2 Jul 5 2:48:29 2018 aaa HARDWARE disable
3 Jul 5 2:50:47 2018 bbb STANDBY HRW_FAULT pos


...............










share|improve this question











$endgroup$




bumped to the homepage by Community yesterday


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.











  • 1




    $begingroup$
    Where's the problem/traceback if any? Read about pandas date time indexing and just create a temp df of the datetime stamp and the desired column maybe..
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:24












  • $begingroup$
    Welcome Aboard to DSE!
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:43
















1












$begingroup$


I am trying to read and plot several files which looks like as below when I open with python using pandas read.table. The columns to plot are "dev" vs timestamp.



The timestamp should be compose from the columns; mm dd,time and yyyy.



    mm dd time yyyy dev st fault typ
0 Jul 5 2:48:29 2018 aaa STANDBY HRW_FAULT neg
1 Jul 5 2:48:29 2018 aaa SOFT SWF_FAULT ack
2 Jul 5 2:48:29 2018 aaa HARDWARE disable
3 Jul 5 2:50:47 2018 bbb STANDBY HRW_FAULT pos


...............










share|improve this question











$endgroup$




bumped to the homepage by Community yesterday


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.











  • 1




    $begingroup$
    Where's the problem/traceback if any? Read about pandas date time indexing and just create a temp df of the datetime stamp and the desired column maybe..
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:24












  • $begingroup$
    Welcome Aboard to DSE!
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:43














1












1








1





$begingroup$


I am trying to read and plot several files which looks like as below when I open with python using pandas read.table. The columns to plot are "dev" vs timestamp.



The timestamp should be compose from the columns; mm dd,time and yyyy.



    mm dd time yyyy dev st fault typ
0 Jul 5 2:48:29 2018 aaa STANDBY HRW_FAULT neg
1 Jul 5 2:48:29 2018 aaa SOFT SWF_FAULT ack
2 Jul 5 2:48:29 2018 aaa HARDWARE disable
3 Jul 5 2:50:47 2018 bbb STANDBY HRW_FAULT pos


...............










share|improve this question











$endgroup$




I am trying to read and plot several files which looks like as below when I open with python using pandas read.table. The columns to plot are "dev" vs timestamp.



The timestamp should be compose from the columns; mm dd,time and yyyy.



    mm dd time yyyy dev st fault typ
0 Jul 5 2:48:29 2018 aaa STANDBY HRW_FAULT neg
1 Jul 5 2:48:29 2018 aaa SOFT SWF_FAULT ack
2 Jul 5 2:48:29 2018 aaa HARDWARE disable
3 Jul 5 2:50:47 2018 bbb STANDBY HRW_FAULT pos


...............







pandas visualization






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Sep 11 '18 at 1:23









n1k31t4

6,5112421




6,5112421










asked Sep 11 '18 at 0:21









lucialucia

111




111





bumped to the homepage by Community yesterday


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.







bumped to the homepage by Community yesterday


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.










  • 1




    $begingroup$
    Where's the problem/traceback if any? Read about pandas date time indexing and just create a temp df of the datetime stamp and the desired column maybe..
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:24












  • $begingroup$
    Welcome Aboard to DSE!
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:43














  • 1




    $begingroup$
    Where's the problem/traceback if any? Read about pandas date time indexing and just create a temp df of the datetime stamp and the desired column maybe..
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:24












  • $begingroup$
    Welcome Aboard to DSE!
    $endgroup$
    – Aditya
    Sep 11 '18 at 1:43








1




1




$begingroup$
Where's the problem/traceback if any? Read about pandas date time indexing and just create a temp df of the datetime stamp and the desired column maybe..
$endgroup$
– Aditya
Sep 11 '18 at 1:24






$begingroup$
Where's the problem/traceback if any? Read about pandas date time indexing and just create a temp df of the datetime stamp and the desired column maybe..
$endgroup$
– Aditya
Sep 11 '18 at 1:24














$begingroup$
Welcome Aboard to DSE!
$endgroup$
– Aditya
Sep 11 '18 at 1:43




$begingroup$
Welcome Aboard to DSE!
$endgroup$
– Aditya
Sep 11 '18 at 1:43










2 Answers
2






active

oldest

votes


















0












$begingroup$

df['DateTime'] = df[['Year', 'Month', 'Day', 'Hour']].apply(lambda s : datetime.datetime(*s),axis = 1)


This might pop some error depending on the fact that that whether you have hour properly defined like it should be in between 0-23 , seconds in-between 0-59 etc...



Or try this



You can pass only the columns that you need to assemble.



In [33]: pd.to_datetime(df[['year', 'month', 'day']])
Out[33]:
0 2015-02-04
1 2016-03-05
dtype: datetime64[ns


]



From the docs itself,



pd.to_datetime looks for standard designations of the datetime component in the column names, including:




  • required: year, month, day

  • optional: hour, minute, second, millisecond, microsecond, nanosecond


Also Another Approach is that Since it's probably a .csv file, you can do this at the time of reading/parsing the file itself, pandas is very intelligent and interesting!



This is done by passing in the column names as a list([year, month, day, time ]) to the parameter parse_dates, or via infer_datetime_format to True or by using a custom date_parser in the function pd.read_csv()>...



Hope this helps!






share|improve this answer











$endgroup$





















    0












    $begingroup$

    @Aditya's answer is already very good. I just want to contribute an alternative.



    We can consider that pandas pd.to_datetime has a built in parser that can take as input a datetime string. We can construct these strings to look like a standard




    5/JUl/2018 T2:48:29




    This is a standard format and is well understood by pandas.



    This following dataframe



    data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
    'dd': ['5', '5', '5', '5'],
    'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
    'yyyy': ['2018', '2018', '2018', '2018']}
    df = pd.DataFrame(data)


    enter image description here



    We can get the desired column by doing



    df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' + df['yyyy'] + ' T' + df['time'])


    enter image description here



    We can also verify the types using



    df.dtypes



    dd object
    mm

    object
    time object
    yyyy

    object
    formatted_datetime datetime64[ns]
    dtype: object






    Plotting the columns



    You will notice that when you try to plot this df you will get an error, this is because matplotlib cannot handle datetime objects in their plot function. They do have an alternative plot_date function which can be used as



    plt.plot_date(df['formatted_datetime'], df['dev'])
    plt.show()


    For example, we will recreate the same data as above with the dev column as well. We will do the same type conversion for the datetime.



    data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
    'dd': ['5', '5', '5', '5'],
    'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
    'yyyy': ['2018', '2018', '2018', '2018'],
    'dev': ['aaa', 'aaa', 'aaa', 'bbb']}

    df = pd.DataFrame(data)
    df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' +
    df['yyyy'] + ' T' + df['time'])


    Now we will convert the categorical value of dev into numerical values, we will also keep track of this conversion so that we can set them on y-axis ticks.



    df['dev'] =df['dev'].astype('category')
    categorie_codes = dict(enumerate(df['dev'].cat.categories))
    df['dev'] =df['dev'].cat.codes


    enter image description here



    Then we can plot



    plt.plot_date(df['formatted_datetime'], df['dev'])
    plt.yticks(range(len(categorie_codes)), list(categorie_codes.values()))
    plt.show()


    enter image description here






    share|improve this answer











    $endgroup$













    • $begingroup$
      Nice work, but dont you plot count of string occurrences to formatted_date?
      $endgroup$
      – krayyem
      Nov 10 '18 at 11:54












    Your Answer





    StackExchange.ifUsing("editor", function () {
    return StackExchange.using("mathjaxEditing", function () {
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    });
    });
    }, "mathjax-editing");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "557"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f38074%2fplot-multiple-strings-from-a-dataframe%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    0












    $begingroup$

    df['DateTime'] = df[['Year', 'Month', 'Day', 'Hour']].apply(lambda s : datetime.datetime(*s),axis = 1)


    This might pop some error depending on the fact that that whether you have hour properly defined like it should be in between 0-23 , seconds in-between 0-59 etc...



    Or try this



    You can pass only the columns that you need to assemble.



    In [33]: pd.to_datetime(df[['year', 'month', 'day']])
    Out[33]:
    0 2015-02-04
    1 2016-03-05
    dtype: datetime64[ns


    ]



    From the docs itself,



    pd.to_datetime looks for standard designations of the datetime component in the column names, including:




    • required: year, month, day

    • optional: hour, minute, second, millisecond, microsecond, nanosecond


    Also Another Approach is that Since it's probably a .csv file, you can do this at the time of reading/parsing the file itself, pandas is very intelligent and interesting!



    This is done by passing in the column names as a list([year, month, day, time ]) to the parameter parse_dates, or via infer_datetime_format to True or by using a custom date_parser in the function pd.read_csv()>...



    Hope this helps!






    share|improve this answer











    $endgroup$


















      0












      $begingroup$

      df['DateTime'] = df[['Year', 'Month', 'Day', 'Hour']].apply(lambda s : datetime.datetime(*s),axis = 1)


      This might pop some error depending on the fact that that whether you have hour properly defined like it should be in between 0-23 , seconds in-between 0-59 etc...



      Or try this



      You can pass only the columns that you need to assemble.



      In [33]: pd.to_datetime(df[['year', 'month', 'day']])
      Out[33]:
      0 2015-02-04
      1 2016-03-05
      dtype: datetime64[ns


      ]



      From the docs itself,



      pd.to_datetime looks for standard designations of the datetime component in the column names, including:




      • required: year, month, day

      • optional: hour, minute, second, millisecond, microsecond, nanosecond


      Also Another Approach is that Since it's probably a .csv file, you can do this at the time of reading/parsing the file itself, pandas is very intelligent and interesting!



      This is done by passing in the column names as a list([year, month, day, time ]) to the parameter parse_dates, or via infer_datetime_format to True or by using a custom date_parser in the function pd.read_csv()>...



      Hope this helps!






      share|improve this answer











      $endgroup$
















        0












        0








        0





        $begingroup$

        df['DateTime'] = df[['Year', 'Month', 'Day', 'Hour']].apply(lambda s : datetime.datetime(*s),axis = 1)


        This might pop some error depending on the fact that that whether you have hour properly defined like it should be in between 0-23 , seconds in-between 0-59 etc...



        Or try this



        You can pass only the columns that you need to assemble.



        In [33]: pd.to_datetime(df[['year', 'month', 'day']])
        Out[33]:
        0 2015-02-04
        1 2016-03-05
        dtype: datetime64[ns


        ]



        From the docs itself,



        pd.to_datetime looks for standard designations of the datetime component in the column names, including:




        • required: year, month, day

        • optional: hour, minute, second, millisecond, microsecond, nanosecond


        Also Another Approach is that Since it's probably a .csv file, you can do this at the time of reading/parsing the file itself, pandas is very intelligent and interesting!



        This is done by passing in the column names as a list([year, month, day, time ]) to the parameter parse_dates, or via infer_datetime_format to True or by using a custom date_parser in the function pd.read_csv()>...



        Hope this helps!






        share|improve this answer











        $endgroup$



        df['DateTime'] = df[['Year', 'Month', 'Day', 'Hour']].apply(lambda s : datetime.datetime(*s),axis = 1)


        This might pop some error depending on the fact that that whether you have hour properly defined like it should be in between 0-23 , seconds in-between 0-59 etc...



        Or try this



        You can pass only the columns that you need to assemble.



        In [33]: pd.to_datetime(df[['year', 'month', 'day']])
        Out[33]:
        0 2015-02-04
        1 2016-03-05
        dtype: datetime64[ns


        ]



        From the docs itself,



        pd.to_datetime looks for standard designations of the datetime component in the column names, including:




        • required: year, month, day

        • optional: hour, minute, second, millisecond, microsecond, nanosecond


        Also Another Approach is that Since it's probably a .csv file, you can do this at the time of reading/parsing the file itself, pandas is very intelligent and interesting!



        This is done by passing in the column names as a list([year, month, day, time ]) to the parameter parse_dates, or via infer_datetime_format to True or by using a custom date_parser in the function pd.read_csv()>...



        Hope this helps!







        share|improve this answer














        share|improve this answer



        share|improve this answer








        edited Sep 11 '18 at 1:42

























        answered Sep 11 '18 at 1:35









        AdityaAditya

        1,4341626




        1,4341626























            0












            $begingroup$

            @Aditya's answer is already very good. I just want to contribute an alternative.



            We can consider that pandas pd.to_datetime has a built in parser that can take as input a datetime string. We can construct these strings to look like a standard




            5/JUl/2018 T2:48:29




            This is a standard format and is well understood by pandas.



            This following dataframe



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018']}
            df = pd.DataFrame(data)


            enter image description here



            We can get the desired column by doing



            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' + df['yyyy'] + ' T' + df['time'])


            enter image description here



            We can also verify the types using



            df.dtypes



            dd object
            mm

            object
            time object
            yyyy

            object
            formatted_datetime datetime64[ns]
            dtype: object






            Plotting the columns



            You will notice that when you try to plot this df you will get an error, this is because matplotlib cannot handle datetime objects in their plot function. They do have an alternative plot_date function which can be used as



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.show()


            For example, we will recreate the same data as above with the dev column as well. We will do the same type conversion for the datetime.



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018'],
            'dev': ['aaa', 'aaa', 'aaa', 'bbb']}

            df = pd.DataFrame(data)
            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' +
            df['yyyy'] + ' T' + df['time'])


            Now we will convert the categorical value of dev into numerical values, we will also keep track of this conversion so that we can set them on y-axis ticks.



            df['dev'] =df['dev'].astype('category')
            categorie_codes = dict(enumerate(df['dev'].cat.categories))
            df['dev'] =df['dev'].cat.codes


            enter image description here



            Then we can plot



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.yticks(range(len(categorie_codes)), list(categorie_codes.values()))
            plt.show()


            enter image description here






            share|improve this answer











            $endgroup$













            • $begingroup$
              Nice work, but dont you plot count of string occurrences to formatted_date?
              $endgroup$
              – krayyem
              Nov 10 '18 at 11:54
















            0












            $begingroup$

            @Aditya's answer is already very good. I just want to contribute an alternative.



            We can consider that pandas pd.to_datetime has a built in parser that can take as input a datetime string. We can construct these strings to look like a standard




            5/JUl/2018 T2:48:29




            This is a standard format and is well understood by pandas.



            This following dataframe



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018']}
            df = pd.DataFrame(data)


            enter image description here



            We can get the desired column by doing



            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' + df['yyyy'] + ' T' + df['time'])


            enter image description here



            We can also verify the types using



            df.dtypes



            dd object
            mm

            object
            time object
            yyyy

            object
            formatted_datetime datetime64[ns]
            dtype: object






            Plotting the columns



            You will notice that when you try to plot this df you will get an error, this is because matplotlib cannot handle datetime objects in their plot function. They do have an alternative plot_date function which can be used as



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.show()


            For example, we will recreate the same data as above with the dev column as well. We will do the same type conversion for the datetime.



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018'],
            'dev': ['aaa', 'aaa', 'aaa', 'bbb']}

            df = pd.DataFrame(data)
            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' +
            df['yyyy'] + ' T' + df['time'])


            Now we will convert the categorical value of dev into numerical values, we will also keep track of this conversion so that we can set them on y-axis ticks.



            df['dev'] =df['dev'].astype('category')
            categorie_codes = dict(enumerate(df['dev'].cat.categories))
            df['dev'] =df['dev'].cat.codes


            enter image description here



            Then we can plot



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.yticks(range(len(categorie_codes)), list(categorie_codes.values()))
            plt.show()


            enter image description here






            share|improve this answer











            $endgroup$













            • $begingroup$
              Nice work, but dont you plot count of string occurrences to formatted_date?
              $endgroup$
              – krayyem
              Nov 10 '18 at 11:54














            0












            0








            0





            $begingroup$

            @Aditya's answer is already very good. I just want to contribute an alternative.



            We can consider that pandas pd.to_datetime has a built in parser that can take as input a datetime string. We can construct these strings to look like a standard




            5/JUl/2018 T2:48:29




            This is a standard format and is well understood by pandas.



            This following dataframe



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018']}
            df = pd.DataFrame(data)


            enter image description here



            We can get the desired column by doing



            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' + df['yyyy'] + ' T' + df['time'])


            enter image description here



            We can also verify the types using



            df.dtypes



            dd object
            mm

            object
            time object
            yyyy

            object
            formatted_datetime datetime64[ns]
            dtype: object






            Plotting the columns



            You will notice that when you try to plot this df you will get an error, this is because matplotlib cannot handle datetime objects in their plot function. They do have an alternative plot_date function which can be used as



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.show()


            For example, we will recreate the same data as above with the dev column as well. We will do the same type conversion for the datetime.



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018'],
            'dev': ['aaa', 'aaa', 'aaa', 'bbb']}

            df = pd.DataFrame(data)
            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' +
            df['yyyy'] + ' T' + df['time'])


            Now we will convert the categorical value of dev into numerical values, we will also keep track of this conversion so that we can set them on y-axis ticks.



            df['dev'] =df['dev'].astype('category')
            categorie_codes = dict(enumerate(df['dev'].cat.categories))
            df['dev'] =df['dev'].cat.codes


            enter image description here



            Then we can plot



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.yticks(range(len(categorie_codes)), list(categorie_codes.values()))
            plt.show()


            enter image description here






            share|improve this answer











            $endgroup$



            @Aditya's answer is already very good. I just want to contribute an alternative.



            We can consider that pandas pd.to_datetime has a built in parser that can take as input a datetime string. We can construct these strings to look like a standard




            5/JUl/2018 T2:48:29




            This is a standard format and is well understood by pandas.



            This following dataframe



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018']}
            df = pd.DataFrame(data)


            enter image description here



            We can get the desired column by doing



            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' + df['yyyy'] + ' T' + df['time'])


            enter image description here



            We can also verify the types using



            df.dtypes



            dd object
            mm

            object
            time object
            yyyy

            object
            formatted_datetime datetime64[ns]
            dtype: object






            Plotting the columns



            You will notice that when you try to plot this df you will get an error, this is because matplotlib cannot handle datetime objects in their plot function. They do have an alternative plot_date function which can be used as



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.show()


            For example, we will recreate the same data as above with the dev column as well. We will do the same type conversion for the datetime.



            data = {'mm': ['JUl', 'Jul', 'Jul', 'Jul'],
            'dd': ['5', '5', '5', '5'],
            'time': ['2:48:29', '2:48:29', '2:48:29', '2:50:47'],
            'yyyy': ['2018', '2018', '2018', '2018'],
            'dev': ['aaa', 'aaa', 'aaa', 'bbb']}

            df = pd.DataFrame(data)
            df['formatted_datetime'] = pd.to_datetime(df['dd'] + '/' + df['mm'] + '/' +
            df['yyyy'] + ' T' + df['time'])


            Now we will convert the categorical value of dev into numerical values, we will also keep track of this conversion so that we can set them on y-axis ticks.



            df['dev'] =df['dev'].astype('category')
            categorie_codes = dict(enumerate(df['dev'].cat.categories))
            df['dev'] =df['dev'].cat.codes


            enter image description here



            Then we can plot



            plt.plot_date(df['formatted_datetime'], df['dev'])
            plt.yticks(range(len(categorie_codes)), list(categorie_codes.values()))
            plt.show()


            enter image description here







            share|improve this answer














            share|improve this answer



            share|improve this answer








            edited Oct 11 '18 at 4:17

























            answered Oct 11 '18 at 4:05









            JahKnowsJahKnows

            5,277727




            5,277727












            • $begingroup$
              Nice work, but dont you plot count of string occurrences to formatted_date?
              $endgroup$
              – krayyem
              Nov 10 '18 at 11:54


















            • $begingroup$
              Nice work, but dont you plot count of string occurrences to formatted_date?
              $endgroup$
              – krayyem
              Nov 10 '18 at 11:54
















            $begingroup$
            Nice work, but dont you plot count of string occurrences to formatted_date?
            $endgroup$
            – krayyem
            Nov 10 '18 at 11:54




            $begingroup$
            Nice work, but dont you plot count of string occurrences to formatted_date?
            $endgroup$
            – krayyem
            Nov 10 '18 at 11:54


















            draft saved

            draft discarded




















































            Thanks for contributing an answer to Data Science Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f38074%2fplot-multiple-strings-from-a-dataframe%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            How to label and detect the document text images

            Tabula Rosettana

            Aureus (color)