Map list to bin numbers

Does WL have the equivalent of Matlab's discretize or NumPy's digitize? I.e., a function that takes a length-N list and a list of bin edges and returns a length-N list of bin numbers, mapping each list item to its bin number?

edited 23 hours ago

user64494

3,60911122

asked yesterday

Alan

6,6501125

$begingroup$
HistogramList seems similar. This could also be done efficiently with GroupBy and some easy little Compile-d selection determiner. Or maybe hit it first with Sort then write something that only checks the next bin up. Again, can be easily Compile-d.
$endgroup$
– b3m2a1
yesterday

$begingroup$
I need it to work like a map (in terms of the order of the items in the resulting list). Of course it is possible to write something ...
$endgroup$
– Alan
yesterday

$begingroup$
Related: 140577
$endgroup$
– Carl Woll
yesterday

1

$begingroup$
Did you try BinCounts? I guess it is what you need.
$endgroup$
– Rom38
yesterday

$begingroup$
@Rom38 You probably meant BinLists, right?
$endgroup$
– Henrik Schumacher
23 hours ago

|
show 1 more comment

edited 23 hours ago

user64494

3,60911122

asked yesterday

Alan

6,6501125

$begingroup$
HistogramList seems similar. This could also be done efficiently with GroupBy and some easy little Compile-d selection determiner. Or maybe hit it first with Sort then write something that only checks the next bin up. Again, can be easily Compile-d.
$endgroup$
– b3m2a1
yesterday

$begingroup$
I need it to work like a map (in terms of the order of the items in the resulting list). Of course it is possible to write something ...
$endgroup$
– Alan
yesterday

$begingroup$
Related: 140577
$endgroup$
– Carl Woll
yesterday

1

$begingroup$
Did you try BinCounts? I guess it is what you need.
$endgroup$
– Rom38
yesterday

$begingroup$
@Rom38 You probably meant BinLists, right?
$endgroup$
– Henrik Schumacher
23 hours ago

|
show 1 more comment

edited 23 hours ago

user64494

3,60911122

asked yesterday

Alan

6,6501125

list-manipulation data

edited 23 hours ago

user64494

3,60911122

asked yesterday

Alan

6,6501125

edited 23 hours ago

user64494

3,60911122

asked yesterday

Alan

6,6501125

edited 23 hours ago

user64494

3,60911122

edited 23 hours ago

user64494

3,60911122

edited 23 hours ago

user64494

3,60911122

asked yesterday

Alan

6,6501125

asked yesterday

Alan

6,6501125

asked yesterday

Alan

6,6501125

$begingroup$
HistogramList seems similar. This could also be done efficiently with GroupBy and some easy little Compile-d selection determiner. Or maybe hit it first with Sort then write something that only checks the next bin up. Again, can be easily Compile-d.
$endgroup$
– b3m2a1
yesterday

$begingroup$
I need it to work like a map (in terms of the order of the items in the resulting list). Of course it is possible to write something ...
$endgroup$
– Alan
yesterday

$begingroup$
Related: 140577
$endgroup$
– Carl Woll
yesterday

1

$begingroup$
Did you try BinCounts? I guess it is what you need.
$endgroup$
– Rom38
yesterday

$begingroup$
@Rom38 You probably meant BinLists, right?
$endgroup$
– Henrik Schumacher
23 hours ago

|
show 1 more comment

$begingroup$
HistogramList seems similar. This could also be done efficiently with GroupBy and some easy little Compile-d selection determiner. Or maybe hit it first with Sort then write something that only checks the next bin up. Again, can be easily Compile-d.
$endgroup$
– b3m2a1
yesterday

$begingroup$
I need it to work like a map (in terms of the order of the items in the resulting list). Of course it is possible to write something ...
$endgroup$
– Alan
yesterday

$begingroup$
Related: 140577
$endgroup$
– Carl Woll
yesterday

1

$begingroup$
Did you try BinCounts? I guess it is what you need.
$endgroup$
– Rom38
yesterday

$begingroup$
@Rom38 You probably meant BinLists, right?
$endgroup$
– Henrik Schumacher
23 hours ago

HistogramList seems similar. This could also be done efficiently with GroupBy and some easy little Compile-d selection determiner. Or maybe hit it first with Sort then write something that only checks the next bin up. Again, can be easily Compile-d.

– b3m2a1
yesterday

I need it to work like a map (in terms of the order of the items in the resulting list). Of course it is possible to write something ...

– Alan
yesterday

Related: 140577

– Carl Woll
yesterday

Did you try BinCounts? I guess it is what you need.

– Rom38
yesterday

@Rom38 You probably meant BinLists, right?

– Henrik Schumacher
23 hours ago

|
show 1 more comment

3 Answers
3

active

oldest

votes

Here's a version based on Nearest:

digitize[edges_] := DigitizeFunction[edges, Nearest[edges -> "Index"]]

digitize[data_, edges_] := digitize[edges][data]



DigitizeFunction[edges_, nf_NearestFunction][data_] := With[{init = nf[data][[All, 1]]},

    init + UnitStep[data - edges[[init]]] - 1

]

For example:

SeedRandom[1]

data = RandomReal[10, 10]

digitize[data, {2, 4, 5, 7, 8}]

{8.17389, 1.1142, 7.89526, 1.87803, 2.41361, 0.657388, 5.42247, 2.31155, 3.96006, 7.00474}

{5, 0, 4, 0, 1, 0, 3, 1, 1, 4}

Note that I broke up the definition of digitize into two pieces, so that if you do this for multiple data sets with the same edges list, you only need to compute the nearest function once.

edited yesterday

answered yesterday

Carl Woll

73.2k397191

add a comment |

This is a very quick-n-dirty, but may serve as a simple example.

This creates a piecewise function following the first definition in Matlab's discretize documentation, then applies that to the data.

disc[data_, edges_] := Module[{e = Partition[edges, 2, 1], p, l},

   l = Length@e;

   p=Piecewise[Append[Table[{i, e[[i, 1]] <= x < e[[i, 2]]}, {i, l - 1}]

                          , {l,e[[l, 1]] <= x <= e[[l, 2]]}]

                   , "NaN"];

   Table[p, {x, data}]];

From the first example in the above referenced documentation:

data={1, 1, 2, 3, 6, 5, 8, 10, 4, 4};

edges={2, 4, 6, 8, 10};



disc[data,edges]

{NaN,NaN,1,1,3,2,4,4,2,2}

I'm sure there are more efficient/elegant solutions, and will revisit as time permits.

edited 20 hours ago

answered yesterday

ciao

17.5k138109

add a comment |

You may also use Interpolation with InterpolationOrder -> 0. However, employing Nearest as Carl Woll did will usually be much faster.

First, we prepare the interplating function.

m = 20;

binboundaries = Join[{-1.}, Sort[RandomReal[{-1, 1}, m - 1]], {1.}];



f = Interpolation[Transpose[{binboundaries, Range[0, m]}], InterpolationOrder -> 0];

Now you can apply it to lists of values as follows:

vals = RandomReal[{-1, 1}, 1000];   

Round[f[vals]]

answered 22 hours ago

Henrik Schumacher

59.6k582166

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\$","\$"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "387"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmathematica.stackexchange.com%2fquestions%2f194844%2fmap-list-to-bin-numbers%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

3 Answers
3

active

oldest

votes

3 Answers
3

active

oldest

votes

Here's a version based on Nearest:

digitize[edges_] := DigitizeFunction[edges, Nearest[edges -> "Index"]]

digitize[data_, edges_] := digitize[edges][data]



DigitizeFunction[edges_, nf_NearestFunction][data_] := With[{init = nf[data][[All, 1]]},

    init + UnitStep[data - edges[[init]]] - 1

]

For example:

SeedRandom[1]

data = RandomReal[10, 10]

digitize[data, {2, 4, 5, 7, 8}]

{8.17389, 1.1142, 7.89526, 1.87803, 2.41361, 0.657388, 5.42247, 2.31155, 3.96006, 7.00474}

{5, 0, 4, 0, 1, 0, 3, 1, 1, 4}

Note that I broke up the definition of digitize into two pieces, so that if you do this for multiple data sets with the same edges list, you only need to compute the nearest function once.

edited yesterday

answered yesterday

Carl Woll

73.2k397191

add a comment |

Here's a version based on Nearest:

digitize[edges_] := DigitizeFunction[edges, Nearest[edges -> "Index"]]

digitize[data_, edges_] := digitize[edges][data]



DigitizeFunction[edges_, nf_NearestFunction][data_] := With[{init = nf[data][[All, 1]]},

    init + UnitStep[data - edges[[init]]] - 1

]

For example:

SeedRandom[1]

data = RandomReal[10, 10]

digitize[data, {2, 4, 5, 7, 8}]

{8.17389, 1.1142, 7.89526, 1.87803, 2.41361, 0.657388, 5.42247, 2.31155, 3.96006, 7.00474}

{5, 0, 4, 0, 1, 0, 3, 1, 1, 4}

Note that I broke up the definition of digitize into two pieces, so that if you do this for multiple data sets with the same edges list, you only need to compute the nearest function once.

edited yesterday

answered yesterday

Carl Woll

73.2k397191

add a comment |

Here's a version based on Nearest:

digitize[edges_] := DigitizeFunction[edges, Nearest[edges -> "Index"]]

digitize[data_, edges_] := digitize[edges][data]



DigitizeFunction[edges_, nf_NearestFunction][data_] := With[{init = nf[data][[All, 1]]},

    init + UnitStep[data - edges[[init]]] - 1

]

For example:

SeedRandom[1]

data = RandomReal[10, 10]

digitize[data, {2, 4, 5, 7, 8}]

{8.17389, 1.1142, 7.89526, 1.87803, 2.41361, 0.657388, 5.42247, 2.31155, 3.96006, 7.00474}

{5, 0, 4, 0, 1, 0, 3, 1, 1, 4}

Note that I broke up the definition of digitize into two pieces, so that if you do this for multiple data sets with the same edges list, you only need to compute the nearest function once.

edited yesterday

answered yesterday

Carl Woll

73.2k397191

Here's a version based on Nearest:

digitize[edges_] := DigitizeFunction[edges, Nearest[edges -> "Index"]]

digitize[data_, edges_] := digitize[edges][data]



DigitizeFunction[edges_, nf_NearestFunction][data_] := With[{init = nf[data][[All, 1]]},

    init + UnitStep[data - edges[[init]]] - 1

]

For example:

SeedRandom[1]

data = RandomReal[10, 10]

digitize[data, {2, 4, 5, 7, 8}]

{8.17389, 1.1142, 7.89526, 1.87803, 2.41361, 0.657388, 5.42247, 2.31155, 3.96006, 7.00474}

{5, 0, 4, 0, 1, 0, 3, 1, 1, 4}

Note that I broke up the definition of digitize into two pieces, so that if you do this for multiple data sets with the same edges list, you only need to compute the nearest function once.

edited yesterday

answered yesterday

Carl Woll

73.2k397191

edited yesterday

answered yesterday

Carl Woll

73.2k397191

answered yesterday

Carl Woll

73.2k397191

answered yesterday

Carl Woll

73.2k397191

add a comment |

This is a very quick-n-dirty, but may serve as a simple example.

This creates a piecewise function following the first definition in Matlab's discretize documentation, then applies that to the data.

disc[data_, edges_] := Module[{e = Partition[edges, 2, 1], p, l},

   l = Length@e;

   p=Piecewise[Append[Table[{i, e[[i, 1]] <= x < e[[i, 2]]}, {i, l - 1}]

                          , {l,e[[l, 1]] <= x <= e[[l, 2]]}]

                   , "NaN"];

   Table[p, {x, data}]];

From the first example in the above referenced documentation:

data={1, 1, 2, 3, 6, 5, 8, 10, 4, 4};

edges={2, 4, 6, 8, 10};



disc[data,edges]

{NaN,NaN,1,1,3,2,4,4,2,2}

I'm sure there are more efficient/elegant solutions, and will revisit as time permits.

edited 20 hours ago

answered yesterday

ciao

17.5k138109

add a comment |

This is a very quick-n-dirty, but may serve as a simple example.

This creates a piecewise function following the first definition in Matlab's discretize documentation, then applies that to the data.

disc[data_, edges_] := Module[{e = Partition[edges, 2, 1], p, l},

   l = Length@e;

   p=Piecewise[Append[Table[{i, e[[i, 1]] <= x < e[[i, 2]]}, {i, l - 1}]

                          , {l,e[[l, 1]] <= x <= e[[l, 2]]}]

                   , "NaN"];

   Table[p, {x, data}]];

From the first example in the above referenced documentation:

data={1, 1, 2, 3, 6, 5, 8, 10, 4, 4};

edges={2, 4, 6, 8, 10};



disc[data,edges]

{NaN,NaN,1,1,3,2,4,4,2,2}

I'm sure there are more efficient/elegant solutions, and will revisit as time permits.

edited 20 hours ago

answered yesterday

ciao

17.5k138109

add a comment |

This is a very quick-n-dirty, but may serve as a simple example.

This creates a piecewise function following the first definition in Matlab's discretize documentation, then applies that to the data.

disc[data_, edges_] := Module[{e = Partition[edges, 2, 1], p, l},

   l = Length@e;

   p=Piecewise[Append[Table[{i, e[[i, 1]] <= x < e[[i, 2]]}, {i, l - 1}]

                          , {l,e[[l, 1]] <= x <= e[[l, 2]]}]

                   , "NaN"];

   Table[p, {x, data}]];

From the first example in the above referenced documentation:

data={1, 1, 2, 3, 6, 5, 8, 10, 4, 4};

edges={2, 4, 6, 8, 10};



disc[data,edges]

{NaN,NaN,1,1,3,2,4,4,2,2}

I'm sure there are more efficient/elegant solutions, and will revisit as time permits.

edited 20 hours ago

answered yesterday

ciao

17.5k138109

This is a very quick-n-dirty, but may serve as a simple example.

This creates a piecewise function following the first definition in Matlab's discretize documentation, then applies that to the data.

disc[data_, edges_] := Module[{e = Partition[edges, 2, 1], p, l},

   l = Length@e;

   p=Piecewise[Append[Table[{i, e[[i, 1]] <= x < e[[i, 2]]}, {i, l - 1}]

                          , {l,e[[l, 1]] <= x <= e[[l, 2]]}]

                   , "NaN"];

   Table[p, {x, data}]];

From the first example in the above referenced documentation:

data={1, 1, 2, 3, 6, 5, 8, 10, 4, 4};

edges={2, 4, 6, 8, 10};



disc[data,edges]

{NaN,NaN,1,1,3,2,4,4,2,2}

I'm sure there are more efficient/elegant solutions, and will revisit as time permits.

edited 20 hours ago

answered yesterday

ciao

17.5k138109

edited 20 hours ago

answered yesterday

ciao

17.5k138109

answered yesterday

ciao

17.5k138109

answered yesterday

ciao

17.5k138109

add a comment |

You may also use Interpolation with InterpolationOrder -> 0. However, employing Nearest as Carl Woll did will usually be much faster.

First, we prepare the interplating function.

m = 20;

binboundaries = Join[{-1.}, Sort[RandomReal[{-1, 1}, m - 1]], {1.}];



f = Interpolation[Transpose[{binboundaries, Range[0, m]}], InterpolationOrder -> 0];

Now you can apply it to lists of values as follows:

vals = RandomReal[{-1, 1}, 1000];   

Round[f[vals]]

answered 22 hours ago

Henrik Schumacher

59.6k582166

add a comment |

You may also use Interpolation with InterpolationOrder -> 0. However, employing Nearest as Carl Woll did will usually be much faster.

First, we prepare the interplating function.

m = 20;

binboundaries = Join[{-1.}, Sort[RandomReal[{-1, 1}, m - 1]], {1.}];



f = Interpolation[Transpose[{binboundaries, Range[0, m]}], InterpolationOrder -> 0];

Now you can apply it to lists of values as follows:

vals = RandomReal[{-1, 1}, 1000];   

Round[f[vals]]

answered 22 hours ago

Henrik Schumacher

59.6k582166

add a comment |

You may also use Interpolation with InterpolationOrder -> 0. However, employing Nearest as Carl Woll did will usually be much faster.

First, we prepare the interplating function.

m = 20;

binboundaries = Join[{-1.}, Sort[RandomReal[{-1, 1}, m - 1]], {1.}];



f = Interpolation[Transpose[{binboundaries, Range[0, m]}], InterpolationOrder -> 0];

Now you can apply it to lists of values as follows:

vals = RandomReal[{-1, 1}, 1000];   

Round[f[vals]]

answered 22 hours ago

Henrik Schumacher

59.6k582166

You may also use Interpolation with InterpolationOrder -> 0. However, employing Nearest as Carl Woll did will usually be much faster.

First, we prepare the interplating function.

m = 20;

binboundaries = Join[{-1.}, Sort[RandomReal[{-1, 1}, m - 1]], {1.}];



f = Interpolation[Transpose[{binboundaries, Range[0, m]}], InterpolationOrder -> 0];

Now you can apply it to lists of values as follows:

vals = RandomReal[{-1, 1}, 1000];   

Round[f[vals]]

answered 22 hours ago

Henrik Schumacher

59.6k582166

answered 22 hours ago

Henrik Schumacher

59.6k582166

answered 22 hours ago

Henrik Schumacher

59.6k582166

answered 22 hours ago

Henrik Schumacher

59.6k582166

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Mathematica Stack Exchange!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Htydjtk