Will parallel stream work fine with distinct operation?

up vote
7
down vote

favorite

I was reading about statelessness and came across this in doc:

Stream pipeline results may be nondeterministic or incorrect if the
behavioral parameters to the stream operations are stateful. A
stateful lambda (or other object implementing the appropriate
functional interface) is one whose result depends on any state which
might change during the execution of the stream pipeline.

Now if I have the a list of string (strList say) and then trying to remove duplicate strings from it using parallel streams in the following way:

List<String> resultOne = strList.parallelStream().distinct().collect(Collectors.toList());

or in case we want case insensitive:

List<String> result2 = strList.parallelStream().map(String::toLowerCase)

                       .distinct().collect(Collectors.toList());

Can this code have any problem as parallel streams will split the input and distinct in one chunk does not necessarily mean distinct in the whole input?

edited 2 hours ago

ernest_k

18.2k41838

asked 2 hours ago

i_am_zero

11.2k25353

1

Only problem I can think of is that the order of the strings may different than the initial order in the strList
– smac89
2 hours ago

2

There's a hint in the apiNote of Stream#distinct: "@apiNote: Preserving stability for distinct() in parallel pipelines is relatively expensive (requires that the operation act as a full barrier, with substantial buffering overhead)". And the same can be asked about reduction operations too (through parallel reduction is more easily conceivable than this distinct operation)
– ernest_k
2 hours ago

2

The quoted text is about lambdas, and distinct() doesn't take a lambda, so the quoted text is irrelevant. Also, if you read the documentation, i.e. the javadoc of distinct(), you will see that it fully addresses the behavior of the method in parallel pipelines. The only problem is performance. The method guarantees functionality, as described by the javadoc.
– Andreas
2 hours ago

add a comment |

up vote
7
down vote

favorite

I was reading about statelessness and came across this in doc:

Stream pipeline results may be nondeterministic or incorrect if the
behavioral parameters to the stream operations are stateful. A
stateful lambda (or other object implementing the appropriate
functional interface) is one whose result depends on any state which
might change during the execution of the stream pipeline.

Now if I have the a list of string (strList say) and then trying to remove duplicate strings from it using parallel streams in the following way:

List<String> resultOne = strList.parallelStream().distinct().collect(Collectors.toList());

or in case we want case insensitive:

List<String> result2 = strList.parallelStream().map(String::toLowerCase)

                       .distinct().collect(Collectors.toList());

Can this code have any problem as parallel streams will split the input and distinct in one chunk does not necessarily mean distinct in the whole input?

edited 2 hours ago

ernest_k

18.2k41838

asked 2 hours ago

i_am_zero

11.2k25353

1

Only problem I can think of is that the order of the strings may different than the initial order in the strList
– smac89
2 hours ago

2

There's a hint in the apiNote of Stream#distinct: "@apiNote: Preserving stability for distinct() in parallel pipelines is relatively expensive (requires that the operation act as a full barrier, with substantial buffering overhead)". And the same can be asked about reduction operations too (through parallel reduction is more easily conceivable than this distinct operation)
– ernest_k
2 hours ago

2

The quoted text is about lambdas, and distinct() doesn't take a lambda, so the quoted text is irrelevant. Also, if you read the documentation, i.e. the javadoc of distinct(), you will see that it fully addresses the behavior of the method in parallel pipelines. The only problem is performance. The method guarantees functionality, as described by the javadoc.
– Andreas
2 hours ago

add a comment |

up vote
7
down vote

favorite

I was reading about statelessness and came across this in doc:

Stream pipeline results may be nondeterministic or incorrect if the
behavioral parameters to the stream operations are stateful. A
stateful lambda (or other object implementing the appropriate
functional interface) is one whose result depends on any state which
might change during the execution of the stream pipeline.

Now if I have the a list of string (strList say) and then trying to remove duplicate strings from it using parallel streams in the following way:

List<String> resultOne = strList.parallelStream().distinct().collect(Collectors.toList());

or in case we want case insensitive:

List<String> result2 = strList.parallelStream().map(String::toLowerCase)

                       .distinct().collect(Collectors.toList());

Can this code have any problem as parallel streams will split the input and distinct in one chunk does not necessarily mean distinct in the whole input?

edited 2 hours ago

ernest_k

18.2k41838

asked 2 hours ago

i_am_zero

11.2k25353

I was reading about statelessness and came across this in doc:

Stream pipeline results may be nondeterministic or incorrect if the
behavioral parameters to the stream operations are stateful. A
stateful lambda (or other object implementing the appropriate
functional interface) is one whose result depends on any state which
might change during the execution of the stream pipeline.

Now if I have the a list of string (strList say) and then trying to remove duplicate strings from it using parallel streams in the following way:

List<String> resultOne = strList.parallelStream().distinct().collect(Collectors.toList());

or in case we want case insensitive:

List<String> result2 = strList.parallelStream().map(String::toLowerCase)

                       .distinct().collect(Collectors.toList());

Can this code have any problem as parallel streams will split the input and distinct in one chunk does not necessarily mean distinct in the whole input?

java java-8 java-stream

edited 2 hours ago

ernest_k

18.2k41838

asked 2 hours ago

i_am_zero

11.2k25353

edited 2 hours ago

ernest_k

18.2k41838

asked 2 hours ago

i_am_zero

11.2k25353

edited 2 hours ago

ernest_k

18.2k41838

edited 2 hours ago

ernest_k

18.2k41838

edited 2 hours ago

ernest_k

18.2k41838

asked 2 hours ago

i_am_zero

11.2k25353

asked 2 hours ago

i_am_zero

11.2k25353

asked 2 hours ago

i_am_zero

11.2k25353

1

Only problem I can think of is that the order of the strings may different than the initial order in the strList
– smac89
2 hours ago

2

There's a hint in the apiNote of Stream#distinct: "@apiNote: Preserving stability for distinct() in parallel pipelines is relatively expensive (requires that the operation act as a full barrier, with substantial buffering overhead)". And the same can be asked about reduction operations too (through parallel reduction is more easily conceivable than this distinct operation)
– ernest_k
2 hours ago

2

The quoted text is about lambdas, and distinct() doesn't take a lambda, so the quoted text is irrelevant. Also, if you read the documentation, i.e. the javadoc of distinct(), you will see that it fully addresses the behavior of the method in parallel pipelines. The only problem is performance. The method guarantees functionality, as described by the javadoc.
– Andreas
2 hours ago

add a comment |

1

Only problem I can think of is that the order of the strings may different than the initial order in the strList
– smac89
2 hours ago

2

There's a hint in the apiNote of Stream#distinct: "@apiNote: Preserving stability for distinct() in parallel pipelines is relatively expensive (requires that the operation act as a full barrier, with substantial buffering overhead)". And the same can be asked about reduction operations too (through parallel reduction is more easily conceivable than this distinct operation)
– ernest_k
2 hours ago

2

The quoted text is about lambdas, and distinct() doesn't take a lambda, so the quoted text is irrelevant. Also, if you read the documentation, i.e. the javadoc of distinct(), you will see that it fully addresses the behavior of the method in parallel pipelines. The only problem is performance. The method guarantees functionality, as described by the javadoc.
– Andreas
2 hours ago

Only problem I can think of is that the order of the strings may different than the initial order in the strList
– smac89
2 hours ago

There's a hint in the apiNote of Stream#distinct: "@apiNote: Preserving stability for distinct() in parallel pipelines is relatively expensive (requires that the operation act as a full barrier, with substantial buffering overhead)". And the same can be asked about reduction operations too (through parallel reduction is more easily conceivable than this distinct operation)
– ernest_k
2 hours ago

The quoted text is about lambdas, and distinct() doesn't take a lambda, so the quoted text is irrelevant. Also, if you read the documentation, i.e. the javadoc of distinct(), you will see that it fully addresses the behavior of the method in parallel pipelines. The only problem is performance. The method guarantees functionality, as described by the javadoc.
– Andreas
2 hours ago

add a comment |

3 Answers
3

active

oldest

votes

up vote
6
down vote

Roughly pointing out the relevant parts of the doc:

Intermediate operations are further divided into stateless and
stateful operations. Stateless operations, such as filter and map,
retain no state from previously seen element when processing a new
element -- each element can be processed independently of operations
on other elements. Stateful operations, such as distinct and sorted,
may incorporate state from previously seen elements when processing
new elements

Stateful operations may need to process the entire input before
producing a result. For example, one cannot produce any results from
sorting a stream until one has seen all elements of the stream. As a
result, under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data. Pipelines containing exclusively
stateless intermediate operations can be processed in a single pass,
whether sequential or parallel, with minimal data buffering

If you read further down (section on ordering):

Streams may or may not have a defined encounter order. Whether or not
a stream has an encounter order depends on the source and the
intermediate operations. Certain stream sources (such as List or
arrays) are intrinsically ordered, whereas others (such as HashSet)
are not. Some intermediate operations, such as sorted(), may impose an
encounter order on an otherwise unordered stream, and others may
render an ordered stream unordered, such as BaseStream.unordered().
Further, some terminal operations may ignore encounter order, such as
forEach().

...

For parallel streams, relaxing the ordering constraint can sometimes
enable more efficient execution. Certain aggregate operations, such as
filtering duplicates (distinct()) or grouped reductions
(Collectors.groupingBy()) can be implemented more efficiently if
ordering of elements is not relevant. Similarly, operations that are
intrinsically tied to encounter order, such as limit(), may require
buffering to ensure proper ordering, undermining the benefit of
parallelism. In cases where the stream has an encounter order, but the
user does not particularly care about that encounter order, explicitly
de-ordering the stream with unordered() may improve parallel
performance for some stateful or terminal operations. However, most
stream pipelines, such as the "sum of weight of blocks" example above,
still parallelize efficiently even under ordering constraints.

In conclusion,

distinct will work fine with parallel streams, but as you may already know, it has to consume the entire stream before continuing and this uses a lot of data.

If the source of the items is an unordered collection (such as hashset) or the stream is unordered(), then distinct is not worried about ordering the output and thus will be efficient

Solution is to add .unordered() to the stream pipeline if you are not worried about order and would like to see more performance.

List<String> result2 = strList.parallelStream()

                              .unordered()

                              .map(String::toLowerCase)

                              .distinct()

                              .collect(Collectors.toList());

Alas there is no (available builtin) concurrent hashset in Java (unless they got clever with ConcurrentHashMap), so I can only leave you with the unfortunate possibility that distinct is implemented in a blocking fashion using a regular Java set. In which case, I don't see any benefit of doing a parallel distinct.

edited 1 hour ago

answered 1 hour ago

smac89

11.8k43472

Was reading the docs as well for this one.. As a result, under parallel computation, some pipelines containing stateful intermediate operations may require multiple passes on the data or may need to buffer significant data. seems to be the precise answer OP is looking for.
– nullpointer
1 hour ago

@smac well there actually is a built-in concurrent Set. That the factory method for it is placed in ConcurrentHashMap does not really change that: newKeySet()
– Hulk
10 mins ago

add a comment |

up vote
2
down vote

There won't be a problem (problem as in a wrong result) but as the API note says

Preserving stability for distinct() in parallel pipelines is relatively expensive

But if performance is of concern and if stability is not a problem (i.e the result having a different order of elements with respect to the collection it processed ) then you follow the API's note

removing the ordering constraint with BaseStream.unordered() may
result in significantly more efficient execution for distinct() in
parallel pipelines,

I thought why not benchmark performance of parallel and sequential streams for distinct

public static void main(String args) {

        List<String> strList = Arrays.asList("cat", "nat", "hat", "tat", "heart", "fat", "bat", "lad", "crab", "snob");



        List<String> words = new Vector<>();





        int wordCount = 1_000_000; // no. of words in the list words

        int avgIter = 10; // iterations to run to find average running time



        //populate a list randomly with the strings in `strList`

        for (int i = 0; i < wordCount; i++) 

            words.add(strList.get((int) Math.round(Math.random() * (strList.size() - 1))));











        //find out average running times

        long starttime, pod = 0, pud = 0, sod = 0;

        for (int i = 0; i < avgIter; i++) {

            starttime = System.currentTimeMillis();

            List<String> parallelOrderedDistinct = words.parallelStream().distinct().collect(Collectors.toList());

            pod += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> parallelUnorderedDistinct =

                    words.parallelStream().unordered().distinct().collect(Collectors.toList());

            pud += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> sequentialOrderedDistinct = words.stream().distinct().collect(Collectors.toList());

            sod += System.currentTimeMillis() - starttime;

        }



        System.out.println("Parallel ordered time in ms: " + pod / avgIter);

        System.out.println("Parallel unordered time in ms: " + pud / avgIter);

        System.out.println("Sequential implicitly ordered time in ms: " + sod / avgIter);

    }

The above was compiled by open-jdk 8 and run on openjdk's jre 8 (no jvm specific arguments) on an i3 6th gen (4 logical cores) and I got these results

Seemed like after a certain no. of elements, ordered parallel was faster and ironically parallel unordered was the slowest.

Parallel ordered time in ms: 52

Parallel unordered time in ms: 81

Sequential implicitly ordered time in ms: 35

Parallel ordered time in ms: 48

Parallel unordered time in ms: 83

Sequential implicitly ordered time in ms: 34

Parallel ordered time in ms: 36

Parallel unordered time in ms: 70

Sequential implicitly ordered time in ms: 32

The unordered parallel was twice slower than both.

Then I upped wordCount to 5_000_000 and these were the results

Parallel ordered time in ms: 93

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 123

Parallel ordered time in ms: 100

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 124

Parallel ordered time in ms: 89

Parallel unordered time in ms: 365

Sequential implicitly ordered time in ms: 118

and then to 10_000_000

Parallel ordered time in ms: 148

Parallel unordered time in ms: 725

Sequential implicitly ordered time in ms: 218

Parallel ordered time in ms: 150

Parallel unordered time in ms: 749

Sequential implicitly ordered time in ms: 224

Parallel ordered time in ms: 143

Parallel unordered time in ms: 743

Sequential implicitly ordered time in ms: 222

edited 45 mins ago

answered 2 hours ago

Ryotsu

390111

add a comment |

up vote
1
down vote

From the javadocs, parallelStream()

Returns a possibly parallel Stream with this collection as its source.
It is allowable for this method to return a sequential stream.

Performance:

Let us consider we have a multiple stream(luckily) that is given to different cores of CPU. ArrayList<T> which has internal data representation based upon an array. Or a LinkedList<T> which needs more computation for splitting to be processed in parallel. ArrayList<T> is better in this case!

stream.unordered().parallel().distinct() has better performance than stream.parallel().distinct()

Preserving stability for distinct() in parallel pipelines is
relatively expensive (requires that the operation act as a full
barrier, with substantial buffering overhead).

So, in your case it should not be a problem(Unless your List<T> does not care of order). Read below for explanation,

Lets say you have 4 elements in ArrayList,
{"a","b","a","b"}

Now if you don't use parallelStream() before calling distinct(), only the String at positions 0 and 1 is retained.(Preserves the order,Sequential stream)

Else, (if you use parallelStream().distinct()) then elements at 1 and 2 can be retained as distinct(It is unstable, but the result is same {"a,"b"} or it can even be {"b","a"}).

An unstable distinct operation will randomly eliminate the duplicates.

Finally,

under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data

edited 1 hour ago

answered 1 hour ago

Mohamed Anees A

593413

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53645037%2fwill-parallel-stream-work-fine-with-distinct-operation%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

3 Answers
3

active

oldest

votes

3 Answers
3

active

oldest

votes

up vote
6
down vote

Roughly pointing out the relevant parts of the doc:

Intermediate operations are further divided into stateless and
stateful operations. Stateless operations, such as filter and map,
retain no state from previously seen element when processing a new
element -- each element can be processed independently of operations
on other elements. Stateful operations, such as distinct and sorted,
may incorporate state from previously seen elements when processing
new elements

Stateful operations may need to process the entire input before
producing a result. For example, one cannot produce any results from
sorting a stream until one has seen all elements of the stream. As a
result, under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data. Pipelines containing exclusively
stateless intermediate operations can be processed in a single pass,
whether sequential or parallel, with minimal data buffering

If you read further down (section on ordering):

Streams may or may not have a defined encounter order. Whether or not
a stream has an encounter order depends on the source and the
intermediate operations. Certain stream sources (such as List or
arrays) are intrinsically ordered, whereas others (such as HashSet)
are not. Some intermediate operations, such as sorted(), may impose an
encounter order on an otherwise unordered stream, and others may
render an ordered stream unordered, such as BaseStream.unordered().
Further, some terminal operations may ignore encounter order, such as
forEach().

...

For parallel streams, relaxing the ordering constraint can sometimes
enable more efficient execution. Certain aggregate operations, such as
filtering duplicates (distinct()) or grouped reductions
(Collectors.groupingBy()) can be implemented more efficiently if
ordering of elements is not relevant. Similarly, operations that are
intrinsically tied to encounter order, such as limit(), may require
buffering to ensure proper ordering, undermining the benefit of
parallelism. In cases where the stream has an encounter order, but the
user does not particularly care about that encounter order, explicitly
de-ordering the stream with unordered() may improve parallel
performance for some stateful or terminal operations. However, most
stream pipelines, such as the "sum of weight of blocks" example above,
still parallelize efficiently even under ordering constraints.

In conclusion,

distinct will work fine with parallel streams, but as you may already know, it has to consume the entire stream before continuing and this uses a lot of data.

If the source of the items is an unordered collection (such as hashset) or the stream is unordered(), then distinct is not worried about ordering the output and thus will be efficient

Solution is to add .unordered() to the stream pipeline if you are not worried about order and would like to see more performance.

List<String> result2 = strList.parallelStream()

                              .unordered()

                              .map(String::toLowerCase)

                              .distinct()

                              .collect(Collectors.toList());

edited 1 hour ago

answered 1 hour ago

smac89

11.8k43472

Was reading the docs as well for this one.. As a result, under parallel computation, some pipelines containing stateful intermediate operations may require multiple passes on the data or may need to buffer significant data. seems to be the precise answer OP is looking for.
– nullpointer
1 hour ago

@smac well there actually is a built-in concurrent Set. That the factory method for it is placed in ConcurrentHashMap does not really change that: newKeySet()
– Hulk
10 mins ago

add a comment |

up vote
6
down vote

Roughly pointing out the relevant parts of the doc:

Intermediate operations are further divided into stateless and
stateful operations. Stateless operations, such as filter and map,
retain no state from previously seen element when processing a new
element -- each element can be processed independently of operations
on other elements. Stateful operations, such as distinct and sorted,
may incorporate state from previously seen elements when processing
new elements

Stateful operations may need to process the entire input before
producing a result. For example, one cannot produce any results from
sorting a stream until one has seen all elements of the stream. As a
result, under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data. Pipelines containing exclusively
stateless intermediate operations can be processed in a single pass,
whether sequential or parallel, with minimal data buffering

If you read further down (section on ordering):

Streams may or may not have a defined encounter order. Whether or not
a stream has an encounter order depends on the source and the
intermediate operations. Certain stream sources (such as List or
arrays) are intrinsically ordered, whereas others (such as HashSet)
are not. Some intermediate operations, such as sorted(), may impose an
encounter order on an otherwise unordered stream, and others may
render an ordered stream unordered, such as BaseStream.unordered().
Further, some terminal operations may ignore encounter order, such as
forEach().

...

For parallel streams, relaxing the ordering constraint can sometimes
enable more efficient execution. Certain aggregate operations, such as
filtering duplicates (distinct()) or grouped reductions
(Collectors.groupingBy()) can be implemented more efficiently if
ordering of elements is not relevant. Similarly, operations that are
intrinsically tied to encounter order, such as limit(), may require
buffering to ensure proper ordering, undermining the benefit of
parallelism. In cases where the stream has an encounter order, but the
user does not particularly care about that encounter order, explicitly
de-ordering the stream with unordered() may improve parallel
performance for some stateful or terminal operations. However, most
stream pipelines, such as the "sum of weight of blocks" example above,
still parallelize efficiently even under ordering constraints.

In conclusion,

distinct will work fine with parallel streams, but as you may already know, it has to consume the entire stream before continuing and this uses a lot of data.

If the source of the items is an unordered collection (such as hashset) or the stream is unordered(), then distinct is not worried about ordering the output and thus will be efficient

Solution is to add .unordered() to the stream pipeline if you are not worried about order and would like to see more performance.

List<String> result2 = strList.parallelStream()

                              .unordered()

                              .map(String::toLowerCase)

                              .distinct()

                              .collect(Collectors.toList());

edited 1 hour ago

answered 1 hour ago

smac89

11.8k43472

Was reading the docs as well for this one.. As a result, under parallel computation, some pipelines containing stateful intermediate operations may require multiple passes on the data or may need to buffer significant data. seems to be the precise answer OP is looking for.
– nullpointer
1 hour ago

@smac well there actually is a built-in concurrent Set. That the factory method for it is placed in ConcurrentHashMap does not really change that: newKeySet()
– Hulk
10 mins ago

add a comment |

up vote
6
down vote

Roughly pointing out the relevant parts of the doc:

Intermediate operations are further divided into stateless and
stateful operations. Stateless operations, such as filter and map,
retain no state from previously seen element when processing a new
element -- each element can be processed independently of operations
on other elements. Stateful operations, such as distinct and sorted,
may incorporate state from previously seen elements when processing
new elements

Stateful operations may need to process the entire input before
producing a result. For example, one cannot produce any results from
sorting a stream until one has seen all elements of the stream. As a
result, under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data. Pipelines containing exclusively
stateless intermediate operations can be processed in a single pass,
whether sequential or parallel, with minimal data buffering

If you read further down (section on ordering):

Streams may or may not have a defined encounter order. Whether or not
a stream has an encounter order depends on the source and the
intermediate operations. Certain stream sources (such as List or
arrays) are intrinsically ordered, whereas others (such as HashSet)
are not. Some intermediate operations, such as sorted(), may impose an
encounter order on an otherwise unordered stream, and others may
render an ordered stream unordered, such as BaseStream.unordered().
Further, some terminal operations may ignore encounter order, such as
forEach().

...

For parallel streams, relaxing the ordering constraint can sometimes
enable more efficient execution. Certain aggregate operations, such as
filtering duplicates (distinct()) or grouped reductions
(Collectors.groupingBy()) can be implemented more efficiently if
ordering of elements is not relevant. Similarly, operations that are
intrinsically tied to encounter order, such as limit(), may require
buffering to ensure proper ordering, undermining the benefit of
parallelism. In cases where the stream has an encounter order, but the
user does not particularly care about that encounter order, explicitly
de-ordering the stream with unordered() may improve parallel
performance for some stateful or terminal operations. However, most
stream pipelines, such as the "sum of weight of blocks" example above,
still parallelize efficiently even under ordering constraints.

In conclusion,

distinct will work fine with parallel streams, but as you may already know, it has to consume the entire stream before continuing and this uses a lot of data.

If the source of the items is an unordered collection (such as hashset) or the stream is unordered(), then distinct is not worried about ordering the output and thus will be efficient

Solution is to add .unordered() to the stream pipeline if you are not worried about order and would like to see more performance.

List<String> result2 = strList.parallelStream()

                              .unordered()

                              .map(String::toLowerCase)

                              .distinct()

                              .collect(Collectors.toList());

edited 1 hour ago

answered 1 hour ago

smac89

11.8k43472

Roughly pointing out the relevant parts of the doc:

Intermediate operations are further divided into stateless and
stateful operations. Stateless operations, such as filter and map,
retain no state from previously seen element when processing a new
element -- each element can be processed independently of operations
on other elements. Stateful operations, such as distinct and sorted,
may incorporate state from previously seen elements when processing
new elements

Stateful operations may need to process the entire input before
producing a result. For example, one cannot produce any results from
sorting a stream until one has seen all elements of the stream. As a
result, under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data. Pipelines containing exclusively
stateless intermediate operations can be processed in a single pass,
whether sequential or parallel, with minimal data buffering

If you read further down (section on ordering):

Streams may or may not have a defined encounter order. Whether or not
a stream has an encounter order depends on the source and the
intermediate operations. Certain stream sources (such as List or
arrays) are intrinsically ordered, whereas others (such as HashSet)
are not. Some intermediate operations, such as sorted(), may impose an
encounter order on an otherwise unordered stream, and others may
render an ordered stream unordered, such as BaseStream.unordered().
Further, some terminal operations may ignore encounter order, such as
forEach().

...

For parallel streams, relaxing the ordering constraint can sometimes
enable more efficient execution. Certain aggregate operations, such as
filtering duplicates (distinct()) or grouped reductions
(Collectors.groupingBy()) can be implemented more efficiently if
ordering of elements is not relevant. Similarly, operations that are
intrinsically tied to encounter order, such as limit(), may require
buffering to ensure proper ordering, undermining the benefit of
parallelism. In cases where the stream has an encounter order, but the
user does not particularly care about that encounter order, explicitly
de-ordering the stream with unordered() may improve parallel
performance for some stateful or terminal operations. However, most
stream pipelines, such as the "sum of weight of blocks" example above,
still parallelize efficiently even under ordering constraints.

In conclusion,

distinct will work fine with parallel streams, but as you may already know, it has to consume the entire stream before continuing and this uses a lot of data.

If the source of the items is an unordered collection (such as hashset) or the stream is unordered(), then distinct is not worried about ordering the output and thus will be efficient

Solution is to add .unordered() to the stream pipeline if you are not worried about order and would like to see more performance.

List<String> result2 = strList.parallelStream()

                              .unordered()

                              .map(String::toLowerCase)

                              .distinct()

                              .collect(Collectors.toList());

edited 1 hour ago

answered 1 hour ago

smac89

11.8k43472

edited 1 hour ago

answered 1 hour ago

smac89

11.8k43472

answered 1 hour ago

smac89

11.8k43472

answered 1 hour ago

smac89

11.8k43472

Was reading the docs as well for this one.. As a result, under parallel computation, some pipelines containing stateful intermediate operations may require multiple passes on the data or may need to buffer significant data. seems to be the precise answer OP is looking for.
– nullpointer
1 hour ago

@smac well there actually is a built-in concurrent Set. That the factory method for it is placed in ConcurrentHashMap does not really change that: newKeySet()
– Hulk
10 mins ago

add a comment |

Was reading the docs as well for this one.. As a result, under parallel computation, some pipelines containing stateful intermediate operations may require multiple passes on the data or may need to buffer significant data. seems to be the precise answer OP is looking for.
– nullpointer
1 hour ago

@smac well there actually is a built-in concurrent Set. That the factory method for it is placed in ConcurrentHashMap does not really change that: newKeySet()
– Hulk
10 mins ago

Was reading the docs as well for this one.. As a result, under parallel computation, some pipelines containing stateful intermediate operations may require multiple passes on the data or may need to buffer significant data. seems to be the precise answer OP is looking for.
– nullpointer
1 hour ago

@smac well there actually is a built-in concurrent Set. That the factory method for it is placed in ConcurrentHashMap does not really change that: newKeySet()
– Hulk
10 mins ago

add a comment |

up vote
2
down vote

There won't be a problem (problem as in a wrong result) but as the API note says

Preserving stability for distinct() in parallel pipelines is relatively expensive

But if performance is of concern and if stability is not a problem (i.e the result having a different order of elements with respect to the collection it processed ) then you follow the API's note

removing the ordering constraint with BaseStream.unordered() may
result in significantly more efficient execution for distinct() in
parallel pipelines,

I thought why not benchmark performance of parallel and sequential streams for distinct

public static void main(String args) {

        List<String> strList = Arrays.asList("cat", "nat", "hat", "tat", "heart", "fat", "bat", "lad", "crab", "snob");



        List<String> words = new Vector<>();





        int wordCount = 1_000_000; // no. of words in the list words

        int avgIter = 10; // iterations to run to find average running time



        //populate a list randomly with the strings in `strList`

        for (int i = 0; i < wordCount; i++) 

            words.add(strList.get((int) Math.round(Math.random() * (strList.size() - 1))));











        //find out average running times

        long starttime, pod = 0, pud = 0, sod = 0;

        for (int i = 0; i < avgIter; i++) {

            starttime = System.currentTimeMillis();

            List<String> parallelOrderedDistinct = words.parallelStream().distinct().collect(Collectors.toList());

            pod += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> parallelUnorderedDistinct =

                    words.parallelStream().unordered().distinct().collect(Collectors.toList());

            pud += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> sequentialOrderedDistinct = words.stream().distinct().collect(Collectors.toList());

            sod += System.currentTimeMillis() - starttime;

        }



        System.out.println("Parallel ordered time in ms: " + pod / avgIter);

        System.out.println("Parallel unordered time in ms: " + pud / avgIter);

        System.out.println("Sequential implicitly ordered time in ms: " + sod / avgIter);

    }

The above was compiled by open-jdk 8 and run on openjdk's jre 8 (no jvm specific arguments) on an i3 6th gen (4 logical cores) and I got these results

Seemed like after a certain no. of elements, ordered parallel was faster and ironically parallel unordered was the slowest.

Parallel ordered time in ms: 52

Parallel unordered time in ms: 81

Sequential implicitly ordered time in ms: 35

Parallel ordered time in ms: 48

Parallel unordered time in ms: 83

Sequential implicitly ordered time in ms: 34

Parallel ordered time in ms: 36

Parallel unordered time in ms: 70

Sequential implicitly ordered time in ms: 32

The unordered parallel was twice slower than both.

Then I upped wordCount to 5_000_000 and these were the results

Parallel ordered time in ms: 93

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 123

Parallel ordered time in ms: 100

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 124

Parallel ordered time in ms: 89

Parallel unordered time in ms: 365

Sequential implicitly ordered time in ms: 118

and then to 10_000_000

Parallel ordered time in ms: 148

Parallel unordered time in ms: 725

Sequential implicitly ordered time in ms: 218

Parallel ordered time in ms: 150

Parallel unordered time in ms: 749

Sequential implicitly ordered time in ms: 224

Parallel ordered time in ms: 143

Parallel unordered time in ms: 743

Sequential implicitly ordered time in ms: 222

edited 45 mins ago

answered 2 hours ago

Ryotsu

390111

add a comment |

up vote
2
down vote

There won't be a problem (problem as in a wrong result) but as the API note says

Preserving stability for distinct() in parallel pipelines is relatively expensive

But if performance is of concern and if stability is not a problem (i.e the result having a different order of elements with respect to the collection it processed ) then you follow the API's note

removing the ordering constraint with BaseStream.unordered() may
result in significantly more efficient execution for distinct() in
parallel pipelines,

I thought why not benchmark performance of parallel and sequential streams for distinct

public static void main(String args) {

        List<String> strList = Arrays.asList("cat", "nat", "hat", "tat", "heart", "fat", "bat", "lad", "crab", "snob");



        List<String> words = new Vector<>();





        int wordCount = 1_000_000; // no. of words in the list words

        int avgIter = 10; // iterations to run to find average running time



        //populate a list randomly with the strings in `strList`

        for (int i = 0; i < wordCount; i++) 

            words.add(strList.get((int) Math.round(Math.random() * (strList.size() - 1))));











        //find out average running times

        long starttime, pod = 0, pud = 0, sod = 0;

        for (int i = 0; i < avgIter; i++) {

            starttime = System.currentTimeMillis();

            List<String> parallelOrderedDistinct = words.parallelStream().distinct().collect(Collectors.toList());

            pod += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> parallelUnorderedDistinct =

                    words.parallelStream().unordered().distinct().collect(Collectors.toList());

            pud += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> sequentialOrderedDistinct = words.stream().distinct().collect(Collectors.toList());

            sod += System.currentTimeMillis() - starttime;

        }



        System.out.println("Parallel ordered time in ms: " + pod / avgIter);

        System.out.println("Parallel unordered time in ms: " + pud / avgIter);

        System.out.println("Sequential implicitly ordered time in ms: " + sod / avgIter);

    }

The above was compiled by open-jdk 8 and run on openjdk's jre 8 (no jvm specific arguments) on an i3 6th gen (4 logical cores) and I got these results

Seemed like after a certain no. of elements, ordered parallel was faster and ironically parallel unordered was the slowest.

Parallel ordered time in ms: 52

Parallel unordered time in ms: 81

Sequential implicitly ordered time in ms: 35

Parallel ordered time in ms: 48

Parallel unordered time in ms: 83

Sequential implicitly ordered time in ms: 34

Parallel ordered time in ms: 36

Parallel unordered time in ms: 70

Sequential implicitly ordered time in ms: 32

The unordered parallel was twice slower than both.

Then I upped wordCount to 5_000_000 and these were the results

Parallel ordered time in ms: 93

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 123

Parallel ordered time in ms: 100

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 124

Parallel ordered time in ms: 89

Parallel unordered time in ms: 365

Sequential implicitly ordered time in ms: 118

and then to 10_000_000

Parallel ordered time in ms: 148

Parallel unordered time in ms: 725

Sequential implicitly ordered time in ms: 218

Parallel ordered time in ms: 150

Parallel unordered time in ms: 749

Sequential implicitly ordered time in ms: 224

Parallel ordered time in ms: 143

Parallel unordered time in ms: 743

Sequential implicitly ordered time in ms: 222

edited 45 mins ago

answered 2 hours ago

Ryotsu

390111

add a comment |

up vote
2
down vote

There won't be a problem (problem as in a wrong result) but as the API note says

Preserving stability for distinct() in parallel pipelines is relatively expensive

But if performance is of concern and if stability is not a problem (i.e the result having a different order of elements with respect to the collection it processed ) then you follow the API's note

removing the ordering constraint with BaseStream.unordered() may
result in significantly more efficient execution for distinct() in
parallel pipelines,

I thought why not benchmark performance of parallel and sequential streams for distinct

public static void main(String args) {

        List<String> strList = Arrays.asList("cat", "nat", "hat", "tat", "heart", "fat", "bat", "lad", "crab", "snob");



        List<String> words = new Vector<>();





        int wordCount = 1_000_000; // no. of words in the list words

        int avgIter = 10; // iterations to run to find average running time



        //populate a list randomly with the strings in `strList`

        for (int i = 0; i < wordCount; i++) 

            words.add(strList.get((int) Math.round(Math.random() * (strList.size() - 1))));











        //find out average running times

        long starttime, pod = 0, pud = 0, sod = 0;

        for (int i = 0; i < avgIter; i++) {

            starttime = System.currentTimeMillis();

            List<String> parallelOrderedDistinct = words.parallelStream().distinct().collect(Collectors.toList());

            pod += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> parallelUnorderedDistinct =

                    words.parallelStream().unordered().distinct().collect(Collectors.toList());

            pud += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> sequentialOrderedDistinct = words.stream().distinct().collect(Collectors.toList());

            sod += System.currentTimeMillis() - starttime;

        }



        System.out.println("Parallel ordered time in ms: " + pod / avgIter);

        System.out.println("Parallel unordered time in ms: " + pud / avgIter);

        System.out.println("Sequential implicitly ordered time in ms: " + sod / avgIter);

    }

The above was compiled by open-jdk 8 and run on openjdk's jre 8 (no jvm specific arguments) on an i3 6th gen (4 logical cores) and I got these results

Seemed like after a certain no. of elements, ordered parallel was faster and ironically parallel unordered was the slowest.

Parallel ordered time in ms: 52

Parallel unordered time in ms: 81

Sequential implicitly ordered time in ms: 35

Parallel ordered time in ms: 48

Parallel unordered time in ms: 83

Sequential implicitly ordered time in ms: 34

Parallel ordered time in ms: 36

Parallel unordered time in ms: 70

Sequential implicitly ordered time in ms: 32

The unordered parallel was twice slower than both.

Then I upped wordCount to 5_000_000 and these were the results

Parallel ordered time in ms: 93

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 123

Parallel ordered time in ms: 100

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 124

Parallel ordered time in ms: 89

Parallel unordered time in ms: 365

Sequential implicitly ordered time in ms: 118

and then to 10_000_000

Parallel ordered time in ms: 148

Parallel unordered time in ms: 725

Sequential implicitly ordered time in ms: 218

Parallel ordered time in ms: 150

Parallel unordered time in ms: 749

Sequential implicitly ordered time in ms: 224

Parallel ordered time in ms: 143

Parallel unordered time in ms: 743

Sequential implicitly ordered time in ms: 222

edited 45 mins ago

answered 2 hours ago

Ryotsu

390111

There won't be a problem (problem as in a wrong result) but as the API note says

Preserving stability for distinct() in parallel pipelines is relatively expensive

But if performance is of concern and if stability is not a problem (i.e the result having a different order of elements with respect to the collection it processed ) then you follow the API's note

removing the ordering constraint with BaseStream.unordered() may
result in significantly more efficient execution for distinct() in
parallel pipelines,

I thought why not benchmark performance of parallel and sequential streams for distinct

public static void main(String args) {

        List<String> strList = Arrays.asList("cat", "nat", "hat", "tat", "heart", "fat", "bat", "lad", "crab", "snob");



        List<String> words = new Vector<>();





        int wordCount = 1_000_000; // no. of words in the list words

        int avgIter = 10; // iterations to run to find average running time



        //populate a list randomly with the strings in `strList`

        for (int i = 0; i < wordCount; i++) 

            words.add(strList.get((int) Math.round(Math.random() * (strList.size() - 1))));











        //find out average running times

        long starttime, pod = 0, pud = 0, sod = 0;

        for (int i = 0; i < avgIter; i++) {

            starttime = System.currentTimeMillis();

            List<String> parallelOrderedDistinct = words.parallelStream().distinct().collect(Collectors.toList());

            pod += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> parallelUnorderedDistinct =

                    words.parallelStream().unordered().distinct().collect(Collectors.toList());

            pud += System.currentTimeMillis() - starttime;



            starttime = System.currentTimeMillis();

            List<String> sequentialOrderedDistinct = words.stream().distinct().collect(Collectors.toList());

            sod += System.currentTimeMillis() - starttime;

        }



        System.out.println("Parallel ordered time in ms: " + pod / avgIter);

        System.out.println("Parallel unordered time in ms: " + pud / avgIter);

        System.out.println("Sequential implicitly ordered time in ms: " + sod / avgIter);

    }

The above was compiled by open-jdk 8 and run on openjdk's jre 8 (no jvm specific arguments) on an i3 6th gen (4 logical cores) and I got these results

Seemed like after a certain no. of elements, ordered parallel was faster and ironically parallel unordered was the slowest.

Parallel ordered time in ms: 52

Parallel unordered time in ms: 81

Sequential implicitly ordered time in ms: 35

Parallel ordered time in ms: 48

Parallel unordered time in ms: 83

Sequential implicitly ordered time in ms: 34

Parallel ordered time in ms: 36

Parallel unordered time in ms: 70

Sequential implicitly ordered time in ms: 32

The unordered parallel was twice slower than both.

Then I upped wordCount to 5_000_000 and these were the results

Parallel ordered time in ms: 93

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 123

Parallel ordered time in ms: 100

Parallel unordered time in ms: 363

Sequential implicitly ordered time in ms: 124

Parallel ordered time in ms: 89

Parallel unordered time in ms: 365

Sequential implicitly ordered time in ms: 118

and then to 10_000_000

Parallel ordered time in ms: 148

Parallel unordered time in ms: 725

Sequential implicitly ordered time in ms: 218

Parallel ordered time in ms: 150

Parallel unordered time in ms: 749

Sequential implicitly ordered time in ms: 224

Parallel ordered time in ms: 143

Parallel unordered time in ms: 743

Sequential implicitly ordered time in ms: 222

edited 45 mins ago

answered 2 hours ago

Ryotsu

390111

edited 45 mins ago

answered 2 hours ago

Ryotsu

390111

answered 2 hours ago

Ryotsu

390111

answered 2 hours ago

Ryotsu

390111

add a comment |

up vote
1
down vote

From the javadocs, parallelStream()

Returns a possibly parallel Stream with this collection as its source.
It is allowable for this method to return a sequential stream.

Performance:

Let us consider we have a multiple stream(luckily) that is given to different cores of CPU. ArrayList<T> which has internal data representation based upon an array. Or a LinkedList<T> which needs more computation for splitting to be processed in parallel. ArrayList<T> is better in this case!

stream.unordered().parallel().distinct() has better performance than stream.parallel().distinct()

Preserving stability for distinct() in parallel pipelines is
relatively expensive (requires that the operation act as a full
barrier, with substantial buffering overhead).

So, in your case it should not be a problem(Unless your List<T> does not care of order). Read below for explanation,

Lets say you have 4 elements in ArrayList,
{"a","b","a","b"}

Now if you don't use parallelStream() before calling distinct(), only the String at positions 0 and 1 is retained.(Preserves the order,Sequential stream)

Else, (if you use parallelStream().distinct()) then elements at 1 and 2 can be retained as distinct(It is unstable, but the result is same {"a,"b"} or it can even be {"b","a"}).

An unstable distinct operation will randomly eliminate the duplicates.

Finally,

under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data

edited 1 hour ago

answered 1 hour ago

Mohamed Anees A

593413

add a comment |

up vote
1
down vote

From the javadocs, parallelStream()

Returns a possibly parallel Stream with this collection as its source.
It is allowable for this method to return a sequential stream.

Performance:

Let us consider we have a multiple stream(luckily) that is given to different cores of CPU. ArrayList<T> which has internal data representation based upon an array. Or a LinkedList<T> which needs more computation for splitting to be processed in parallel. ArrayList<T> is better in this case!

stream.unordered().parallel().distinct() has better performance than stream.parallel().distinct()

Preserving stability for distinct() in parallel pipelines is
relatively expensive (requires that the operation act as a full
barrier, with substantial buffering overhead).

So, in your case it should not be a problem(Unless your List<T> does not care of order). Read below for explanation,

Lets say you have 4 elements in ArrayList,
{"a","b","a","b"}

Now if you don't use parallelStream() before calling distinct(), only the String at positions 0 and 1 is retained.(Preserves the order,Sequential stream)

Else, (if you use parallelStream().distinct()) then elements at 1 and 2 can be retained as distinct(It is unstable, but the result is same {"a,"b"} or it can even be {"b","a"}).

An unstable distinct operation will randomly eliminate the duplicates.

Finally,

under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data

edited 1 hour ago

answered 1 hour ago

Mohamed Anees A

593413

add a comment |

up vote
1
down vote

From the javadocs, parallelStream()

Returns a possibly parallel Stream with this collection as its source.
It is allowable for this method to return a sequential stream.

Performance:

Let us consider we have a multiple stream(luckily) that is given to different cores of CPU. ArrayList<T> which has internal data representation based upon an array. Or a LinkedList<T> which needs more computation for splitting to be processed in parallel. ArrayList<T> is better in this case!

stream.unordered().parallel().distinct() has better performance than stream.parallel().distinct()

Preserving stability for distinct() in parallel pipelines is
relatively expensive (requires that the operation act as a full
barrier, with substantial buffering overhead).

So, in your case it should not be a problem(Unless your List<T> does not care of order). Read below for explanation,

Lets say you have 4 elements in ArrayList,
{"a","b","a","b"}

Now if you don't use parallelStream() before calling distinct(), only the String at positions 0 and 1 is retained.(Preserves the order,Sequential stream)

Else, (if you use parallelStream().distinct()) then elements at 1 and 2 can be retained as distinct(It is unstable, but the result is same {"a,"b"} or it can even be {"b","a"}).

An unstable distinct operation will randomly eliminate the duplicates.

Finally,

under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data

edited 1 hour ago

answered 1 hour ago

Mohamed Anees A

593413

From the javadocs, parallelStream()

Returns a possibly parallel Stream with this collection as its source.
It is allowable for this method to return a sequential stream.

Performance:

Let us consider we have a multiple stream(luckily) that is given to different cores of CPU. ArrayList<T> which has internal data representation based upon an array. Or a LinkedList<T> which needs more computation for splitting to be processed in parallel. ArrayList<T> is better in this case!

stream.unordered().parallel().distinct() has better performance than stream.parallel().distinct()

Preserving stability for distinct() in parallel pipelines is
relatively expensive (requires that the operation act as a full
barrier, with substantial buffering overhead).

So, in your case it should not be a problem(Unless your List<T> does not care of order). Read below for explanation,

Lets say you have 4 elements in ArrayList,
{"a","b","a","b"}

Now if you don't use parallelStream() before calling distinct(), only the String at positions 0 and 1 is retained.(Preserves the order,Sequential stream)

Else, (if you use parallelStream().distinct()) then elements at 1 and 2 can be retained as distinct(It is unstable, but the result is same {"a,"b"} or it can even be {"b","a"}).

An unstable distinct operation will randomly eliminate the duplicates.

Finally,

under parallel computation, some pipelines containing stateful
intermediate operations may require multiple passes on the data or may
need to buffer significant data

edited 1 hour ago

answered 1 hour ago

Mohamed Anees A

593413

edited 1 hour ago

answered 1 hour ago

Mohamed Anees A

593413

answered 1 hour ago

Mohamed Anees A

593413

answered 1 hour ago

Mohamed Anees A

593413

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

Some of your past answers have not been well-received, and you're in danger of being blocked from answering.

Please pay close attention to the following guidance:

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Tukukkk