HashSet or == sign

user2767168 Published at Dev

user2767168

I have two 50kb string data in vars doc1 and doc2 coming form the database through entity framework. I want to compare these two vars to see if doc1 and doc2 are equal. I can take the hashset of the strings and compare the hashes. Or I can simply use if (doc1 == doc2) Is there a third option that's better?

If there is no third option, does anyone have any suggestion (a logical one is good) in regards to hashset v. == in terms of optimization, performance and what IL does in the background? I would imagine that a hashset would have to scan the string in a linear fashion to the end to create a unique hash string (for two vars). So does ==. Then which one is logically better?

krisku

The == operator compares character by character but stops as soon as it finds a mismatch between the two strings, so in that regard it will be able to perform better as it likely will not have to scan the whole strings and even in the worst case when they are equal you have not performed any more work than hashing them both.

If you are really concerned about performance, you could store precomputed hashes of the long strings in the database and thus would not have to look at their content at all (provided a hash collision is not fatal).

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2021-02-12

Comments

0 comments

From Dev

Related Related

Article

HashSet or == sign

HashSet or == sign

HashSet or == sign

HashSet as key for other HashSet

HashSet implementations

Is there a HashSet in Delphi?

HashSet of integers

Implementing HashSet

HashSet is empty

HashSet in Hibernate

Is it possible that TreeSet equals HashSet but not HashSet equals TreeSet

Can I pass a HashSet<SomeEnumeration> as HashSet<byte>?

Delete hashset of actions from hashset of actions

Event Sourcing for sign up, sign in, sign out

HashSet and TreeSet performance test

HashSet<T> fundamental things

Locking HashSet for concurrency

HashSet for finding duplicate arrays

Hibernate Compare PersistentSet with HashSet

Define: What is a HashSet?

Duplicate values in a hashSet

Sort the hashset based on date

Hashset vs Treeset

Hashset memory overhead

Collectors.toSet() and HashSet

"double, Double, HashSet" in Java

HashSet for unique characters

Java HashSet contains Object

Use a HashSet as a constant

Remove on custom HashSet

HashSet not removing an item