I have a large number of numbers, each set has 10 numbers and I need to delete all those set whose Near 5 or more numbers (unroded) matches are from any other set. For example: set 1: {12,14,222,998,1,89,43,22,7654,23} set 2: {44,23,64 , 76,987,3,2345,443,431,88} set 3: {998,22,7654,345,112,32, 89,9842,31,23} above the set number of 10 Looking at the 3 set and Set 3 will be considered a duplicate because they have 5 mail numbers, in this case I will delete set 3 (because it is considered equal to 1 set). I have more than 10000 sets to compare and I want to do it very efficiently. I'm ending it and I can not think of it in an efficient manner (it would be good to do this at a single pass). Any ideas? Thanks! Mike You should reconsider your requirements because it is in operation Also, do not have a well-defined result, for example, set this: set 1: {1, 2, 3, 4, 5, 6, 7, 8, 9, 10 } Set 2: {6, 7, 8, 9, 10, 11, 12, 13, 14, 15} Set 3: {11, 12, 13, 14, 15, 16, 1...