I would like to rank or sort a collection of items (potentially more than 100,000 of them) where the items have no intrinsic (comparable) value. Instead, all I have are comparisons between pairs of items, which users have provided subjectively.
Example: Consider a collection with elements [a, b, c, d] and comparisons by users b > a, a > d, d > c. The correct order of this collection would be [b, a, d, c].
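To make the simple case concrete, here is a minimal sketch that treats each comparison as a directed edge and topologically sorts the result. It assumes the comparisons never conflict, and it uses `graphlib` from the Python standard library (3.9+). The function name `order_from_comparisons` and the `(winner, loser)` pair format are my own illustrative choices, not part of any existing solution.

```python
from graphlib import TopologicalSorter


def order_from_comparisons(items, comparisons):
    """Return items ordered best-first, given (winner, loser) pairs.

    Assumes the comparisons are consistent; graphlib raises
    CycleError if they contain a conflict such as a > b and b > a.
    """
    sorter = TopologicalSorter()
    for item in items:
        sorter.add(item)  # register items that have no comparisons at all
    for winner, loser in comparisons:
        sorter.add(loser, winner)  # winner must appear before loser
    return list(sorter.static_order())


ranking = order_from_comparisons(
    ["a", "b", "c", "d"],
    [("b", "a"), ("a", "d"), ("d", "c")],
)
print(ranking)  # ['b', 'a', 'd', 'c']
```

This only covers the conflict-free case. When the comparisons do not connect all items, it returns one of the valid orderings.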
This example is simple, but there could be more complicated cases:
- Since the comparisons are subjective, a user could also say that c > b, which would conflict with the ordering above.
- You may also not have comparisons that “connect” all the items, e.g. only b > a and d > c. In that case the ordering is ambiguous: it could be [b, a, d, c] or [d, c, b, a], and either ordering is acceptable.
If possible it would be nice to somehow take into account multiple instances of the same comparison and give those with higher occurrences more weight. But a solution without this condition would still be acceptable.
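As an illustration of the behaviour described above for conflicting and repeated comparisons, here is a rough sketch of one possible approach. It assumes a greedy "ranked pairs"-style rule: count how often each comparison occurs, lock in the strongest net preferences first, and skip any preference that would contradict those already locked in. The rule, the function name `ranked_pairs_order`, and the `(winner, loser)` input format are assumptions chosen for illustration. The per-edge reachability check is also unlikely to scale to 100,000 items as written.

```python
from collections import Counter
from graphlib import TopologicalSorter


def ranked_pairs_order(items, comparisons):
    """Order items best-first from possibly conflicting (winner, loser) pairs."""
    counts = Counter(comparisons)  # repeated comparisons get more weight

    # Keep only the majority direction of each pair, with its net margin.
    margins = {}
    for (winner, loser), n in counts.items():
        margin = n - counts.get((loser, winner), 0)
        if margin > 0:
            margins[(winner, loser)] = margin

    beats = {}  # beats[x] = set of items x is locked in as beating

    def reaches(src, dst):
        # Iterative depth-first search over the locked-in edges.
        stack, seen = [src], set()
        while stack:
            node = stack.pop()
            if node == dst:
                return True
            if node not in seen:
                seen.add(node)
                stack.extend(beats.get(node, ()))
        return False

    # Strongest margins first; skip an edge if it would close a cycle.
    for (winner, loser), _ in sorted(margins.items(), key=lambda kv: -kv[1]):
        if not reaches(loser, winner):
            beats.setdefault(winner, set()).add(loser)

    sorter = TopologicalSorter()
    for item in items:
        sorter.add(item)
    for winner, losers in beats.items():
        for loser in losers:
            sorter.add(loser, winner)  # winner must appear before loser
    return list(sorter.static_order())


votes = [("b", "a")] * 3 + [("a", "d")] * 2 + [("d", "c")] * 2 + [("c", "b")]
print(ranked_pairs_order(["a", "b", "c", "d"], votes))  # ['b', 'a', 'd', 'c']
```

Here the single c > b vote is outweighed by the chain b > a > d > c, so it is dropped rather than causing a conflict.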
A similar algorithm was apparently used in Zuckerberg's FaceMash application, where he ranked people based on comparisons (if I understood it correctly), but I have not been able to find out what that algorithm actually was.
Is there an existing algorithm that can solve the problem above? I would rather not spend effort trying to come up with one if that is the case. If there is no specific algorithm, are there perhaps certain types of algorithms or techniques you can point me to?