Information Retrieval research and search engine development discussion.
Pretty--and interesting. Though I'm curious if / how they distinguish duplication from citation.
Hey Daniel. Duplication can be detected by choosing a threshold, admittedly a somewhat arbitrary one, for what you define as the limit of fair use. Quoting a few paragraphs at a time is probably fair, provided it is followed by a roughly equal proportion of original text. This should probably be framed as a probability model, though, since there is a spectrum from fair use to outright theft.
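To make the threshold-versus-spectrum idea concrete, here is a rough sketch, not a description of how any particular search engine does it. It measures what fraction of a page's word 5-grams ("shingles") also appear in a source page, then maps that fraction to a probability with a logistic curve. The function names, the shingle size of 5, the midpoint of 0.5 (roughly equal parts quoted and original text), and the steepness value are all assumptions chosen for illustration.

```python
import math
import re


def shingles(text, k=5):
    """Return the set of k-word shingles in text (lowercased, punctuation ignored)."""
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def copied_fraction(candidate, source, k=5):
    """Fraction of the candidate's shingles that also occur in the source."""
    cand = shingles(candidate, k)
    if not cand:
        # Too short to form a single shingle: treat as no measurable copying.
        return 0.0
    return len(cand & shingles(source, k)) / len(cand)


def duplication_probability(fraction, midpoint=0.5, steepness=10.0):
    """Map a copied fraction to a score in (0, 1) with a logistic curve.

    midpoint and steepness are hypothetical tuning knobs: at the midpoint
    the score is 0.5, and a higher steepness approaches a hard threshold.
    """
    return 1.0 / (1.0 + math.exp(-steepness * (fraction - midpoint)))
```

A page that quotes a paragraph and then adds a comparable amount of its own text lands near the midpoint, while a wholesale copy scores close to 1. A hard cutoff is just the limiting case of a very large steepness.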