We propose a method for finding seeds for the local alignment of two
nucleotide sequences. Our method uses randomized algorithms to find
approximate seeds. We present a dynamic index to store the
fingerprints of k-grams and a highly scalable and accurate (HSA)
algorithm to incorporate randomization into process of seed
generation. Experimental results show that our method produces better
quality seeds with improved running time and memory usage compared to
traditional non-spaced and spaced seeds. The presented algorithm
scales very well with higher seed lengths while maintaining the
quality and performance.