Skip to content

Latest commit

 

History

History
58 lines (43 loc) · 1.41 KB

README.md

File metadata and controls

58 lines (43 loc) · 1.41 KB

bioinformatics-algorithms

The algorithms implemented in the extent of CENG 4217 Bioinformatics Algorithms course.
Pseudocodes are as follows.

BetterFrequentWords(text, k)
  freqPatterns = an empty array
  freqMap = empty map
  n = len(text)
  
  for every integer i between 0 and n - k
    pattern = text[i, i+k]
    if freqMap[pattern] doesn’t exist
      freqMap[pattern] = 1
    else
      freqMap[pattern] = freqMap[pattern] + 1
      
  maxCount = MaxMap(freqMap)
  
  for all strings pattern in freqMap
    if freqMap[pattern] = maxCount
      freqPatterns = Append(freqPatterns, pattern)
      
  return freqPatterns
GibbsSampler(Dna, k, t, N)
  randomly select k-mers Motifs = (Motif1, ..., Motift ) from Dna
  BestMotifs ← Motifs
  
  for j ← 1 to N 
    i ← randomly generated integer between 1 and t
    Profile ← profile formed from all Motifs other than Motifi
    Motifi ← Profile-randomly generated k-mer in Dnai
    if Score(Motifs) < Score(BestMotifs)
      BestMotifs ← Motifs
      
 return BestMotifs
Lloyd Algorithm

Select k arbitrary data points as Centers and then 
iteratively perform the following steps:

• Centers to Clusters: Assign each data point to 
the cluster corresponding to its nearest center 
(ties are broken arbitrarily).

• Clusters to Centers: After the assignment of 
data points to k clusters, compute new centers as 
clusters’ center of gravity.