GENSCAN 1.0	Date run: 3-Nov-116	Time: 16:06:38

Sequence gi568815594r:9975381_10216234 : 240854 bp : 47.83% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    984    979    6                              -0.45
 1.03 Term -   1803   1714   90  0  0   50   49    97 0.225  -0.28
 1.02 Intr -   5323   5212  112  0  1   70  113   130 0.449  14.08
 1.01 Init -   6014   5551  464  2  2   63  -64  1055 0.540  82.97
 1.00 Prom -   8446   8407   40                              -4.16

 2.08 PlyA -   8916   8911    6                               1.05
 2.07 Term -  10169  10084   86  0  2   60   55    39 0.573  -4.38
 2.06 Intr -  10413  10289  125  2  2  115   78   160 0.790  17.93
 2.05 Intr -  28227  28068  160  1  1   41   61   112 0.050   2.85
 2.04 Intr -  38823  38809   15  1  0  131  100     2 0.156   1.22
 2.03 Intr -  41312  41240   73  2  1   75  103    29 0.902   2.08
 2.02 Intr -  43693  43595   99  1  0  140  117   163 0.968  24.71
 2.01 Init -  50586  50524   63  0  0   85   57    96 0.756   7.35
 2.00 Prom -  51286  51247   40                              -8.56

 3.00 Prom +  52908  52947   40                              -2.86
 3.01 Init +  59017  59074   58  0  1   78   81    40 0.321   3.78
 3.02 Intr +  76398  76537  140  1  2   98   53     4 0.004  -1.92
 3.03 Intr +  80479  80544   66  0  0  120   93   -22 0.007   0.60
 3.04 Term +  93853  93912   60  0  0   97   49   104 0.631   5.20
 3.05 PlyA +  94626  94631    6                               1.05

 4.22 PlyA -  94819  94814    6                               1.05
 4.21 Term - 100104  99998  107  1  2   88   55   161 0.998  11.37
 4.20 Intr - 102068 101924  145  2  1   67   57   108 0.921   5.46
 4.19 Intr - 102546 102373  174  0  0  106  109   432 0.999  47.24
 4.18 Intr - 103621 103511  111  1  0   98  103   210 0.998  24.08
 4.17 Intr - 106064 105977   88  1  1   81   91   114 0.596  10.97
 4.16 Intr - 107783 107642  142  0  1   45   94   287 0.810  24.41
 4.15 Intr - 109616 109463  154  2  1   32   15    84 0.109  -4.85
 4.14 Intr - 110434 110252  183  1  0   45  119    34 0.655   2.18
 4.13 Intr - 112560 112327  234  0  0   81  116   264 0.998  26.39
 4.12 Intr - 112993 112913   81  1  0   43   73   103 0.806   4.13
 4.11 Intr - 113361 113284   78  0  0   97  110    85 0.964  11.35
 4.10 Intr - 116110 115872  239  2  2   54   82    81 0.722   1.43
 4.09 Intr - 116759 116657  103  2  1   42   45   153 0.793   6.15
 4.08 Intr - 122511 122331  181  2  1   19   88   177 0.885  10.67
 4.07 Intr - 123759 123612  148  1  1  122   81   134 0.996  15.39
 4.06 Intr - 128606 128516   91  2  1   95   69   131 0.683  11.47
 4.05 Intr - 140854 140733  122  2  2  118   86   214 0.930  24.41
 4.04 Intr - 154696 154560  137  0  2   64    4    88 0.022  -1.79
 4.03 Intr - 156944 156795  150  1  0  106   60    68 0.638   5.08
 4.02 Intr - 158428 158298  131  1  2   53   95    45 0.112   1.19
 4.01 Init - 177943 177854   90  1  0   62   77    94 0.269   6.19
 4.00 Prom - 181421 181382   40                              -7.36

 5.11 PlyA - 181562 181557    6                               1.05
 5.10 Term - 191083 190919  165  1  0   68   49   128 0.876   4.82
 5.09 Intr - 192134 191731  404  0  2   56   24   233 0.809   7.85
 5.08 Intr - 193270 193165  106  1  1   67   29    99 0.951   1.69
 5.07 Intr - 195029 194840  190  1  1  -14   38   158 0.616   0.19
 5.06 Intr - 197027 196910  118  0  1   77   82   106 0.788   8.42
 5.05 Intr - 197532 197399  134  2  2  -26   74   110 0.317  -1.51
 5.04 Intr - 198774 198633  142  1  1    2   86   110 0.094   1.61
 5.03 Intr - 206608 206477  132  2  0  -26  100    96 0.074   0.02
 5.02 Intr - 210320 210201  120  0  0   55  113    52 0.736   4.87
 5.01 Init - 212607 212523   85  0  1   73   99    75 0.609   8.08
 5.00 Prom - 221332 221293   40                              -2.26

 6.00 Prom + 223427 223466   40                              -6.56
 6.01 Init + 225714 225743   30  2  0   88   84    39 0.046   1.76
 6.02 Intr + 231986 232043   58  1  1   98  100   -14 0.173  -0.74
 6.03 Intr + 234710 234972  263  0  2   79   56   125 0.308   5.61
 6.04 Term + 238929 239078  150  2  0   68   40   105 0.239   1.61
 6.05 PlyA + 239942 239947    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:9975381_10216234|GENSCAN_predicted_peptide_1|221_aa MCIGYYHCDHHHHHHHHHHHHTITITTIATITIITITTTTTTITIIIIATTIITVITTTI IIITIMTITTTTTITTITTTIITVTIAIITITTTTTITITTITTTIIITIMTITITTTIT TITTTVITITVIITINTITSTNTISCVLSSEMHLHEISPKEIRGSLGQVTAIFICIGVFT GQLLGLPELLGKQWDIIEDMMSASRRLLACSMEKGLDPSQD >gi568815594r:9975381_10216234|GENSCAN_predicted_CDS_1|666_bp atgtgtattggctattatcattgtgaccaccatcaccaccatcaccatcaccaccaccac cacaccatcactatcaccaccatcgccaccatcactatcataaccattactactaccacc accactatcaccatcatcatcattgccacaaccatcatcacagtcatcaccaccaccatc atcatcatcaccatcatgaccatcactaccaccaccaccatcactaccattaccaccacc atcatcaccgtcaccatcgccatcattaccatcactaccactaccaccatcaccattact accatcactactaccatcatcatcaccatcatgaccatcactatcaccaccaccatcact accatcacgaccaccgttatcaccatcaccgtcattatcaccatcaataccatcacctcc accaacaccatcagctgtgtcctgtcctctgaaatgcacctccatgagatctcacccaag gagatccgtggctctctggggcaggtgactgccatctttatctgcattggcgtgttcact gggcagcttctgggcctgcccgagctgctgggaaagcaatgggacatcattgaggacatg atgagtgcttcaagaaggcttttggcctgcagcatggagaaaggattggatcccagccag gactga >gi568815594r:9975381_10216234|GENSCAN_predicted_peptide_2|206_aa MKLSKKDRGEDEESDSAKKKLDWSCSLLVASLAGAFGSSFLYGYNLSVVNAPTPKNNFGV SEKDRRGQSLEVFNGKDHVGLETDLEDKRSVAKSIGANSFRGTFEKSVPAGGLGYAASLL APPESIEASVSHFPTSQKHTLLANNGFAISAALLMACSLQAGAFEMLIVGRFIMGIDGGT KDAGTLHKVGQSCGMNHPIHDALAPN >gi568815594r:9975381_10216234|GENSCAN_predicted_CDS_2|621_bp atgaagctcagtaaaaaggaccgaggagaagatgaagaaagtgattcagcgaaaaagaaa ttggactggtcctgctcgctcctcgtggcctccctcgcgggcgccttcggctcctccttc ctctacggctacaacctgtcggtggtgaatgcccccaccccgaaaaacaactttggtgtc agtgagaaggacaggaggggccagagcttagaggtatttaatgggaaagaccatgtgggc ttagaaacggacctggaagacaaacgttcagttgccaaaagcattggggcaaacagcttt cggggaacttttgagaaaagtgttcctgcagggggcctaggctatgcagcaagtcttctg gcccctccagagtctatagaggcttctgtcagccatttcccaacgtcacagaagcacact ttgctggccaataatgggtttgcaatttctgctgcattgctgatggcctgctcgctccag 
gcaggagcctttgaaatgctcatcgtgggacgcttcatcatgggcatagatggagggacc aaggatgctggaactctgcataaagtgggacagtcctgtgggatgaaccatcccattcat gatgctctagcccctaattga >gi568815594r:9975381_10216234|GENSCAN_predicted_peptide_3|107_aa MESVSGKAKATAAGFALGDALWVPPQNRSRGCLFSLLYSQPAKMPSHQLSQPPPLSPANR SHSATKLPCKKSFSPPAMILRPPQPCGTNADAILEGELPKWDVEAAA >gi568815594r:9975381_10216234|GENSCAN_predicted_CDS_3|324_bp atggaatcagtttctggcaaagccaaggccacagccgcagggtttgccttgggagatgcg ctatgggttccaccacaaaacagatcccgcggctgcctgttttcattactctactctcag ccagccaagatgccttcccaccagctctcacagcctccacccttgtcccccgcaaaccga tcgcactcagcaacaaagctgccatgtaagaagtccttttcacctcccgccatgattctg agacctccccagccatgtggaactaatgcagatgctatcctggagggcgagctgcccaaa tgggatgtggaggcagcagcatga >gi568815594r:9975381_10216234|GENSCAN_predicted_peptide_4|962_aa MTGSATQNGCILDPAQVSDMRIFVQEEIHMLPCLESPRVHLPQALIATVTVGQKSRFSSN YVPQFQNTFPCEAWMSRDECLLLKAVHTEDIQGFLPIAATLCLWDKLLESVTKGPSWSDS QDGGIYGFTEVKHSASVKSYNTRTGLTLASFLISTLESQWLPSTESLNRKKVFASLPQVE RGVSKIIGGDPKGNNFLYTNGKCVILRNIDNPALADIYTEHAHQVVVAKYAPSGFYIASG DVSGKLRIWDTTQKEHLLKYEYQPFAGKIKDIAWTEDSKRIAVVGEGREKFGAVFLWDSG SSVGEITGHNKVINSVDIKQSRPYRLATGSDDNCAAFFEGPPFKFKFTIGEAQAGLEEPV RSQRQGEAVLAKEAEFGSERTRCAGLVFSCEEFARRLSMAPRGVKRLWRALEPHLESWFS LSAAEWIGPSGETAFPNFTYLEVLENLEHRPFKQRRPALELVKKDHSRFVNCVRFSPDGN RFATASADGQIYIYDGKTGEKVCALGGSKAHDGGIYAISWSPDSTHLLSASGDKTSKIWD VSVNSVVSTFPMGSTVLDQQLGCLWQKDHLLSVSLSGYINYLDRNNPSKPLHVIKWDGAA GISPGCVLEPPGSSKKRWLLTPPEALSLQGSLRAAGACRPPEGTPVGIWAWLGVFLGQHS WAGREKKPHKPGPEANVASEVLLDRPPCGSPRRVLVGGVQATCLHTLETGENDSFAGKGH TNQVSRMTVDESGQLISCSMDDTVRYTSLMLRDYSGQGVVKLDVQPKCVAVGPGGYAVVV CIGQIVLLKDQRKCFSIDNPGYEPEVVAVHPGGDTVAIGGVDGNVRLYSILGTTLKDEGK LLEAKGPVTDVAYSHDGAFLAVCDASKVVTVFSVADGYSENNVFYGHHAKIVCLAWSPDN EHFASGGMDMMVYVWTLSDPETRVKIQDAHRLHHVSSLAWLDEHTLVTTSHDASVKEWTI TY >gi568815594r:9975381_10216234|GENSCAN_predicted_CDS_4|2889_bp atgactggctctgcaactcagaatggctgcatcctggacccagctcaagtgtcagacatg agaatctttgtccaggaagaaattcacatgctgccctgcctagagagcccccgtgtccac 
ctgcctcaggccctcattgccacagtcacagtaggccagaagtccaggttttccagcaat tatgttcctcaattccagaacacgttcccatgtgaggcctggatgtcgcgggatgagtgc ttactactgaaggctgttcacactgaggacattcaaggatttctcccaatagcagcgacc ctatgtttatgggataaactactggagtctgtcactaaaggtccttcctggtctgactca caggatggggggatctacggttttacagaagtgaagcactcagccagtgtcaagtcttat aatacccggactggtctgacactggccagtttcctcatctctacactggaatcacagtgg ctgcccagcacggagtcactcaaccgaaagaaggtgttcgccagcctcccgcaggtggag aggggcgtctccaagatcatcggcggcgaccctaagggcaacaattttctgtacaccaat ggaaagtgcgtcatcctaaggaacatcgacaacccagcccttgctgacatctacacagag cacgcccatcaggtggtggtggccaagtatgcgcccagcggattctacattgcctccgga gatgtgtctgggaagctgaggatctgggataccacgcagaaggagcacctgttgaagtat gagtaccagcctttcgctgggaagatcaaagacattgcttggactgaagacagtaagagg atcgccgtggtcggggaaggaagggagaagtttggagcagtcttcctctgggatagtggc tcttctgtgggcgagattacaggacacaacaaagtcatcaacagcgtggacatcaagcag agccggccataccggctggccacgggaagcgatgataactgcgcggcattctttgaggga cccccattcaagttcaagttcacaattggcgaagctcaggctggcttggaggagccagtg cgaagtcagcggcaaggcgaggctgtgctggccaaggaagcggagttcggcagcgagaga actcgatgtgctggcctggtgttctcctgtgaagaatttgcccgcaggctctccatggcc cctcggggggtaaagcgtctctggagggccctggagccccatctggaaagttggttttca ctctcagcagctgagtggatcgggccttctggggaaactgctttccccaacttcacatac ctagaagtgctagaaaacttagaacacagaccatttaagcaaagaagaccagccttagaa ctagtgaaaaaggaccacagccgctttgtcaactgtgtgcgattctctcctgatgggaac agatttgccacagccagtgctgacggccagatatacatctatgacgggaagactggggag aaggtgtgcgcgctgggcggaagcaaggcccacgacggtgggatttacgcaattagttgg agtcccgacagcacccatttgctttctgcttctggggacaaaacttccaagatttgggac gtcagcgtgaactccgtggtcagcacatttcccatgggctccacggttctggaccagcag ctgggctgcctatggcagaaggaccacctgctcagtgtctccctgtccgggtacatcaac tatctggacagaaacaaccccagcaagcccctgcacgtcatcaagtgggatggagcagca ggtatcagcccaggctgcgtgttagaaccacctgggagctctaaaaagcgctggctgctg acgcccccagaggcactaagtttacagggaagtctcagggcagcaggagcttgcaggcct ccagaaggaacacccgtgggcatctgggcatggctaggagtgtttctgggccagcactcc tgggcagggagagagaagaaacctcacaaaccagggcctgaagcgaatgtggccagcgaa 
gtcctcctggaccgacccccttgtggctccccgcgacgggtgctggtcggcggggtgcag gccacgtgtctgcacacgttagagacgggggagaacgactccttcgctgggaaaggccac acgaaccaggtgtccaggatgaccgtggatgagtcggggcagctcatcagctgcagcatg gacgacaccgtgcggtacaccagcctcatgctgcgggactacagcggacaaggagttgtg aaactggacgttcagccaaagtgcgtagccgtcggccccgggggatacgccgtggtcgtg tgcattggacagattgtcctgctgaaggatcagaggaagtgcttcagcatcgacaacccc ggctacgagcccgaagttgtggcagtgcaccccggcggggacacggtggcaattgggggt gtggacggcaacgtccgcctgtattccatcctgggcaccacgctgaaggatgagggcaag ctcctagaggccaagggccccgtgaccgacgtggcctactcccacgacggcgccttcctc gcggtgtgcgacgccagcaaggtggtcacagtgttcagcgttgctgacggctactcggag aacaatgttttttatggacaccatgcaaaaatcgtctgcctggcctggtccccagacaat gaacactttgcctccggtggcatggacatgatggtgtatgtttggaccctgagtgacccg gaaaccagagtcaagatccaagatgcacaccggctgcaccatgtcagcagcctggcctgg ctggacgagcacacgctggtcacgacctcccatgatgcctctgtcaaggagtggacaatc acctactga >gi568815594r:9975381_10216234|GENSCAN_predicted_peptide_5|531_aa MKLKLSDQAGSFDIHHVCHLKPEDSGNTAMTSGTDTGWLLRSILCCNICVVHNTWRFEEN AYRYTYQHENSDATQGPHQKSSGKHSCLEQFIPRRDRINSDYLVMQTELLFEGSPQPSWS WQAVPNGGARIPVERSLEQRLEYVELHWSTPEYSLSNPCGFCCSGYGVAVIAFDNANPEC QVALPLIKGKAHLVEYIKACDSIRASTRQSYAALKLRAFCLGNPRKRCQQERTPACGDSR ITSRYLKPYEPDAEEEIWEGTRGPPIAAMSRLTVRKTPVPASNTHQAQPLTWGHIKKLTK MGEENLKSVAYRCGSRILREVAHHDKAAYAFIALQKGKGGLAGHSVGPVPTIRAEQRPKV RGIHICTFAYPHQKDAPSRAETGIQTSQRLSEVDKGNIPVKFRADSSSSEELPALEEPAS LKRIYTIVVTTFTSDFAGLVGIESQPGPGHQRQCNRHKRPMELGGVGSTGEVSLQTEKLI DKDNTCSGTQTNIQDVPGSQSNSMKGPRLQESKALGGFFQRHCSCFVIGYI >gi568815594r:9975381_10216234|GENSCAN_predicted_CDS_5|1596_bp atgaagctgaaactttctgaccaagctggcagcttcgacatccatcatgtatgtcatttg aaacctgaggattctgggaacacagcaatgacttctgggacagatacaggttggctttta cggtctatcttatgctgtaacatctgcgtggtacacaatacctggaggtttgaagagaat gcctacagatacacctaccagcacgagaacagtgatgcaacacaaggacctcatcagaag agttcaggaaagcattcctgcctggagcagttcataccacgcagggatcgaatcaactct gactacctcgtcatgcaaactgagttgctctttgaagggtctccacaaccgagctggtct tggcaagcagtgcccaatgggggggctcgaattccagtcgaaaggtcactggagcaacgg 
ctggagtacgtggaactacactggagcacacccgagtactctttaagcaatccctgtgga ttctgctgctcaggatatggtgttgcagttattgcttttgacaatgccaatcctgagtgc caagttgctctgccacttattaaaggaaaagcacatttggtcgagtatattaaagcttgt gatagtatcagagccagtacccgccagagttatgctgcactaaagctgcgagccttctgc ctggggaacccccgcaaaaggtgtcaacaggagcggacccctgcctgcggggacagcaga attacttctaggtacctgaaaccttatgagccagatgctgaggaagagatttgggaagga acccgaggacccccgattgcagccatgtcgagactgacagtaaggaagacaccagtcccg gcaagcaacactcaccaggcacagccactcacctggggccacatcaagaagttgacaaag atgggagaagaaaatctgaagtctgtagcctacagatgcggatcccgaatcttgcgagaa gtagcccaccatgacaaagctgcctatgcctttatcgctttacaaaaaggaaaagggggc cttgctggacattctgtaggcccagtgccaactattagagctgagcaaaggccaaaagtt agaggcatacatatctgcacctttgcctacccacatcaaaaggatgctcccagcagagct gagactggaatccagacatcacaaaggttgtctgaagtggacaagggaaacatccctgtg aaattcagagcagattcatcttcctcagaggaactgcctgccctggaagagcctgcttct ctcaagaggatttatacaatcgtggtgacaactttcacctcagactttgctggccttgtg gggatagaatcacaaccagggccaggacaccagaggcagtgcaatcgccacaagaggcct atggagttgggtggtgtgggatccacgggagaagtatccctgcagacagagaaattgata gacaaagacaacacctgcagcggcactcagactaacattcaagatgtccctggatcccaa agcaactccatgaagggaccaagactacaggaatccaaggctcttggaggattctttcaa cgtcattgctcctgctttgtcattggctatatctga >gi568815594r:9975381_10216234|GENSCAN_predicted_peptide_6|166_aa MVEMALSCLLVFSRVLPTFLLPRSDLSDTRTKVLMADLSELDVQPSLSELKRQSFDKSAL GGDTPSPPGDWVQKSSASSDSGSVYDGPSGHHGVCWASTGVATTKVLELCLASSSAHIPT KGSSVYELQVWTKAQPCMTVDLVLLSIRDGEQLAAMELVPLRGSPE >gi568815594r:9975381_10216234|GENSCAN_predicted_CDS_6|501_bp atggtagagatggccctcagctgccttctggtgttctccagggtacttcctactttcctg ttgcccaggtctgacttaagtgataccaggactaaggttctcatggctgatctgagtgaa ctcgacgtacagcccagcctttctgagctgaagagacagtcatttgataagagtgcactt ggaggtgataccccatcacccccaggagactgggtccagaagtcttcagcttcctctgac agtggaagtgtgtatgatgggccctctgggcaccacggtgtctgctgggcaagcactgga gtggccacaacaaaagtccttgagctgtgcttggcatcttcctctgctcatatacctacc aagggaagctcagtttatgagctgcaggtttggacaaaggcccaaccgtgtatgactgtg gacttggttctgctgtccatacgagatggtgaacagctggcagcaatggagctggttccc 
ctcaggggttcccctgagtga