GENSCAN 1.0 Date run: 5-Nov-116 Time: 15:08:49 Sequence gi568815591r:140354520_140579344 : 224825 bp : 47.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 4266 4121 146 0 2 97 67 180 0.729 15.88 1.03 Intr - 15163 15071 93 1 0 93 103 38 0.971 5.96 1.02 Intr - 25871 25763 109 1 1 94 99 54 0.986 7.39 1.01 Init - 28007 27919 89 2 2 87 115 -45 0.644 -1.67 1.00 Prom - 28651 28612 40 -5.06 2.00 Prom + 40886 40925 40 -2.96 2.01 Init + 42283 42308 26 0 2 63 80 13 0.282 -3.79 2.02 Intr + 43786 43996 211 1 1 45 54 137 0.103 4.92 2.03 Intr + 49549 49698 150 0 0 67 115 50 0.105 5.96 2.04 Intr + 53116 53328 213 0 0 48 77 159 0.246 9.71 2.05 Intr + 57355 57538 184 0 1 115 52 306 0.697 29.06 2.06 Term + 71363 71631 269 0 2 86 44 373 0.553 28.36 2.07 PlyA + 76180 76185 6 1.05 3.02 PlyA - 76640 76635 6 -0.45 3.01 Sngl - 81136 80795 342 1 0 62 54 211 0.817 11.30 3.00 Prom - 85395 85356 40 -4.66 4.11 PlyA - 86299 86294 6 1.05 4.10 Term - 92314 92202 113 2 2 51 49 167 0.411 7.82 4.09 Intr - 100210 100084 127 1 1 97 75 183 0.203 18.15 4.08 Intr - 100714 100576 139 0 1 68 60 70 0.989 2.67 4.07 Intr - 101381 101271 111 1 0 54 64 104 0.894 4.09 4.06 Intr - 102347 102133 215 2 2 82 50 315 0.996 24.61 4.05 Intr - 104714 104488 227 0 2 64 106 218 0.805 18.90 4.04 Intr - 105417 105188 230 2 2 80 86 103 0.930 6.81 4.03 Intr - 117492 117364 129 2 0 106 110 61 0.994 9.91 4.02 Intr - 124467 124408 60 2 0 70 62 78 0.614 1.25 4.01 Init - 124825 124641 185 1 2 79 91 162 0.964 12.29 4.00 Prom - 126428 126389 40 -7.66 5.00 Prom + 134078 134117 40 -1.66 5.01 Init + 141219 141338 120 2 0 83 81 105 0.963 8.88 5.02 Term + 152566 152637 72 0 0 116 49 41 0.625 0.91 5.03 PlyA + 156539 156544 6 1.05 6.17 PlyA - 157365 157360 6 1.05 6.16 Term - 162994 162894 101 2 2 45 50 67 0.222 -3.11 6.15 Intr - 165199 165113 87 2 0 90 82 98 0.998 9.34 6.14 Intr - 167581 167336 246 2 0 103 100 427 0.992 42.83 6.13 Intr - 168905 168788 118 2 1 107 127 158 0.998 21.64 6.12 Intr - 171273 171232 42 0 0 110 109 67 0.997 9.44 6.11 Intr - 172976 172799 178 1 1 93 83 316 0.566 31.52 6.10 Intr - 179328 179280 49 1 1 98 92 -16 0.061 -2.46 6.09 Intr - 185772 185579 194 2 2 39 71 145 0.287 7.04 6.08 Intr - 190247 190099 149 2 2 122 80 127 0.416 14.33 6.07 Intr - 192420 192280 141 0 0 93 105 79 0.973 10.65 6.06 Intr - 203693 203624 70 1 1 70 105 25 0.565 1.58 6.05 Intr - 205298 205189 110 2 2 96 96 64 0.562 7.08 6.04 Intr - 212754 212567 188 1 2 102 87 303 0.997 31.01 6.03 Intr - 214294 214244 51 2 0 106 100 65 0.969 8.38 6.02 Intr - 215219 215126 94 2 1 80 70 113 0.999 8.24 6.01 Intr - 219489 219289 201 0 0 118 90 78 0.967 10.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:140354520_140579344|GENSCAN_predicted_peptide_1|146_aa MAWPNVFQRGSLLSQFSHHHVVVFLLTFFSYSLLHASRKTFSNVKVSISEQWTPSAFNTS VELPVEIWSSNHLFPSAEKATLFLGTLDTIFLFSYAVVFVFGALTEWLRFYNKWLYCCLW IVNGLLQSTGWPCVVAVMGNWFGKAG >gi568815591r:140354520_140579344|GENSCAN_predicted_CDS_1|438_bp atggcctggccaaatgtttttcaaagagggtctctgctgtcccagttcagccatcatcat gttgtagtgttcctgctcactttcttcagttattcgttgctccatgcttcacgaaaaaca tttagcaatgtcaaagtcagtatctctgagcagtggaccccaagtgcttttaacacgtca gttgagctgcctgtggagatctggagcagcaaccatttgttccccagtgcagagaaagcg actcttttcctcggcacactggataccattttcctcttctcctatgctgtggtgtttgtc tttggtgcgctcacagaatggctgcgtttctacaacaaatggctgtactgctgcctgtgg attgtgaacggcctgctgcagtccactggttggccctgtgtggttgctgttatgggcaac tggtttgggaaagccggn >gi568815591r:140354520_140579344|GENSCAN_predicted_peptide_2|350_aa MVSFKRQARGAATYISPPPSRPGKPLGLDPPPPQQVFGPTHHRLRTRDPQWSPLSSPGSA RRVSLAPLPASPAASSPLADAKPDTWGQNYTSHQAARQEEEGGTGGLKSKPRRKRKHYLL TSRGERARQVARTMHFSSSARAADENFDYLFKIILIGDSNVGKTCVVQHFKSGVYTETQQ NTIGVDFTVRSLDIDGKKVKMQVWDTAGQERFRTITQSYYRSAHAAIIAYDLTRRSTFES IPHWIHEIEKYGAANVVIMLIGNKCDLWEKRHVLFEDACTLAEKYGLLAVLETSAKESKN IEEVFVLMAKELIARNSLHLYGESALNGLPLDSSPVLMAQGPSEKTHCTC >gi568815591r:140354520_140579344|GENSCAN_predicted_CDS_2|1053_bp atggtgtcattcaagagacaagccaggggggccgccacctacatctccccaccgccctcc cgacccgggaagcccctcggattggacccgcccccgccccagcaggtcttcggccccacg catcaccgcttgcgcacccgagatccgcagtggtcgccgctctccagccccggctccgct cgccgggtcagcttggctccgctgcccgcctcccccgccgccagctcacccctggcggac gccaaacccgacacctggggccagaactacacttcccaccaagcagcgaggcaggaggaa gaagggggcactggggggctgaaaagcaaacccagaagaaagcgaaaacattacctgctg accagcagaggtgagagagcgagacaggtggcaagaaccatgcacttctccagctcagcc agggcagcagatgagaactttgactatttgttcaagattatcctcattggggattccaat gtggggaagacgtgtgtggtgcagcatttcaagtctggagtctacactgagacacagcag aacacgattggagtggactttaccgtgcgttcccttgatattgacggcaaaaaagtgaag atgcaggtgtgggacacagctggccaggagcgcttccgcaccatcacccaaagctactac cgcagtgcccacgcagccatcatcgcctatgacctcacccggcggtccacgttcgagtcc atccctcactggattcatgagatagagaaatatggagctgcaaatgtggtcattatgctg attggaaataaatgtgacctctgggaaaagcggcacgtcctgttcgaggatgcctgcaca ctggctgagaagtacggcctcctggccgttttggagacatctgccaaggagtcaaagaac atagaagaagtcttcgtgctcatggccaaggagctgatcgcgcgcaacagcctgcaccta tatggggagagtgccctgaacggcctccccctggactccagccccgttcttatggcccag ggtccaagtgaaaagacccactgcacttgctaa >gi568815591r:140354520_140579344|GENSCAN_predicted_peptide_3|113_aa MGHSGGSWGTGSSALRGGFTLVGKDAGAQRVLAGHREGPRGPAGEALLPKACEGSSVVPR PGTIQGDFSAHISRNFFQASNSVEGARRWIPLWLPSRDLVSWARRQHSRSHPA >gi568815591r:140354520_140579344|GENSCAN_predicted_CDS_3|342_bp atggggcacagcggcggctcgtgggggacgggatccagcgctttgagaggtggcttcacg ctggtggggaaggacgcaggcgcacagcgcgtccttgccgggcaccgcgagggcccgcga gggcccgcgggggaagccctgctacccaaggcctgcgaagggtccagtgtggtcccgcgc cccgggaccatacagggagatttcagcgcccacatcagcaggaacttcttccaggcaagc aactccgtggaaggggctcggagatggatcccgctgtggctcccgagcagggacctggtg agctgggcacggcgccagcacagccgcagccacccagcctga >gi568815591r:140354520_140579344|GENSCAN_predicted_peptide_4|511_aa MAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVT CSHCVNPRWAAEPSALSAEAWRYFMHGVCKEGDNCRYSHDLSDSPYSVVCKYFQRGYCIY GDRCRYEHSKPLKQEEATATELTTKSSLAASSSLSSIVGPLVEMNTGEAESRNSNFATVG AGSEDWVNAIEFVPGQPYCGRTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAAVG ECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMV CGICMEVVYEKANPSERRFGILSNCNHTYCLKCIRKWRSAKQFESKIIKSCPECRITSNF VIPSEYWVEEKEEKQKLILKYKEAMSNKACRYFDEGRGSCPFGGNCFYKHAYPDGRREEP QRQKVGTSSRYRAQRRNHFWELIEERENSNPFDNDEEEVVTFELGEMLLMLLAAAFISTS FLSMDGQCFIAGVLNPRTVYKYQYEYQSVAC >gi568815591r:140354520_140579344|GENSCAN_predicted_CDS_4|1536_bp atggcggaggctgcaactcccggaacaacagccacaacatcaggagcaggagcggcagcg gcgacggcggcagcagcctcccccaccccgatccccacagtcaccgccccgtccctgggg gcgggcggagggggcggcggcagcgacggcagcggcggcggctggactaaacaggtcacc tgcagtcactgcgtcaacccccgctgggccgcggagccctcagcgctgtccgcggaggcc tggcggtattttatgcatggggtttgtaaggaaggagacaactgtcgctactcgcatgac ctctctgacagtccgtatagtgtagtgtgcaagtattttcagcgagggtactgtatttat ggagaccgctgcagatatgaacatagcaaaccattgaaacaggaagaagcaactgctaca gagctaactacaaagtcatcccttgctgcttcctcaagtctctcatcgatagttggacca cttgttgaaatgaatacaggcgaagctgagtcaagaaattcaaactttgcaactgtagga gcaggttcagaggactgggtgaatgctattgagtttgttcctgggcaaccctactgtggc cgtactgcgccttcctgcactgaagcacccctgcagggctcagtgaccaaggaagaatca gagaaagagcaaaccgccgtggagacaaagaagcagctgtgcccctatgctgcagtggga gagtgccgatacggggagaactgtgtgtatctccacggagattcttgtgacatgtgtggg ctgcaggtcctgcatccaatggatgctgcccagagatcgcagcatatcaaatcgtgcatt gaggcccatgagaaggacatggagctctcatttgccgtgcagcgcagcaaggacatggtg tgtgggatctgcatggaggtggtctatgagaaagccaaccccagtgagcgccgcttcggg atcctctccaactgcaaccacacctactgtctcaagtgcattcgcaagtggaggagtgct aagcaatttgagagcaagatcataaagtcctgcccagaatgccggatcacatctaacttt gtcattccaagtgagtactgggtggaggagaaagaagagaagcagaaactcattctgaaa tacaaggaggcaatgagcaacaaggcgtgcaggtattttgatgaaggacgtgggagctgc ccatttggagggaactgtttttacaagcatgcgtaccctgatggccgtagagaggagcca cagagacagaaagtgggaacatcaagcagataccgggcccaacgaaggaaccacttctgg gaactcattgaggaaagagagaacagcaacccctttgacaacgatgaagaagaggttgtc acctttgagctgggcgagatgttgcttatgcttttggctgcagcatttatcagtacttca tttctttcaatggatggacagtgttttattgcaggggtcctcaaccctcggaccgtgtac aagtaccagtacgagtaccaatccgtggcctgttag >gi568815591r:140354520_140579344|GENSCAN_predicted_peptide_5|63_aa MREERTGQREQQVWPGGWALLGVIQDLLTPLWLDPGSQAKAPPCFAQGSTSPLSPESDNV NIS >gi568815591r:140354520_140579344|GENSCAN_predicted_CDS_5|192_bp atgagggaagagcgcacaggccagagggaacagcaggtatggcccggaggctgggccctg cttggtgtgatccaggacctgctgacgccactgtggctggacccagggagccaggccaag gcacccccttgttttgcacagggctccacctcccccctgtctcctgaatctgacaatgtc aacatcagctga >gi568815591r:140354520_140579344|GENSCAN_predicted_peptide_6|672_aa QSLSKPAFFRQNSERRNFKLLDTRKLSRDGTGSPSKISPPSTPSSPDDIFFNLGDPQNGR KKRKIPKLVLRINAIYEVRRGKKRVKRLSQSMESNSGKVTDENSESDSDTEEKLKAHSQR LVNVKSRLKQAPRYPSLARELIEYQERQLFEYFVVVSLHKKQAGAAYVPELTQQFPLKLE RSFKFMREAEDQLKAIPQFCFPDAKDWVPVQQFTSETFSFVLTGEDGSRRFGYCRRLLIL DEVEKRRGISPALVQPLMRSVMEAPFPALGKTILVKNFLPGSGTEVIELCRPLDSRLEHV DFESLFSSLSVRHLVCVFASLLLERRVIFIADKLRGFLLQQLPNVRRCEDGISVGEEKAA GTWIIVVPSLEREPGHGTVMVCECEKDRHWVFRPFTAFRAPCAPWAPANEVFMITSILSK CCHAMVALIYPFAWQHTYIPVLPPAMVDIVCSPTPFLIGLLSSSLPLLRELPLEEVLVVD LVNSRFLRQMDDEDSILPRKLQVALEHILEQRNELACEQDEGPLDGRHGPESSPLNEVVS EAFVRFFVEIVGHYSLFLTSGEREERTLQREAFRKAVSSKSLRHFLEVFMETQMFRGFIQ ERELRRQDAKGLFEVRAQEYLETLPSGEHSGVNKFLKGLALTSDPHRNTKGRGADLNNGP EGPYCLAGVLNL >gi568815591r:140354520_140579344|GENSCAN_predicted_CDS_6|2019_bp cagtcattgtccaaacctgcttttttccgacaaaattcagagaggaggaacttcaagctg ctggacactaggaagctgagtcgggatggaactgggtccccttccaaaatcagccctccc tccactcccagcagccctgatgacattttctttaaccttggagacccacagaacggcagg aagaagagaaagatacccaagctggtgttgcgaatcaacgccatttatgaggtccggaga ggaaagaaacgggtgaagaggctgtcccagtcaatggagagcaactcaggaaaagtgaca gatgagaacagtgagtctgacagtgacacagaggagaagctgaaagctcacagccagcgc ctggtcaacgtgaagtcccggctgaagcaggcgcctcggtacccatcacttgcccgggaa ctcatcgagtaccaggagaggcagctcttcgagtactttgtggttgtgtctttgcacaag aagcaggccggggctgcctacgtgccagaactcacccaacagttccctctgaagttggaa aggtctttcaagttcatgagagaagctgaggaccaactgaaggccattccccagttctgt tttcccgatgccaaggattgggttcctgtccagcagttcaccagtgaaacattctcattt gtcttaactggagaagatgggagcagaaggttcggttactgccgaagactgctgatcttg gatgaggtggaaaaaagacgaggcatctctcctgccctggttcagccactcatgagaagt gtcatggaagcccctttcccagccctgggcaaaaccatccttgtcaagaacttcctgcca ggttcaggaactgaggtgatcgaactgtgccgcccgctggactcccggctcgagcacgtg gactttgagtctctcttctcctccctcagcgtccgccacctggtctgtgtgtttgcctcc ctgcttctggagaggagggtcatcttcattgcagacaagctcaggggttttcttctgcag cagctgccaaatgtcaggcgctgtgaggatgggatttctgtgggagaagagaaggcagca ggaacgtggataattgtggtgccctccttggagcgagagccaggccatggtacagtaatg gtgtgcgagtgtgagaaagaccgtcattgggtttttagaccctttacagctttcagagca ccctgtgctccatgggctccagctaacgaggtattcatgataaccagcatcctgtccaag tgctgccacgcgatggtggcgctgatctaccccttcgcctggcagcacacctacatcccg gtgctgccacccgccatggtcgacatcgtgtgctcgccgacgcccttcctcatcgggctg ctctccagctcgctgccactgctcagggagctgccgctggaagaggtccttgtggttgac ctcgtcaacagccggttcctcagacagatggacgatgaggactccatcctgccccggaag cttcaggtggccctggaacacattctggaacagaggaacgagctggcttgtgagcaggac gaagggcccctagacggcaggcacggtccagagtccagccccttgaacgaggtggtgtct gaagcctttgtccgcttcttcgtggagattgtgggacactactctttgttcctgacgtcg ggcgagcgtgaggagagaaccctgcagcgggaggccttccgcaaagctgtctcctccaag agcctccgccacttcctggaggtcttcatggagactcagatgtttcggggcttcatccag gagcgggagctgcgccggcaggatgccaaaggtctgtttgaggtccgagcccaagagtat ctggaaacactccccagtggagagcacagcggtgtcaataagttcctgaagggactagct ctcaccagcgaccctcacagaaacacaaaagggaggggcgccgacctcaacaatggccca gaggggccatactgcctggcaggggttctcaacctttag