GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:50:20 Sequence gi568815597r:193080297_193281562 : 201266 bp : 36.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1068 1184 117 2 0 117 81 106 0.992 12.54 1.02 Intr + 1890 2003 114 2 0 71 110 84 0.996 8.62 1.03 Intr + 2266 2412 147 0 0 49 93 119 0.945 8.01 1.04 Term + 4283 4435 153 1 0 96 41 161 0.999 9.14 1.05 PlyA + 5373 5378 6 1.05 2.06 PlyA - 5634 5629 6 1.05 2.05 Term - 14179 14118 62 2 2 53 48 59 0.196 -4.61 2.04 Intr - 16463 16352 112 2 1 111 69 9 0.443 0.33 2.03 Intr - 17464 17288 177 1 0 74 74 92 0.726 5.59 2.02 Intr - 20908 20845 64 0 1 82 89 93 0.539 6.60 2.01 Init - 25086 24968 119 0 2 71 55 138 0.549 6.53 2.00 Prom - 27747 27708 40 -7.65 3.00 Prom + 28850 28889 40 -6.45 3.01 Init + 33002 33241 240 1 0 82 11 132 0.273 2.82 3.02 Intr + 41710 42035 326 0 2 39 98 356 0.622 25.35 3.03 Intr + 44816 44921 106 2 1 68 87 23 0.613 -0.50 3.04 Intr + 49878 49947 70 2 1 58 67 95 0.882 2.24 3.05 Intr + 55095 55157 63 1 0 75 95 46 0.730 1.77 3.06 Intr + 55241 55293 53 0 2 82 88 116 0.971 8.71 3.07 Intr + 57789 57877 89 2 2 50 82 104 0.861 3.85 3.08 Intr + 61554 61770 217 0 1 4 86 293 0.840 18.28 3.09 Intr + 67571 67669 99 1 0 50 83 82 0.868 3.29 3.10 Term + 83944 84006 63 0 0 96 43 111 0.916 4.31 3.11 PlyA + 84288 84293 6 1.05 4.02 PlyA - 85960 85955 6 1.05 4.01 Sngl - 101227 99998 1230 1 0 94 43 416 0.550 33.25 4.00 Prom - 122485 122446 40 -3.35 5.00 Prom + 130282 130321 40 -3.65 5.01 Init + 136124 136234 111 1 0 58 41 130 0.151 5.56 5.02 Term + 136851 136922 72 2 0 48 50 95 0.108 -1.37 5.03 PlyA + 137640 137645 6 1.05 6.00 Prom + 138496 138535 40 -1.95 6.01 Init + 139297 139438 142 0 1 95 6 110 0.029 3.84 6.02 Intr + 150406 150475 70 2 1 52 98 7 0.012 -4.58 6.03 Intr + 152697 152858 162 0 0 84 99 116 0.917 10.47 6.04 Intr + 155960 156060 101 2 2 106 69 39 0.860 2.63 6.05 Intr + 161280 161652 373 1 1 19 74 199 0.346 4.80 6.06 Intr + 167110 167246 137 1 2 136 55 -7 0.443 0.09 6.07 Intr + 169434 169575 142 1 1 89 91 128 0.619 11.69 6.08 Term + 182830 182872 43 1 1 70 47 48 0.008 -5.45 6.09 PlyA + 183187 183192 6 1.05 7.03 PlyA - 183421 183416 6 1.05 7.02 Term - 192730 192131 600 1 0 13 36 281 0.806 8.94 7.01 Init - 194724 194590 135 0 0 62 91 48 0.677 2.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 139297 139644 348 0 0 95 44 110 0.888 3.09 S.002 Sngl - 145829 145620 210 2 0 44 53 173 0.849 4.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:193080297_193281562|GENSCAN_predicted_peptide_1|176_aa TVEPTGKRFLLAVDVSASMNQRVLGSILNASTVAAAMCMVVTRTEKDSYVVAFSDEMVPC PVTTDMTLQQVLMAMSQIPAGGTDCSLPMIWAQKTNTPADVFIVFTDNETFAGGVHPAIA LREYRKKMDIPAKLIVCGMTSNGFTIADPDDRGMLDMCGFDTGALDVIRNFTLDMI >gi568815597r:193080297_193281562|GENSCAN_predicted_CDS_1|531_bp acagttgaaccaactggaaaacgtttcttactagctgttgatgtcagtgcttctatgaac caaagagttttgggtagtatactcaacgctagtacagttgctgcagcaatgtgcatggtt gtcacacgaacagaaaaagattcttatgtagttgctttttccgatgaaatggtaccatgt ccagtgactacagatatgaccttacaacaggttttaatggctatgagtcagatcccagca ggtggaactgattgctctcttccaatgatctgggctcagaagacaaacacacctgctgat gtcttcattgtattcactgataatgagacctttgctggaggtgtccatcctgctattgct ctgagggagtatcgaaagaaaatggatattccagctaaattgattgtttgtggaatgaca tcaaatggtttcaccattgcagacccagatgatagaggcatgttggatatgtgcggcttt gatactggagctctggatgtaattcgaaatttcacattagatatgatttaa >gi568815597r:193080297_193281562|GENSCAN_predicted_peptide_2|177_aa MIWRRAALAGTRLVWSRSGSAGWLDRAAGAAGAAAAAASGMESNTSSSLENLATAPVNQI QETISDNCVVIFSKTSCSYCTMAKKLFHDMNVNYKVVELDLLEYGNQFQDALYKMTGERT VPRIFVNGTFIGGATDTHRLHKEGKLLPLVHQCYLKKSIKGRDKLLQGIETLTSTRL >gi568815597r:193080297_193281562|GENSCAN_predicted_CDS_2|534_bp atgatttggcgccgcgcggcgctggcggggacgcggctggtttggagcaggagcggctcg gcaggctggcttgacagggcggcgggagctgcgggagctgcggcagctgcggcctctggg atggagagcaatacatcatcatctttggagaatttagcgacggcgcctgtgaaccagatc caagaaacaatttctgataattgtgtggtgattttctcaaaaacatcctgttcttactgt acaatggcaaaaaagcttttccatgacatgaatgttaactataaagtggtggaactggac ctgcttgaatatggaaaccagttccaagatgctctttacaaaatgactggtgaaagaact gttccaagaatatttgtcaatggtacttttattggaggtgcaactgacactcataggctt cacaaagaaggaaaattgctcccactagttcatcagtgttatttaaaaaaaagtattaag ggtagagacaagttacttcaaggtatagaaacactgaccagtactcgactgtga >gi568815597r:193080297_193281562|GENSCAN_predicted_peptide_3|441_aa MGPGSPLCKHRELPETPPLWAGWLEFLQGLLPMWLSQCCGYVHVFSYLCDELQQLLRISR KIALQFGILSKNMQECSGPVRGVLGGLGGYCPCCCRRRGRLLVLLLLVRRGGEGGGGRGR GDKRRRRQARRQRRRPEPAEARGGKMADVLSVLRQYNIQKKEIVVKGDEVIFGEFSWPKN VKTNYVVWGTGKEGQPREYYTLDSILFLLNNVHLSHPVYVRRAATENIPVVRRPDRKDLL GYLNGEASTSASIDRSAPLEIGLQRSTQVKRAADEVLAEAKKPRIEDEECVRLDKERLAA RLEGHKEGIVQTEQIRSLSEAMSVEKIAAIKAKIMAKKRSTIKTDLDDDITALKQRSFVD AEVDVTRDIVSRERVWRTRTTILQSTGKNFSKNIFAILQSVKAREEGRAPEQRPAPNAAP VIKGEITAGDEQQCGGFVAYG >gi568815597r:193080297_193281562|GENSCAN_predicted_CDS_3|1326_bp atggggccaggctccccgctctgcaaacatcgtgaacttcctgagactccacccctgtgg gcagggtggttggagtttcttcagggactccttcccatgtggctgtctcaatgctgtggc tacgttcatgtatttagttacctatgtgatgagctacagcaactactcagaatcagcaga aagatagctctgcaatttggcatcctcagcaagaatatgcaggaatgctcagggcctgtg cgcggggtcctcggcggcctgggtggctactgcccctgctgctgtcgtaggcgaggacgg ctgttagtgctgctgctgttggttcgtcgcggcggcgaaggaggaggaggaagagggcga ggcgacaagagaagaaggaggcaggcgcggcggcagcggcggcgccccgagccggcggag gcgaggggggggaagatggcggacgtgcttagcgtcctgcgacagtacaacatccagaag aaggagattgtggtgaagggagacgaagtgatcttcggggagttctcctggcccaagaat gtgaagaccaactatgttgtttgggggactggaaaggaaggccaacccagagagtactac acattggattccattttatttctacttaataacgtgcacctttctcatcctgtttatgtc cgacgtgcagctactgaaaatattcctgtggttagaagacctgatcgaaaagatctactt ggatatctcaatggtgaagcgtcaacatcggcaagtatagacagaagcgctcccttagaa ataggtcttcagcgatctactcaagtcaaacgagctgcagatgaagttttagcagaagca aagaaaccacgaattgaggatgaagagtgtgtgcgccttgataaagagagattggctgcc cgtttggagggtcacaaagaagggattgtacagactgaacagattaggtctttgtctgaa gctatgtcagtggaaaaaattgctgcaatcaaagccaaaattatggctaagaaaagatct actatcaagactgatctagatgatgacataactgcccttaaacagaggagttttgtggat gctgaggtagatgtgacccgagatattgtcagcagagagagagtatggaggacacgaaca actatcttacaaagcacaggaaagaatttttccaagaacatttttgcaattcttcaatct gtaaaagccagagaagaagggcgtgcacctgaacagcgacctgccccaaatgcagcacct gtgatcaaaggagaaataactgctggtgatgaacagcagtgtggaggttttgttgcctat ggttga >gi568815597r:193080297_193281562|GENSCAN_predicted_peptide_4|409_aa MTWNAKRSLFRTHLIGVLSLVFLFAMFLFFNHHDWLPGRAGFKENPVTYTFRGFRSTKSE TNHSSLRNIWKETVPQTLRPQTATNSNNTDLSPQGVTGLENTLSANGSIYNEKGTGHPNS YHFKYIINEPEKCQEKSPFLILLIAAEPGQIEARRAIRQTWGNESLAPGIQITRIFLLGL SIKLNGYLQRAILEESRQYHDIIQQEYLDTYYNLTIKTLMGMNWVATYCPHIPYVMKTDS DMFVNTEYLINKLLKPDLPPRHNYFTGYLMRGYAPNRNKDSKWYMPPDLYPSERYPVFCS GTGYVFSGDLAEKIFKVSLGIRRLHLEDVYVGICLAKLRIDPVPPPNEFVFNHWRVSYSS CKYSHLITSHQFQPSELIKYWNHLQQNKHNACANAAKEKAGRYRHRKLH >gi568815597r:193080297_193281562|GENSCAN_predicted_CDS_4|1230_bp atgacctggaatgccaaaaggtctctgttccgcactcatcttattggagtactttctcta gtgtttctttttgctatgtttttgtttttcaatcatcatgactggctgccaggcagagct ggattcaaagaaaaccctgtgacatacactttccgaggatttcggtcaacaaaaagtgag acaaaccacagctcccttcggaacatttggaaagaaacagtccctcaaaccctgaggcct caaacagcaactaactctaataacacagacctgtcaccacaaggagttacaggcctggag aatacacttagtgccaatggaagtatttacaatgaaaaaggtactggacatccaaattct taccatttcaaatatattattaatgagcctgaaaaatgccaagagaaaagtcctttttta atactactaatagctgcagagcctggacaaatagaagctagaagagctattcggcaaact tggggcaatgaaagtctagcacctggtattcaaatcacaagaatatttttgttgggctta agtattaagctaaatggctaccttcaacgtgcaatactggaagaaagcagacaatatcat gatataattcaacaggaatacttagatacgtactataatttgaccattaaaacactaatg ggcatgaactgggttgcaacatactgtccacatattccatatgttatgaaaactgacagt gacatgtttgtcaacactgaatatttaatcaataagttactgaagccagatctgcctccc agacataactatttcactggttacctaatgcgaggatatgcacccaatcgaaacaaagat agcaagtggtacatgccaccagacctctacccaagtgagcgttatcctgtcttctgttct ggaactggttatgttttttctggagatctggcagaaaagatttttaaagtttctttaggt atccgccgtttgcacttggaagatgtatatgtagggatctgtcttgccaagttgagaatt gatcctgtaccccctcccaatgagtttgtgttcaatcactggcgagtctcttattcgagc tgtaaatacagccacctaattacctctcatcagttccagcctagtgaactgataaaatac tggaaccatttacaacaaaataagcacaatgcctgtgccaacgcagcaaaagaaaaggca ggcaggtatcgccaccgtaaactacattag >gi568815597r:193080297_193281562|GENSCAN_predicted_peptide_5|60_aa MSSEFESVEKNPTNRKPPNPKPDRLTAEFYQMYKGELLLLKPLGEHTDGQAVELRLHGIV >gi568815597r:193080297_193281562|GENSCAN_predicted_CDS_5|183_bp atgagttccgaatttgaatcagtagaaaagaatcctaccaaccggaagccccccaacccc aaaccagacagattaacagctgaattctaccagatgtataaaggagagctgctgctgctg aaacctctaggggagcacaccgacgggcaggctgtggagctcagactccacggcattgtc tag >gi568815597r:193080297_193281562|GENSCAN_predicted_peptide_6|389_aa MDAAGSHYPNKLTEKQKSKYHMFYVGARHRVHVGHKEGNKRHQGLLEVVQYNGVAHNHCV LDLMNCSRVNRFVPSDEKKKQGCQRENETLIQRRKDQMQPGGTAISVTVPYRVVDQPLKL MPQDWDRVVAVFVQGPAWQFKGWPWLLPDGSPVDIFAKKVHTVTNCGFDGLDRTVRRPTG GVCGWVTVVVVVAGWLGVSSAPGKSAQVPTVEDVAGPPSGPKMCAQALLGRQKGRTEPSQ VGLSSGLQVAGRGGVFFRSLAVCYQHLGPAAGESSVVFSGNGQPLFFLRHSSIEIKPINN PTVTSKCLNERKSCTSLILNQKLDMIKLIKAFHLKYDEVRLDPNVQKWDVTVLELSYHKR HLDRPVFLRFWETLDSPSSCGNYALSKEF >gi568815597r:193080297_193281562|GENSCAN_predicted_CDS_6|1170_bp atggatgcagctggaagccattatcctaacaaattaacagagaaacagaaatccaaatac cacatgttctacgtgggagctagacatcgagtacatgttggacacaaagagggaaacaaa agacaccagggcctacttgaggtggtacaatataatggtgtagctcacaatcattgtgtc ttagatttgatgaactgcagtagggtgaacagatttgtcccatcagatgaaaagaagaaa caaggttgtcaacgagaaaatgaaactctaatacaaagaagaaaagaccagatgcaacca gggggcactgcaattagtgttacagtaccttatagagtagtagaccagccccttaaactt atgcctcaagactgggaccgcgttgtagccgtttttgtgcagggtcctgcatggcagttc aaaggttggccatggcttttgcctgatggatcaccagttgatatatttgctaaaaaggtc cacactgttaccaattgtggctttgatgggctggacaggacagtgcgcaggcccacaggg ggtgtgtgcgggtgggtaacagttgtggtggtagtggcaggttggttgggcgtgtcttct gcccctgggaaaagtgctcaggtgccaacagtggaggatgttgcagggccaccctcaggc cccaaaatgtgtgctcaggcactgctagggcggcagaaggggaggacagagcccagccaa gttggcctgtcctcaggcctccaggtcgcaggcaggggtggggtgttcttcaggtccttg gcagtgtgctaccagcacttgggtcctgctgctggggagagcagcgttgttttcagtggc aacggccagcctctctttttcctgagacacagcagtattgaaattaagcctattaataac cctacagtgacctccaagtgtttaaatgaaaggaagagttgcacatctctcattttaaac caaaagttagacatgattaagcttattaaagccttccatctgaagtatgatgaagttcgt ctggatccaaatgttcagaaatgggatgtaacagtattagaactcagctatcacaaacgt catttggatagaccagtgttcttacggttttgggaaacattggacagcccaagttcctgt ggcaactatgccctctccaaagagttctga >gi568815597r:193080297_193281562|GENSCAN_predicted_peptide_7|244_aa MQIQEAQRTSGKSITKRSSPRPTVIRLSKVKTKERILRAVRHKHQVENDGTLPNSFYEAS ITLIPKPGKDITKKENYRQISLMNIDAKILNKILANQIQQHIKKIIHHDQVGFIPGIQGW FNIHKSINVVYHINRIKNKNHMIISINAEKVFDKIQHPFMIKNLSKISIQGTYLNVVKAI YDKPTANIILNGEKLKAFPPRTGTIQGCPLPPLLFNTVLEVLARAIRQEKEIKGIQIGKE EVKL >gi568815597r:193080297_193281562|GENSCAN_predicted_CDS_7|735_bp atgcaaatacaagaagcacaaagaacatctgggaaatccatcacaaaaagatcatcacct aggcccactgtcatcagattatccaaagttaagacaaaggaaagaatcttaagagctgtg agacacaagcaccaggtagagaatgatggaaccctccctaattcattctatgaagccagc atcaccctaataccaaaaccaggaaaggatataaccaaaaaagaaaactacagacagata tccttgatgaacatagatgctaaaatccttaacaaaatactagctaaccaaatccaacaa catatcaaaaagataatccaccatgatcaagtgggtttcataccagggattcagggatgg tttaatatacacaagtcaataaatgtggtataccacataaacagaattaaaaacaaaaat cacatgatcatctcaataaatgcagaaaaagtatttgacaaaatccagcatccctttatg attaaaaatctcagcaaaatcagcatacaagggacatacctcaatgtagtaaaagccatc tatgacaaacccacagccaacataatactaaatggggaaaagttgaaagcattccctccg agaaccggaacaatacaaggatgcccactcccaccactcctcttcaacacagtactggaa gttctagccagagcaatcagacaagagaaagaaataaagggtatccaaattggtaaagag gaagtcaaactgtaa