GENSCAN 1.0    Date run: 6-Nov-116    Time: 05:42:54

Sequence gi568815594f:80931285_81153533 : 222249 bp : 36.77% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   9312   9191  122  2  2   54   86   126 0.423   8.22
 1.02 Intr -  13984  13884  101  1  2   84   70    75 0.788   3.29
 1.01 Init -  15046  14894  153  1  0   58   89    43 0.416   1.43
 1.00 Prom -  57822  57783   40                               -3.35

 2.03 PlyA -  57992  57987    6                                1.05
 2.02 Term -  65094  64616  479  1  2   34   38   235 0.245   7.12
 2.01 Init -  80189  80081  109  2  1   86   43   121 0.042   7.83
 2.00 Prom -  80740  80701   40                               -6.05

 3.03 PlyA -  82459  82454    6                                1.05
 3.02 Term -  85450  85143  308  2  2   32   54   163 0.526   1.59
 3.01 Init -  86052  85917  136  0  1   53   78   133 0.589   9.15
 3.00 Prom -  92642  92603   40                               -7.25

 4.00 Prom +  97827  97866   40                               -6.15
 4.01 Init + 100001 100316  316  1  1   90  109   465 0.950  44.34
 4.02 Intr + 100384 100500  117  2  0   68   69    56 0.512   1.22
 4.03 Term + 114691 115409  719  2  2   25   39   720 0.000  53.46
 4.04 PlyA + 115815 115820    6                                1.05

 5.00 Prom + 136840 136879   40                               -1.95
 5.01 Init + 143841 143949  109  2  1   44  115    88 0.794   7.53
 5.02 Intr + 149398 149577  180  2  0   16   57   122 0.072   0.82
 5.03 Term + 168410 169344  935  0  2   67   48   271 0.859  12.29
 5.04 PlyA + 170430 170435    6                                1.05

 6.04 PlyA - 170914 170909    6                                1.05
 6.03 Term - 179327 179106  222  2  0   79   36   119 0.382   1.73
 6.02 Intr - 181791 181677  115  2  1   68   70    33 0.235  -0.97
 6.01 Init - 192800 192745   56  2  2   60   52    74 0.266   1.81
 6.00 Prom - 195077 195038   40                               -3.05

 7.10 PlyA - 195830 195825    6                                1.05
 7.09 Term - 198464 197682  783  2  0   48   42   242 0.172   7.95
 7.08 Intr - 204012 203871  142  2  1    1   91   100 0.445   1.03
 7.07 Intr - 206198 206109   90  1  0  108   95    12 0.522   2.09
 7.06 Intr - 209385 209249  137  0  2   81  100    91 0.933   8.05
 7.05 Intr - 211663 211510  154  0  1   53   61    84 0.974   1.35
 7.04 Intr - 213046 212948   99  0  0   48  100   104 0.977   5.91
 7.03 Intr - 213635 213483  153  1  0   98   23    80 0.505   0.77
 7.02 Intr - 217668 217600   69  2  0   99   70    67 0.808   3.38
 7.01 Intr - 220770 220676   95  0  2   53   68    88 0.630   1.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  81262  81585  324  0  0   17   49   235 0.852   6.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:80931285_81153533|GENSCAN_predicted_peptide_1|126_aa
MRITLAQMTVELSFKPWTVTIYTYMWKRKQLLSCLNYYLGISDICSCISFQQQVAGHLQD
RGKHMRGLLAPWCNSVLGMEKCFHFNSDFNNDLGTVCICGKLEVQWRDSRTPFVEKKFED
KHTEVX
>gi568815594f:80931285_81153533|GENSCAN_predicted_CDS_1|378_bp
atgaggatcaccctggcccagatgactgtggagctgtcatttaaaccttggactgtcacc
atctacacttacatgtggaagagaaagcagcttctatcctgtttaaattactatcttggc
atttctgatatttgtagctgtatctcatttcagcaacaagtggctggacatctgcaggat
cgggggaagcacatgcgtggcctactggctccctggtgcaattctgtgctaggcatggag
aagtgtttccacttcaacagcgattttaacaatgatctaggaactgtatgcatctgtggg
aaactggaagtccagtggagagattctagaaccccgttcgtggaaaaaaagtttgaggat
aaacacactgaagtggnn
>gi568815594f:80931285_81153533|GENSCAN_predicted_peptide_2|195_aa
MVEGKEGQVMSYMDGIRQRENEEDAKAETPDKTITSPQNLLKLISNFSKVSGYKINVQKS
QAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEINEDTNK
WKNIPCSWVARINIMKMAIMPKVIYRFNAIPIKQPMTFFTELEKNYFKLHMEPKKSLHRQ
VNPKPKEQSWRHHAT
>gi568815594f:80931285_81153533|GENSCAN_predicted_CDS_2|588_bp
atggtggaaggcaaggagggacaagtcatgtcttacatggatggcatcaggcaaagagag
aatgaggaagatgcaaaagcagaaacccctgataaaaccatcacatctccccaaaatctc
cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca
caagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactccca
ttcacaattgcttcaaagagaataaaatacttaggaatccaacttacaagggacgtgaag
gacctcttcaaggagaactacaaaccactgctcaatgaaataaatgaggatacaaacaaa
tggaagaacattccatgctcatgggtagcaagaatcaatatcatgaaaatggccataatg
cccaaggtaatttatagattcaatgccatccccatcaagcaaccaatgactttcttcaca
gaattggaaaaaaactactttaaacttcatatggaaccaaaaaagagcctgcatcgccaa
gtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctga
>gi568815594f:80931285_81153533|GENSCAN_predicted_peptide_3|147_aa
MWESFEFPRDLLNGFDQNAESDMDDKVQAEVVSDRDEELVGNWSRAAPFVTKVGQGTAQA
VASEGASPTPWQLPCGVEPVGAQKSRIEVWDPLPRFQKMYGNAWMSRQKFAAGAGLSWRT
YARALHKRNVGSETPHKVPTGHCLVEL
>gi568815594f:80931285_81153533|GENSCAN_predicted_CDS_3|444_bp
atgtgggaaagttttgaatttcctagagacttgttgaatggctttgaccaaaatgctgaa
agtgatatggacgacaaagtccaggctgaggtggtctcagatagagatgaagaacttgtt
gggaactggagtagagctgctccctttgtgactaaagtgggccaaggtacagctcaggct
gtggcttcagagggtgcaagccccacaccttggcagcttccatgtggtgttgagcctgtg
ggggcacagaagtcaagaattgaggtttgggaccctctgcctagatttcagaagatgtat
ggaaatgcttggatgtccaggcagaagtttgctgcaggggcagggctttcatggagaacc
tatgctagggcattgcacaagagaaatgtgggatcagagaccccacacaaagttcctact
gggcactgcctagtggagctgtga
>gi568815594f:80931285_81153533|GENSCAN_predicted_peptide_4|383_aa
MAGASRLLFLWLGCFCVSLAQGERPKPPFPELRKAVPGDRTAGGGPDSELQPQDKVSEHM
LRLYDRYSTVQAARTPGSLEGGSQPWRPRLLREGNTVRSFRAAAAASLSAPCLVALPRDL
SPPSSAALWIPPSSHTPRVASCISLDMAKSHRDIMSWLSKDITQLLRKAKENEEFLIGFN
ITSKGRQLPKRRLPFPEPYILVYANDAAISEPESVVSSLQGHRNFPTGTVPKWDSHIRAA
LSIERRKKRSTGVLLPLQNNELPGAEYQYKKDEVWEERKPYKTLQAQAPEKSKNKKKQRK
GPHRKSQTLQFDEQTLKKARRKQWIEPRNCARRYLKVDFADIGWSEWIISPKSFDAYYCS
GACQFPMPKVAIVLCPVLTSYFH
>gi568815594f:80931285_81153533|GENSCAN_predicted_CDS_4|1152_bp
atggctggggcgagcaggctgctctttctgtggctgggctgcttctgcgtgagcctggcg
cagggagagagaccgaagccacctttcccggagctccgcaaagctgtgccaggtgaccgc
acggcaggtggtggcccggactccgagctgcagccgcaagacaaggtctctgaacacatg
ctgcggctctatgacaggtacagcacggtccaggcggcccggacaccgggctccctggag
ggaggctcgcagccctggcgccctcggctcctgcgcgaaggcaacacggttcgcagcttt
cgggcggcagcagcagcttccttgtctgccccttgcttggtggcgctgcctagggacctt
tctccgccctcctcagctgccctctggattcctccgtccagtcacaccccgcgtgtcgcc
agctgcatctccttggatatggccaaatctcatcgagatattatgtcctggctgtctaaa
gatatcactcaactcttgaggaaggccaaagaaaatgaagagttcctcataggatttaac
attacgtccaagggacgccagctgccaaagaggaggttaccttttccagagccttatatc
ttggtatatgccaatgatgccgccatttctgagccagaaagtgtggtatcaagcttacag
ggacaccggaattttcccactggaactgttcccaaatgggatagccacatcagagctgcc
ctttccattgagcggaggaagaagcgctctactggggtcttgctgcctctgcagaacaac
gagcttcctggggcagaataccagtataaaaaggatgaggtgtgggaggagagaaagcct
tacaagacccttcaggctcaggcccctgaaaagagtaagaataaaaagaaacagagaaag
gggcctcatcggaagagccagacgctccaatttgatgagcagaccctgaaaaaggcaagg
agaaagcagtggattgaacctcggaattgcgccaggagatacctcaaggtagactttgca
gatattggctggagtgaatggattatctcccccaagtcctttgatgcctattattgctct
ggagcatgccagttccccatgccaaaggtagccattgttctctgtcctgtacttacttcc
tatttccattag
>gi568815594f:80931285_81153533|GENSCAN_predicted_peptide_5|407_aa
MDEENVVCTYNGKLFSLIKEEKFATCDNTDESGRQYGRGWNSLEGSEEDRKMWESMELPR
DMLNGFAQNANSNMDNKVLAEVVSDGHEELVGNWSKVLEVLARAIRQEKEIKGIQLGKEE
VKLSLFADDMIVYLGNPTVSAQNLLKLIGNFSKVSGYKINVQKSQAFLNTNNRQKESQIM
SELPFTIASKRIKYPGIQLTRHVKNLFKENYKPLLDEIKEDTNKWKNIPCSLVGRINIVK
MAILPKVIYRFNAISIKLPMTFFTKLEKTTLKFIWNQKRARIAKSILSQKNKGGGITLPD
FKLYYKATVTKTACYWYQNRDADQWNRTEPSEIMPHIYNHLIFDKPDKNKKWGNDSLFNK
WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPRTIKTLEEKF
>gi568815594f:80931285_81153533|GENSCAN_predicted_CDS_5|1224_bp
atggatgaagaaaatgtggtatgtacatataatggaaaattattcagccttataaaagaa
gaaaagtttgctacatgtgacaacactgatgaatctggaagacagtatggcagaggttgg
aacagtttggagggctcagaagaagacaggaaaatgtgggaaagtatggaacttcctaga
gacatgttgaatggctttgcccaaaatgctaacagcaatatggacaataaagtcctggct
gaggtggtttcagatggacatgaggaacttgttgggaactggagcaaagtgttggaagtt
ctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa
gtcaaattgtccctgtttgcagatgacatgattgtgtatctaggaaaccccactgtctca
gcccaaaatctccttaagctcataggcaacttcagcaaagtctcaggatacaaaatcaat
gtgcaaaaatcacaagcattcttaaacaccaataacagacaaaaagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaataaaatacccaggaatccaacttaca
aggcatgtgaagaacctcttcaaggagaactacaaaccactgctcgatgaaataaaagag
gatacaaacaaatggaagaacattccatgctcattggtaggaagaatcaatatcgtgaaa
atggccatactgcccaaggtaatttatagattcaatgccatctccatcaagctaccaatg
actttcttcacaaaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cggattgccaagtcaatcctaagccaaaagaacaaaggtggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaacaaaaacagcatgttactggtaccaaaacaga
gatgcagaccaatggaacagaacagagccctcagaaataatgccgcacatctacaaccat
ctgatctttgacaaacctgacaaaaacaagaaatggggaaacgattctctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctagaaccata
aaaaccctagaagaaaaattctag
>gi568815594f:80931285_81153533|GENSCAN_predicted_peptide_6|130_aa
MIPDEGSDMPQGVVTTKKWPFHWLFLLVARLPLGCLQSSLTYLTEISTQICSNATASVDF
GFAKKIGSGQKTWTFCGTPEYVAPEVILNKGHDFSVDFWSLGILVYELLTGKYVPSSFMQ
PPLQRNAKIT
>gi568815594f:80931285_81153533|GENSCAN_predicted_CDS_6|393_bp
atgatccctgatgaaggatctgatatgccacaaggagtggtgaccacaaagaaatggcct
tttcactggctcttccttttggtggcacgactgccgctcggatgcttacagagctctctt
acttacctcactgagatttctactcaaatctgctcaaatgctaccgcctcagttgacttt
ggatttgcgaagaaaatagggtctggacagaaaacatggacattctgtgggactccagaa
tatgtagctcctgaagtcattctcaacaagggacatgacttcagtgtggatttctggtca
ctgggaattctagtgtatgagctcctaacgggcaagtatgtaccttcaagttttatgcag
ccacctcttcagagaaatgcaaaaataacctaa
>gi568815594f:80931285_81153533|GENSCAN_predicted_peptide_7|573_aa
VKVTQSTEGHDQPQLIKTLQKGEYFGEKALISDDVRSANIIAEENDVACLVIDREDMDEA
GNHHSQQTIARTKNQTLHVLTHRWELNNENTWTQEGEHDTPGPVMGTFNQTVGTFEELQK
YLEGYVANLNRDDEKRHAKRSMSNWKLSKALSLEMIQLKEKVARFSSSSPFQNLEIIATL
GVGGFGRVELVKVKNENVAFAMKCIRKKHIVDTKQQEHVYSEKRILEELCSPFIVKLYRT
FKDNKYVYMLLEACLGGELWSILRDRGSFDEPTSKFCVACVTEAFDYLHRLGIIYRDLKP
ENLILDAEGYLKLENRLMCKDTHRLKMQGWRNIYQANGKQKNSGVAILVSDKTNFKPTKI
KKDKEGHYIVVKGSMQPEELTILNMYAPNTGEPRFIKQVLTDLQRDLDSYTIIVGDINTP
LSTLDRSMDRKLTGILNIPGIFNIQELLSALDQADLVAVYRTLHCKSTEYLFFSAPHHTY
SKIDHIIGNKTLLSKCKRTEILSLSDNRAIKLELRIKKLTQNCTTTWQLNNLLLNDYWVN
NKIKAEINKLFETNENKDTTYQNFWDTAKEGNL
>gi568815594f:80931285_81153533|GENSCAN_predicted_CDS_7|1722_bp
gtaaaagtaacacagagcacagaaggccatgatcaaccacagctgataaaaacactgcag
aaaggagaatactttggagaaaaagctcttatcagtgatgatgtcaggtcagctaacatt
attgctgaagaaaatgatgttgcatgcctggttatagatcgagaggacatggatgaagct
ggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacgctgcatgttctc
actcataggtgggaactgaacaatgagaacacttggacacaggaaggggaacatgacaca
ccggggcctgtcatgggaacattcaaccaaactgtcggtacatttgaagagctgcaaaaa
taccttgaaggatatgtggcaaacctgaaccgtgatgatgaaaaaagacatgcgaagcgg
tccatgtctaactggaagctgtccaaagcactctctctggaaatgattcagctgaaggag
aaggtggccagattttcctcatcatccccattccagaaccttgagattattgcaacactg
ggcgttggtgggttcggaagagttgagcttgttaaagtaaaaaatgagaatgttgctttt
gctatgaagtgtataaggaagaagcacatagttgacaccaagcagcaggagcatgtctac
tcagagaagaggatcctagaggagctgtgctctccattcattgtgaaattatatcgtact
ttcaaggacaataagtatgtatacatgcttctggaggcctgcttaggtggggagctctgg
agtatattaagggacagaggcagctttgatgaacccacctccaaattctgcgttgcttgt
gtgacagaagcatttgattacctgcatcgactaggtattatctacagagacttgaaacca
gaaaacttaattctagatgctgagggttaccttaaattggagaaccgtctcatgtgcaaa
gacacacataggctcaaaatgcagggatggaggaatatttaccaagcaaatggaaagcaa
aagaattcaggggttgcaatcctagtctctgataaaacaaactttaaaccaacaaagata
aaaaaagacaaagaagggcattacatagtggtaaagggatcaatgcaaccagaagagcta
actatcctgaatatgtatgcacccaatacaggagaacccagattcataaagcaagttctt
acagacctacaaagagacttagattcctacacaataatagtgggagacattaacactcca
ctgtcaacattagacagatcaatggacagaaaattaacaggaattttaaatattcctgga
atatttaatattcaggaattgctgtcagctctggaccaagcagacctagtagctgtctac
agaactctccactgcaagtcaacagaatatttattcttctcagcaccacatcacacttat
tctaaaatcgaccacataattggaaataaaacactcctcagcaaatgcaaaagaacggaa
atcctaagtctctcagacaacagggcaatcaaattagaactcaggattaagaaactcact
caaaactgcacaactacatggcaactgaacaacctgctcctgaatgactactgggtaaat
aacaaaattaaggcagaaataaataagctgtttgaaaccaatgagaacaaagacacaaca
taccagaatttctgggacacagctaaagagggaaatttatag