GENSCAN 1.0    Date run: 5-Nov-116    Time: 07:16:09

Sequence gi568815595f:113678754_113909424 : 230671 bp : 39.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   1142   1137    6                               1.05
 1.03 Term -   7591   7451  141  1  0   94   41    46 0.590  -2.55
 1.02 Intr -  12779  12625  155  0  2   55   92   110 0.732   6.97
 1.01 Init -  17680  17617   64  1  1   90  114   202 0.999  22.36
 1.00 Prom -  31640  31601   40                              -4.65

 2.03 PlyA -  32021  32016    6                               1.05
 2.02 Term -  43184  43007  178  1  1   95   47   243 0.965  17.08
 2.01 Init -  46042  45840  203  1  2   73   63    30 0.217  -2.55
 2.00 Prom -  57659  57620   40                              -2.65

 3.00 Prom +  60653  60692   40                              -4.95
 3.01 Init +  62427  62539  113  2  2   25   83   127 0.669   5.63
 3.02 Term +  66784  67276  493  1  1   18   36   312 0.675  12.09
 3.03 PlyA +  69832  69837    6                               1.05

 4.00 Prom +  76548  76587   40                              -6.25
 4.01 Init + 100001 100082   82  1  1   51  119    52 0.849   5.78
 4.02 Intr + 102297 102389   93  1  0   66   23   140 0.478   4.32
 4.03 Intr + 105471 105685  215  1  2   40   79   225 0.554  14.31
 4.04 Intr + 105943 106080  138  0  0   29   84    97 0.898   3.14
 4.05 Intr + 107479 107630  152  0  2   70   94   178 0.999  14.64
 4.06 Intr + 109960 110122  163  1  1   82   58   176 0.997  13.06
 4.07 Intr + 110979 111087  109  2  1   52  105    70 0.992   4.04
 4.08 Intr + 116119 116241  123  2  0   80  111    99 0.992  11.04
 4.09 Intr + 116337 116451  115  1  1  107   68    79 0.985   6.39
 4.10 Intr + 117123 117186   64  0  1   58   91    62 0.976   1.30
 4.11 Intr + 119490 119693  204  2  0   78  113   190 0.991  18.97
 4.12 Intr + 124830 124924   95  2  2  101  115   106 0.999  12.44
 4.13 Intr + 126601 126772  172  1  1   97   71   145 0.999  12.72
 4.14 Term + 130582 130674   93  0  0   90   43   138 0.998   6.35
 4.15 PlyA + 132437 132442    6                               1.05

 5.06 PlyA - 132451 132446    6                               1.05
 5.05 Term - 137783 137592  192  2  0   56   48   121 0.222   1.34
 5.04 Intr - 147410 147377   34  1  1   75  105    19 0.447  -0.39
 5.03 Intr - 149657 149412  246  1  0   47   89   198 0.321  11.45
 5.02 Intr - 160086 159968  119  0  2   38   36   107 0.104  -1.16
 5.01 Init - 160484 160332  153  2  0  101   17   111 0.278   5.33
 5.00 Prom - 163909 163870   40                              -4.15

 6.02 PlyA - 165249 165244    6                               1.05
 6.01 Sngl - 171762 171484  279  0  0   33   41   246 0.657   9.76
 6.00 Prom - 177132 177093   40                              -2.85

 7.00 Prom + 190766 190805   40                              -4.35
 7.01 Init + 206545 206718  174  0  0   43   94   187 0.956  12.78
 7.02 Term + 206832 207278  447  2  0   47   48   319 0.929  18.03
 7.03 PlyA + 207879 207884    6                               1.05

 8.03 PlyA - 210096 210091    6                               1.05
 8.02 Term - 216848 216720  129  2  0  109   47    55 0.299   0.70
 8.01 Intr - 224884 224622  263  2  2   94   42   235 0.100  15.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_1|119_aa
MTQAGVAAVAAATAAETAQPADFRNGTEKISEGKAGYYCQKAKTTDLLQSSSDLAVWKSF
VELGLSSFRNKSQPCRENRCAWGRESKVIVQLCIGIQCCSVTVESNTGQNSADAHRGSI
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_1|360_bp
atgacccaggcgggggtcgccgccgtcgccgccgccaccgcggccgagaccgcgcagccg
gcggatttcaggaatgggacagagaagatctccgaaggaaaggcaggatattattgtcag
aaggccaaaacaacagatttattacaaagttcatcggatttggcagtgtggaagtctttt
gttgagcttggtttaagcagttttaggaataaaagtcagccatgcagagagaatcgctgt
gcttgggggagggagagcaaagtgattgtgcaactttgcattggaattcagtgttgctct
gtcacagtggaaagcaacacagggcagaattcagctgatgcccacagagggagcatttag
>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_2|126_aa
MRATAPMIRRVPLTTWELWELQDEIWMGTQPNHITPVAFLMDSFHPGVIFKMVDSFVSFP
ATLVLKMGHVQISNESAIDFYRKFGFEIIETKKNYYKRIEPADAHVLQKNLKVPSGQNAD
VQKTDN
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_2|381_bp
atgagggcaaccgcccccatgattcgccgggtccctctcacaacatgggaattatgggaa
ctacaagatgaaatttggatggggacacagccaaaccatatcactcctgttgctttctta
atggatagttttcatcctggggtgattttcaagatggtggattcctttgtttcgtttcca
gcaactctggttctgaaaatggggcatgtccagatcagcaatgagtcggcaattgacttc
tacaggaagtttggctttgagattattgagacaaagaagaactactataagaggatagag
cccgcagatgctcatgtgctgcagaaaaacctcaaagttccttctggtcagaatgcagat
gtgcaaaagacagacaactga
>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_3|201_aa
MLENSETNQLPSIPAPGPQYTDCRHLTREMATVPQTTSSRVESGGGRQPNHSTWSCGPAW
LSGPLPGRRPRALHNSLSMLPHSVASPHSTTHTRLRTETLKPPGSCLRWPFSPPTARASP
PPPPGAEGTEKQRNLPPPVFALNLSSPLRLSPSTWARTLLPHRPGPARLPFLTLSSSPPA
EAVVTTDINAVVVAALRSPHP
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_3|606_bp
atgttggaaaacagtgaaacaaatcagctgcccagcattcctgctccaggtcctcagtac
acagactgcagacatcttacccgggaaatggcaactgtgccacagaccacaagttccaga
gttgagtctggaggtggaagacaaccaaaccactccacctggagctgcgggccggcgtgg
ctcagcggtccgctcccgggaaggaggccccgggcgcttcacaatagcctttccatgttg
ccccactcggtcgcttctccccactcaacaacccacactcgtctccgaaccgaaaccctg
aagccccccgggtcttgcctccgctggcccttcagcccgccaacagcccgagcctcgcct
ccgccccctccgggagccgagggaactgagaagcagagaaatctccctccgcccgtcttc
gccctgaatctctcctcgcctttgcggctctccccctctacatgggcccggacccttcta
ccccaccggccgggccctgcccggctgcccttcctcaccctttcatcttccccgcctgct
gaggccgtcgttaccaccgatatcaacgccgtcgtagtcgccgcccttaggtctccgcac
ccttag
>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_4|605_aa
MDFSKLPKILDEDKESTFGYVHGVSGPVVTACDMAGAAMYELVRVGHSELVGEIIRLEAG
VSVGDPVLRTGKPLSVELGPGIMGAIFDGIQRPLSDISSQTQSIYIPRGVNVSALSRDIK
WDFTPCKNLRVGSHITGGDIYGIVSENSLIKHKIMLPPRNRGTVTYIAPPGNYDTSDVVL
ELEFEGVKEKFTMVQVWPVRQVRPVTEKLPANHPLLTGQRVLDALFPCVQGGTTAIPGAF
GCGKTVISQSLSKYSNSDVIIYVGCGERGNEMSEVLRDFPELTMEVDGKVESIMKRTALV
ANTSNMPVAAREASIYTGITLSEYFRDMGYHVSMMADSTSRWAEALREISGRLAEMPADS
GYPAYLGARLASFYERAGRVKCLGNPEREGSVSIVGAVSPPGGDFSDPVTSATLGIVQVF
WGLDKKLAQRKHFPSVNWLISYSKYMRALDEYYDKHFTEFVPLRTKAKEILQEEEDLAEI
VQLVGKASLAETDKITLEVAKLIKDDFLQQNGYTPYDRFCPFYKTVGMLSNMIAFYDMAR
RAVETTAQSDNKITWSIIREHMGDILYKLSSMKFKDPLKDGEAKIKSDYAQLLEDMQNAF
RSLED
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_4|1818_bp
atggatttttccaagctacccaaaatactcgatgaagataaagaaagcacatttggttat
gtgcatggggtctcaggacctgtggttacagcctgtgacatggcgggtgcagccatgtat
gagctggtgagagtgggccacagcgaattggttggagagattattcgattggaggctggt
gtgtctgttggagatcctgtacttcgcactggtaaacccctctctgtagagcttggtcct
ggcattatgggagccatttttgatggtattcaaagacctttgtcggatatcagcagtcag
acccaaagcatctacatccccagaggagtaaacgtgtctgctcttagcagagatatcaaa
tgggactttacaccttgcaaaaacctacgggttggtagtcatatcactggcggagacatt
tatggaattgtcagtgagaactcgcttatcaaacacaaaatcatgttacccccacgaaac
agaggaactgtaacttacattgctccacctgggaattatgatacctctgatgttgtcttg
gagcttgaatttgaaggtgtaaaggagaagttcaccatggtgcaagtatggcctgtacgt
caagttcgacctgtcactgagaagctgccagccaatcatcctctgttgactggccagaga
gtccttgatgccctttttccgtgtgtccagggaggaactactgctatccctggagccttt
ggctgtggaaagacagtgatatcacagtctctatccaagtattctaacagtgatgtaatc
atctatgtaggatgtggtgaaagaggaaatgagatgtctgaagtcctccgggacttccca
gagctcacaatggaggttgatggtaaggtagagtcaattatgaagaggacagctttggta
gccaatacctccaatatgcctgttgctgctagagaagcctctatttatactggaatcaca
ctgtcagagtacttccgtgacatgggctatcatgtcagtatgatggctgactctacctct
agatgggctgaggcccttagagaaatctctggtcgtttagctgaaatgcctgcagatagt
ggatatccagcctatcttggtgcccgtctggcctcgttttatgaacgagcaggcagggtg
aaatgtcttggaaatcctgaaagagaagggagtgtcagcattgtaggagcagtttctcca
cctggtggtgatttttctgatccagttacatctgccactcttggtatcgttcaggtgttc
tggggcttagataagaaactagctcaacgtaagcatttcccctctgtcaattggctcatc
agctacagcaagtatatgcgtgccttggatgaatactatgacaaacacttcacagagttc
gttcctctgaggacgaaagctaaggaaattctgcaggaagaagaagacctggcagaaatt
gtacagcttgtgggaaaggcttctttggcagaaacagataaaatcactctggaggtagca
aaacttatcaaagatgatttcctacaacaaaatggatatactccttatgacaggttctgc
ccattctacaagacagtagggatgctgtccaacatgattgcattttatgatatggctcgt
agagctgttgaaaccactgcccagagtgacaataaaatcacatggtccattattcgtgag
cacatgggagacatcctctataaactttcctccatgaaattcaaggatccactgaaagat
ggtgaggcaaagatcaaaagcgactatgcacaacttcttgaagacatgcagaatgcattc
cgtagccttgaagattag
>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_5|247_aa
MGPDPRGPGAETSAPGHFGTKFKPSTPTGKQMSPPPPVGKGLAEAQETVSSRALRVTGSA
RLLRRPSEYFQPQLLAPSGGREGFLTMGVTGNLEQLSSSVLAHGLSEVALNQVDIQSSED
LIVAGGSTSMVSHLHGRAPGTGYWFGTSIPLHMDIFTELLQSSLHGSAFPQSRHLVSGPG
AQSLPSIAALGAVICIHGHLSTYLYATGYSTVKRQQFMHSMTVFQGQTIPQIQVSYPTKQ
AIDCKEN
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_5|744_bp
atggggcctgacccaagggggcctggggctgagacatccgcaccaggacatttcgggacc
aaattcaagccctcaacgcccacagggaagcaaatgtccccaccaccgcccgtagggaaa
gggctcgcagaggcacaggaaaccgtttcctcgcgcgcgctgcgagttacaggctccgcg
cgcctcctccgccggccctccgagtacttccagccgcagctccttgcgccctctggcggc
cgtgaaggattcctgacgatgggtgttaccgggaacctggagcagctcagctcctcagtt
ctagctcatgggctttcagaggttgcccttaatcaagtggatatccagtcatcagaagac
ctgatagtggctggaggatccacttctatggtgtctcacttacatggccgggcacctggt
actggctattggtttgggacctcaattcctctccatatggacatcttcacagagctgctt
cagtcctcattgcatggcagtgcctttcctcagagcagacatctagtgtctggtccaggt
gctcaaagtctgccatccattgctgctttaggggctgtgatatgtattcacggacacctg
agtacgtatttgtatgcaactggctacagcactgtgaagagacaacagtttatgcattct
atgactgtgttccaaggacagactattccccaaatccaggtcagctatccaacaaagcaa
gctattgattgcaaagagaattga
>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_6|92_aa
MRASSISMIIFICPGDCAATLRCSHPETGRPWPKGLEGERPARLTKGEADRDTYRRSFVP
PGANKKVEAGAVSATEFQFKGRFGRGRGQPPQ
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_6|279_bp
atgagggcatccagtatctccatgattatcttcatctgccccggagactgtgctgccacc
ctacgctgcagccatccagagactggcaggccttggcctaaaggtctggagggtgagcga
cctgcaagactcacaaaaggggaagccgacagagatacctacagacggagttttgtgccc
cctggtgccaacaagaaagtcgaggctggggctgtgtcagcaaccgaattccagtttaaa
ggcagatttggtcgtggacgtggtcagccacctcagtaa
>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_7|206_aa
MVRTQCLLGLRTFVAFAAKLWSFFIYLLRGQIRTVIQYQTVRYDILPLSPVSRNWLAQVV
IDKHSVRFFVRKRPHVDFFLEVVSQWYELVVFTESMEIYGFAVAGKLDNSRSILKRRYYR
QHCTLQLSSYIKDLSVVHSDLSSIVILDNSLGAYRSHPDNAIPIKSWFSDPSDTALLNLL
PMLDALRFTADVHSVLSRNLHQHRLW
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_7|621_bp
atggtgcggacgcagtgtctgctgggactgcgcacgttcgtggccttcgctgccaagctc
tggagcttcttcatttaccttttgcgggggcagatccgcacggtaattcagtaccaaact
gttcgatatgatatcctccccttatctcctgtgtcccggaattggctagcccaggtggta
atagacaaacattctgtccggttttttgtacgtaagaggccccatgtggatttcttcctg
gaagtggtgagccagtggtacgagctggtggtgtttacagaaagcatggagatctatggc
tttgctgtggcaggtaaactggacaatagcagaagcatccttaagaggagatattacaga
cagcactgcactttgcagttgagcagctacatcaaggacctctctgtggtccacagtgac
ctctccagcattgtgatcctggataactccctaggggcttacaggagccatccagacaat
gccatccccatcaaatcctggttcagtgaccccagcgacacagcccttctcaacctgctc
ccgatgctggatgccctcaggttcaccgctgacgttcattctgtgctgagccgaaacctt
caccaacatcggctctggtga
>gi568815595f:113678754_113909424|GENSCAN_predicted_peptide_8|130_aa
XLRTRFKKQLSNDHMAKTIINVLGSAFPFSRSKKKGKPIKYNGSTTDLISNAQQDGGEMW
KNREERRRKKEGEIANKEEEDGRTEEIKVGLNVAFIRLIALLENFWNSSNSKIKEQGDLI
MLRRLKEKAG
>gi568815595f:113678754_113909424|GENSCAN_predicted_CDS_8|393_bp
nccctaagaacaaggtttaagaaacagctgtcaaatgatcatatggccaagactataata
aatgtcctgggatctgctttccctttctcaaggagcaaaaagaaaggaaaacctattaaa
tacaatgggagcacaactgatctaatttccaatgctcaacaggatggaggtgagatgtgg
aaaaacagagaggaaaggaggaggaagaaggaaggagagattgcaaataaagaggaagaa
gatggaagaaccgaggaaataaaggtaggtctgaatgttgcatttattagattgatagca
ctactggaaaacttttggaattccagtaactccaaaataaaagaacaaggggatttgata
atgttgcgcaggctaaaggagaaagctggatga