GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:13:23 Sequence gi568815597f:93010228_93237024 : 226797 bp : 38.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1340 1446 107 0 2 80 39 125 0.642 4.39 1.02 PlyA + 2932 2937 6 -0.45 2.00 Prom + 3009 3048 40 -3.65 2.01 Init + 9851 9918 68 1 2 92 78 63 0.423 6.30 2.02 Intr + 57188 57355 168 2 0 111 77 39 0.006 3.24 2.03 Intr + 68933 69208 276 2 0 61 100 204 0.000 14.61 2.04 Intr + 100003 100142 140 1 2 55 32 162 0.181 6.49 2.05 Intr + 104461 104556 96 2 0 66 86 82 0.324 4.86 2.06 Intr + 104761 104861 101 2 2 24 71 56 0.686 -3.59 2.07 Intr + 105243 105391 149 2 2 80 89 119 0.990 9.31 2.08 Intr + 109106 109174 69 2 0 111 103 23 0.891 3.48 2.09 Intr + 119051 119221 171 2 0 33 93 171 0.991 10.24 2.10 Intr + 126443 126758 316 2 1 64 56 357 0.060 25.24 2.11 Intr + 130209 130256 48 2 0 79 116 28 0.060 2.66 2.12 Intr + 138641 138812 172 1 1 42 99 86 0.131 3.69 2.13 Term + 140839 140939 101 2 2 127 43 35 0.328 0.21 2.14 PlyA + 142310 142315 6 1.05 3.06 PlyA - 142527 142522 6 1.05 3.05 Term - 144661 144443 219 1 0 97 41 137 0.994 5.96 3.04 Intr - 146256 146073 184 2 1 92 65 128 0.991 9.77 3.03 Intr - 149999 149902 98 2 2 91 113 63 0.881 6.99 3.02 Intr - 161015 160891 125 0 2 41 89 47 0.078 -0.52 3.01 Init - 161197 161125 73 1 1 28 101 90 0.150 5.58 3.00 Prom - 164748 164709 40 -6.75 4.03 PlyA - 165642 165637 6 1.05 4.02 Term - 169753 169616 138 1 0 74 45 108 0.670 2.08 4.01 Init - 170015 169827 189 2 0 82 108 321 0.999 30.56 4.00 Prom - 171762 171723 40 -8.65 5.00 Prom + 172884 172923 40 -8.85 5.01 Init + 173135 173268 134 1 2 47 111 97 0.729 7.56 5.02 Intr + 173751 173919 169 0 1 89 25 111 0.958 3.93 5.03 Intr + 176118 176276 159 2 0 105 53 97 0.983 7.16 5.04 Intr + 181773 181879 107 2 2 72 84 75 0.991 3.59 5.05 Intr + 183389 183517 129 2 0 57 95 64 0.874 2.89 5.06 Intr + 191665 191821 157 1 1 80 97 138 0.583 12.99 5.07 Intr + 195283 195404 122 0 2 61 87 74 0.927 2.97 5.08 Intr + 196880 197171 292 2 1 99 100 241 0.980 22.61 5.09 Intr + 200575 200699 125 0 2 60 72 23 0.609 -3.64 5.10 Intr + 201874 202034 161 1 2 95 97 110 0.997 11.31 5.11 Intr + 204516 204739 224 1 2 28 74 176 0.483 7.12 5.12 Intr + 206409 206519 111 2 0 83 53 91 0.964 4.76 5.13 Intr + 207511 207642 132 0 0 86 89 89 0.997 8.72 5.14 Intr + 216106 216222 117 0 0 95 84 91 0.997 9.14 5.15 Intr + 222214 222366 153 0 0 35 76 132 0.736 6.05 5.16 Intr + 226021 226163 143 0 2 33 98 176 0.928 11.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 42017 41802 216 2 0 72 53 156 0.822 5.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:93010228_93237024|GENSCAN_predicted_peptide_1|35_aa XLMPFSKAEQEKQDLDKGPDTCIRPRGFQALTIKS >gi568815597f:93010228_93237024|GENSCAN_predicted_CDS_1|108_bp ntgctcatgcccttcagcaaagcagaacaagagaagcaggatttggacaagggccctgac acttgcatcagaccgagaggcttccaagcacttactatcaagagctag >gi568815597f:93010228_93237024|GENSCAN_predicted_peptide_2|624_aa MTDSRTGEGKTQDVPGACCSAEKLILCPFFALVCDQARPTSLVYITPAPLPSGFWLGQDT IKAPYKLTVWRSEGTRKESPSRGRRARCSSALAAQSLPPRGFGELPFGTGVAPRSQNAPA VRGKPKWRGAVLKWAVFEAGVRTLILPPTLVSKDLGLCVHMCRVPGGAGAQDSTGAGNSL VHKRSPLRRNQKTPTSLTKLSLQDGHKAKKPACKFEEGATGSGEMVCTICQEEYSEAPNE MVICDKCGQGYHQLCHTPHIDSSVIDSDEKWLCRQCVFATTTKRGGALKKGPNAKALQVM KQTLPYSVADLEWDAGHKTNVQQCYCYCGGPGEFYTFICSVCSSGPEYLKRLPLQWFMSG KEIKKKKHLFGLRIRVPPVPPNVAFKAEKEPEGTSHEFKIKGRKASKPISDSRLSDSRKR TRTGRSWPAAIPHLRRRRGRLPRRALQTQNSEIVKDDEGKEDYQFDELNTEILNNLADQE LQLNHLKNSITSYFGAAGRIACGEKYRVLARRVTLDGKLISHVQENKNWTVKCKGIIRFV RATYNCLIFEHIWSAIAADPNTYIMQCWISFTIAHVITFIKAAVDELKPETDAGIQSPSP LCPVLCNIILALDSLIGVLSPDVH >gi568815597f:93010228_93237024|GENSCAN_predicted_CDS_2|1875_bp atgactgattctagaactggggaaggaaagacacaagatgtacctggagcatgttgcagt gccgaaaagttgattctctgtcctttctttgccctggtctgtgaccaggcaagaccgact tctctggtctatattaccccagctcccttgccctctggcttctggttgggtcaagacacc ataaaggcaccttataagcttactgtatggagatcagagggcaccaggaaagaaagcccc agtcggggacgcagggcgcggtgttcctccgcgctcgccgcgcagtccctgcccccccgc ggctttggagagctgccattcggcaccggagtcgctccgcgctcccagaatgcaccggca gtccgcgggaaaccaaaatggcgaggggctgtattgaagtgggctgtgtttgaggccggt gtaagaacgctcattctacccccaacccttgtctccaaggacctcggtttgtgcgtgcat atgtgccgggtacccggtggggcgggtgcccaagactctacaggggcaggtaattcactg gtccacaagcggtctcctttacgtcgaaaccaaaagaccccaacatccttgaccaagctg tctttacaggatggacataaagccaaaaagccagcatgtaaatttgaagagggagccact ggaagtggggaaatggtctgtacaatatgtcaagaagagtattcagaagctcccaatgaa atggttatatgtgacaagtgtggccaaggatatcatcagttgtgtcacacacctcatatt gattccagtgtgattgattcagatgaaaaatggctctgtcggcagtgtgtttttgcaaca acaacaaagaggggtggtgcacttaagaaaggaccaaatgccaaagcattgcaagtcatg aagcagacattaccctatagtgtggcagaccttgaatgggatgcaggtcataaaaccaat gtccagcagtgttactgctattgtggaggccctggagaattttatacgtttatatgctct gtctgcagttctggaccagaatacctcaaacgtctaccattacagtggtttatgtctggg aaagaaataaagaagaagaagcatttgtttgggttgcgaattcgtgttcctcctgtgcca ccaaatgtggctttcaaagcagagaaagaacctgaaggaacatctcatgaatttaaaatt aaaggcagaaaggcatccaaacctatatctgattcaagattatctgactccagaaaaaga acgcgtacaggaagatcttggcctgctgcaataccacatttgcggagaagaagaggtcgt cttccaagaagagcactccagactcagaactcagaaattgtaaaagatgatgaaggcaaa gaagattatcagtttgatgaactcaacacagagattctgaataacttagcagatcaggag ttacaactcaatcatctaaagaactccattaccagttattttggtgctgcaggtagaata gcatgtggcgaaaaataccgagttttggcacgtcgggtgacacttgatggaaagttgatc agtcatgttcaggaaaataaaaactggacagtgaaatgtaagggcatcattcggtttgtc agggccacatacaactgcctgatatttgaacacatttggtcagcaattgctgcagaccct aatacgtatataatgcagtgctggatatcattcactattgctcatgtgataacattcatc aaagctgcagtggatgaattaaaaccagaaacagatgcagggatccagagtccatctcca ttgtgccctgtcttatgtaacatcatcctagctctagattctttaattggtgtcctttcc ccagacgtacattag >gi568815597f:93010228_93237024|GENSCAN_predicted_peptide_3|232_aa MFHCVRNWRVLGLTDFKNEAADPRGVKLQTFAVRVTALKVACLELFVPPGGFVVSLASGV KLQTFVVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISE KVIFFELILDNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQTLL RAFEARDRNIQESNFDRVNFWSMVNLVVMVVVSAIQVYMLKSLFEDKRKSRT >gi568815597f:93010228_93237024|GENSCAN_predicted_CDS_3|699_bp atgttccactgtgtccggaattggcgggttcttggtctcactgacttcaagaatgaagcc gcggaccctcgcggagtgaagctgcagacctttgcagtgagagttacagctcttaaggtg gcgtgtctggagttgttcgttcctcctggtgggtttgtggtctcgctggcttcaggagtg aagctgcagaccttcgtggttttagatggagcaggattagatattgatttccatcttgcc tctccagaaggcaaaaccttagtttttgaacaaagaaaatcagatggagttcacactgta gagactgaagttggtgattacatgttctgctttgacaatacattcagcaccatttctgag aaggtgattttctttgaattaatcctggataatatgggagaacaggcacaagaacaagaa gattggaagaaatatattactggcacagatatattggatatgaaactggaagacatcctg gaatccatcaacagcatcaagtccagactaagcaaaagtgggcacatacaaactctgctt agagcatttgaagctcgtgatcgaaacatacaagaaagcaactttgatagagtcaatttc tggtctatggttaatttagtggtcatggtggtggtgtcagccattcaagtttatatgctg aagagtctgtttgaagataagaggaaaagtagaacttaa >gi568815597f:93010228_93237024|GENSCAN_predicted_peptide_4|108_aa MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEI EYQKLLVDGPGGEEARAVQEASPLRAGARGEARLSLGPGTQNLGWTWN >gi568815597f:93010228_93237024|GENSCAN_predicted_CDS_4|327_bp atgggcgacaagatctggctgcccttccccgtgctccttctggccgctctgcctccggtg ctgctgcctggggcggccggcttcacaccttccctcgatagcgacttcacctttaccctt cccgccggccagaaggagtgcttctaccagcccatgcccctgaaggcctcgctggagatc gagtaccaaaaacttctggtggacggccctggtggtgaagaggcgagggccgttcaggaa gccagcccgctccgcgcaggcgcgcgaggcgaggcgaggctctcgctggggccggggacc cagaacttaggctggacctggaattga >gi568815597f:93010228_93237024|GENSCAN_predicted_peptide_5|812_aa MESSSSDYYNKDNEEESLLANVASLRHELKITEWSLQSLGEELSSVSPSENSDYAPNPSR SEKLILDVQPSHPGLLNYSPYENVCKISGSSTDFQKKPRDKMFSSSAPVDQEIKSLREKL NKLRQQNACLVTQNHSLMTKFESIHFELTQSRAKVSMLESAQQQAASVPILEEQIINLEA EVSAQDKVLREAENKLEQSQKMVIEKEQSLQESKEECIKLKVDLLEQTKQGKRAERQRNE ALYNAEELSKAFQQYKKKVAEKLEKVKGSCANSVFCITVYIPTVKVQAEEEILERNLTNC EKENKRLQERCGLYKSELEILKEKLRQLKEENNNGKEKLRIMAVKNSEVMAQLTESRQSI LKLESELENKDEILRDKFSLMNENRELKVRVAAQNERLDLCQQEIESSRVELRSLEKIIS QLPLKRELFGFKSYLSKYQMSSFSNKEDRCIGCCEANKLVISELRIKLAIKEAEIQKLHA NLTANQLSQSLITCNDSQESSKLSSLETEPVKLGGHQVAESVKDQNQHTMNKQYEKERQR LVTGIEELRTKLIQIEAENSDLKVNMAHRTSQFQLIQEELLEKASNSSKLESEMTKKCSQ LLTLEKQLEEKIVAYSSIAAKNAELEQELMEKNEKIRSLETNINTEHEKICLAFEKAKKI HLEQHKEMEKQIERVRQLDSALEICKEELVLHLNQLEGNKEKFEKQLKKKSEEKELKIKN HSLQETSEQNVILQHTLQQQQQMLQQETIRNGELEDTQTKLEKQVSKLEQELQKQRESSA EKLRKMEEKCESAAHEADLKRQKVIELTGTAS >gi568815597f:93010228_93237024|GENSCAN_predicted_CDS_5|2436_bp atggaatctagttcatcagactactataataaagacaatgaagaggaaagtttgcttgca aatgttgcttccttaagacatgaactgaagataacagaatggagtttgcagagtttaggg gaagagttatccagtgttagtccaagtgaaaattctgattatgcccctaatccttcaagg tctgaaaagctaattttggatgttcagcctagccaccctggacttttgaattattcacct tatgaaaacgtctgtaaaatatctggtagcagcactgattttcaaaaaaagccaagagat aagatgttttcatcttctgcccctgtggatcaggagattaaaagccttcgagagaaacta aataaacttaggcaacagaatgcttgtttggtcacacagaatcattccttaatgactaaa tttgaatctattcactttgaattaacacagtcaagagcaaaagtttctatgcttgagtct gctcaacagcaggcagccagtgtcccaatcttagaagaacagattataaatttggaagca gaggtttcagctcaagataaagttttgagagaggcagaaaataagctggaacagagccag aaaatggtaattgaaaaggaacagagtttgcaggagtccaaagaggaatgtataaaatta aaggtggacttacttgaacaaaccaaacaaggaaaaagagctgaacgacaaaggaatgaa gcactatataatgccgaagagctgagtaaagctttccaacaatataaaaaaaaagtggct gaaaaactggaaaaggtaaaaggcagttgtgcaaattcagtgttttgtattactgtctat attccaacagtaaaggttcaagctgaagaagaaatattagagagaaatctaactaactgt gaaaaagaaaataaaaggctacaagaaaggtgtggtctatataaaagtgaacttgaaatt ctgaaagagaaattaaggcagttaaaagaagaaaataacaacggaaaagaaaaattaagg atcatggcagtgaaaaattcagaagtcatggcacaactaactgaatctagacaaagtatt ttgaagctagagagtgagttagagaacaaagacgaaatacttagagacaaattttcttta atgaatgaaaaccgagaattaaaggtccgtgttgcagcacagaatgagcgactagattta tgtcaacaagaaattgaaagttcaagggtagaactaagaagtttggaaaagattatatcc cagttgccattaaaaagagaattatttggctttaaatcatatctttctaaataccagatg agtagcttctcaaacaaggaagaccgttgcattggctgctgtgaggcaaataaattggtg atttcggaattgagaattaagcttgcaataaaagaggcagaaattcaaaagcttcatgca aacctgactgcaaatcagttatctcagagtcttattacttgtaatgacagccaagaaagt agcaaattaagtagtttagaaacagaacctgtaaagctaggtggtcatcaagtagcagaa agcgtaaaagatcaaaatcaacatactatgaacaagcaatatgaaaaagagaggcaaaga cttgttactggaatagaagaactacgtactaagctgatacaaatagaagctgaaaattct gatttgaaggttaacatggctcacagaactagtcagtttcagctgattcaagaggagctg ctagagaaagcttcaaactccagcaaactggaaagtgaaatgacaaagaaatgttctcaa cttttaactcttgagaaacagctggaagaaaagatagttgcttattcctctattgctgca aaaaatgcagaactagaacaggagcttatggaaaagaatgaaaagataaggagtctagaa accaatattaatacagagcatgagaaaatttgtttagcctttgaaaaagcaaagaaaatt cacttggaacagcataaagaaatggaaaagcagattgaaagagttaggcaactagattca gcattggaaatttgtaaggaagaacttgtcttgcatttgaatcaattggaaggaaataag gaaaagtttgaaaaacagttaaagaagaaatctgaagagaaagagctaaagataaaaaat cacagtcttcaagagacttctgagcaaaacgttattctacagcatactcttcagcaacag cagcaaatgttacaacaagagacaattagaaatggagagctagaagatactcaaactaaa cttgaaaaacaggtgtcaaaactggaacaagaacttcaaaaacaaagggaaagttcagct gaaaagttgagaaaaatggaggagaaatgtgaatcagctgcacatgaagcagatttgaaa aggcaaaaagtgattgagcttactggcactgccagn