GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:34:02 Sequence gi568815586f:100257392_100520156 : 262765 bp : 40.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 4377 4339 39 1 0 117 94 37 0.587 4.48 1.05 Intr - 5018 4873 146 1 2 52 86 144 0.959 9.61 1.04 Intr - 6301 6106 196 2 1 47 88 76 0.116 1.05 1.03 Intr - 9989 9529 461 1 2 117 68 196 0.186 12.40 1.02 Intr - 10267 10045 223 2 1 63 93 151 0.494 9.36 1.01 Init - 15166 15097 70 1 1 71 29 81 0.213 1.76 1.00 Prom - 19480 19441 40 -7.95 2.00 Prom + 19610 19649 40 -6.15 2.01 Init + 25580 25756 177 1 0 112 109 119 0.988 15.71 2.02 Intr + 34112 34269 158 1 2 68 99 162 0.991 13.19 2.03 Intr + 40640 40784 145 2 1 91 91 77 0.593 7.66 2.04 Term + 46116 46292 177 2 0 72 50 117 0.010 3.00 2.05 PlyA + 47397 47402 6 1.05 3.00 Prom + 50775 50814 40 -3.65 3.01 Init + 51625 51681 57 0 0 78 74 34 0.799 2.36 3.02 Intr + 53653 53802 150 0 0 92 80 48 0.879 3.84 3.03 Intr + 55041 55262 222 2 0 81 68 68 0.731 1.50 3.04 Intr + 56031 56147 117 2 0 40 113 52 0.858 2.64 3.05 Intr + 57098 57223 126 1 0 77 76 63 0.937 3.96 3.06 Intr + 58167 58343 177 2 0 76 91 88 0.945 7.09 3.07 Intr + 60412 60534 123 0 0 93 92 -2 0.569 0.56 3.08 Intr + 66134 66247 114 1 0 98 115 27 0.978 6.12 3.09 Intr + 69231 69363 133 2 1 91 38 82 0.996 2.80 3.10 Intr + 71810 71928 119 0 2 99 91 108 0.999 11.46 3.11 Intr + 76775 76875 101 1 2 51 100 64 0.914 1.89 3.12 Intr + 78234 78300 67 0 1 75 110 46 0.977 3.49 3.13 Intr + 78420 78515 96 2 0 83 70 127 0.998 9.69 3.14 Intr + 79996 80115 120 0 0 82 71 99 0.962 7.37 3.15 Term + 81137 81781 645 1 0 67 51 453 0.983 32.63 3.16 PlyA + 82726 82731 6 1.05 4.00 Prom + 95557 95596 40 -3.55 4.01 Init + 100001 100101 101 1 2 64 98 95 0.343 7.88 4.02 Intr + 103604 103649 46 2 1 85 113 30 0.179 2.69 4.03 Intr + 116721 116854 134 2 2 40 72 59 0.010 -1.98 4.04 Intr + 123310 123562 253 1 1 80 77 313 0.799 25.91 4.05 Intr + 133610 133728 119 1 2 69 110 58 0.893 4.44 4.06 Intr + 135978 136092 115 0 1 75 75 53 0.869 2.23 4.07 Intr + 140167 140251 85 0 1 59 93 86 0.930 4.67 4.08 Intr + 144386 144472 87 0 0 99 101 34 0.961 4.82 4.09 Intr + 144949 145088 140 2 2 96 84 56 0.949 5.26 4.10 Intr + 146647 146779 133 0 1 152 69 69 0.984 10.70 4.11 Intr + 155379 155489 111 1 0 128 95 22 0.951 6.33 4.12 Intr + 160638 160765 128 1 2 84 98 104 0.962 10.48 4.13 Term + 162424 162768 345 0 0 65 35 414 0.999 27.31 4.14 PlyA + 163180 163185 6 1.05 5.04 PlyA - 164197 164192 6 1.05 5.03 Term - 167777 167720 58 1 1 113 36 38 0.315 -2.72 5.02 Intr - 173372 173126 247 0 1 57 97 218 0.679 15.20 5.01 Init - 174110 174008 103 2 1 60 68 173 0.554 10.99 5.00 Prom - 178400 178361 40 -5.35 6.05 PlyA - 178642 178637 6 1.05 6.04 Term - 193949 193816 134 0 2 75 48 107 0.213 2.67 6.03 Intr - 206123 205978 146 1 2 46 84 108 0.058 5.21 6.02 Intr - 218199 218076 124 1 1 122 94 63 0.316 9.02 6.01 Init - 227777 227720 58 2 1 66 88 68 0.977 6.12 6.00 Prom - 232289 232250 40 -2.35 7.00 Prom + 236374 236413 40 -4.85 7.01 Init + 239629 239767 139 0 1 76 20 96 0.040 1.95 7.02 Intr + 245977 246105 129 2 0 86 97 10 0.174 1.45 7.03 Intr + 253387 253752 366 2 0 78 121 211 0.913 17.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 44228 43967 262 1 1 29 48 213 0.896 5.41 S.002 Sngl - 117627 117367 261 0 0 35 32 221 0.808 6.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:100257392_100520156|GENSCAN_predicted_peptide_1|379_aa MQAKNLEKTVQTESKNWCTMSQYGLFRGLGPLGSAPRSSLLVQPRFARAPTADRPAGRRQ ARHQATPTSLSGPGRPNPGASPPFPPRRLREPRRESRRRGGAGALSPRPPNKRLLAVPAL PSCLATDSGFTPYPPPPPRPQTSELEGKGWGGEGPGTKSKGGGKRLKDLPAGGMTSLPAG SEEAEPVLLSAPGVRWGGAMVPGEEPARELMAVLLTPRFRRLVSQNELPGPGLNGPSSRN RRDGFCRKRRTGVHLCQVLMNHKVFEPVGMKKLFKKEKELEFEDSNISLYRFLGNKSSYD CCKRQKDAENEFNETLRPGYEMISNPLAQEIGEERIEELIHTINGNPALCPNITVQKPFL RLSKEGVSFYTAITSDKTX >gi568815586f:100257392_100520156|GENSCAN_predicted_CDS_1|1137_bp atgcaggccaaaaacttagaaaaaaccgtccagactgaaagtaaaaattggtgcacgatg agccagtatggcctctttcgaggactcggccctctcggatccgctccccggagctcgctg ctcgtccagccccgctttgcgcgggcgccgaccgccgatcgcccggccggccgccggcag gcccgtcaccaggcaacacccaccagcctgagcggccctgggaggcctaacccgggcgcg agtcctccattccctcctcgccggctccgggagcccaggcgagagagccgccggaggggc ggtgctggggctctttccccccgcccccctaataaacgcctcctggctgttcctgctctg ccttcgtgcctggccactgactctggctttactccatatcctccgccgcccccgcggccg caaacgtcggagctagaaggaaaggggtggggaggggagggaccggggacgaagagtaag ggagggggaaaaagactaaaagacctgccggccgggggcatgacgtcacttcctgccggt tcggaagaggcggagccagtacttctctccgccccgggtgtcaggtggggcggggctatg gtgccaggggaggagccagcgcgcgagcttatggcggttcttttgactccgaggttccgt agacttgtcagtcagaacgagcttccgggcccagggctgaacgggccaagttccaggaac cgtagagatggcttctgccggaaaaggaggacaggggttcatctttgccaagttctaatg aatcacaaagtatttgaaccagtaggaatgaagaagcttttcaaaaaagaaaaggaatta gaatttgaagattccaacattagtctctacaggtttctaggcaataaatcatcttacgat tgttgcaaaagacaaaaggatgctgagaatgagttcaatgaaaccttgagaccaggatat gaaatgatttcaaatcctctagcacaggagattggtgaggaaagaattgaggaacttatt catacaataaatgggaatccagctttatgtccaaatatcacagttcagaaaccttttctc cggctttcaaaagaaggagtttccttctacacagcaattacatcagataagacagnn >gi568815586f:100257392_100520156|GENSCAN_predicted_peptide_2|218_aa MESMLNKLKSTVTKVTADVTSAVMGNPVTREFDVGRHIASGGNGLAWKIFNGTKKSTKQE VAVFVFDKKLIDKYQKFEKDQIIDSLKRGVQQLTRLRHPRLLTVQHPLEESRDCLAFCTE PVFASLANVLGNWENLPSPISPDIKDYKLYDVETKYGLLQSGPSAACLLEFAGGPLQTLF AWVSPADAAEQQRLLPNPSSGSFVLERHLPDASQSSPV >gi568815586f:100257392_100520156|GENSCAN_predicted_CDS_2|657_bp atggagtccatgcttaataaattgaagagtactgttacaaaagtaacagctgatgtcact agtgctgtaatgggaaatcctgtcactagagaatttgatgttggtcgacacattgccagt ggtggcaatgggctagcttggaagatttttaatggcacaaaaaagtcaacaaagcaggaa gtggcagtttttgtctttgataaaaaactgattgacaagtatcaaaaatttgaaaaggat caaatcattgattctctaaaacgaggagtccaacagttaactcggcttcgacaccctcga cttcttactgtccagcatcctttagaagaatccagggattgcttggcattttgtacagaa ccagtttttgccagtttagccaatgttcttggtaactgggaaaatctaccttcccctata tctccagacattaaggattataaactttatgatgtagaaaccaaatatggtttgcttcag tcaggcccctctgctgcatgtctgttggagtttgctggaggtccactccagaccctgttt gcctgggtatcaccagcggacgctgcagaacagcaaagattgctgcctaatccttcctct ggaagctttgtcctagagaggcacctgccagatgccagccagagctctcctgtatga >gi568815586f:100257392_100520156|GENSCAN_predicted_peptide_3|788_aa MKEHFFSASFSDTKLKPGTVSEGLSFLHSSVKMVHGNITPENIILNKSGAWKIMGFDFCV SSTNPSEQEPKFPCKEWDPNLPSLCLPNPEYLAPEYILSVSCETASDMYSLGTVMYAVFN KGKPIFEVNKQDIYKSFSRQLDQLSRLGSSSLTNIPEEVREHVKLLLNVTPTVRPDADQM TKIPFFDDVGAVTLQYFDTLFQRDNLQKSQFFKGLPKVLPKLPKRVIVQRILPCLTSEFV NPDMVPFVLPNVLLIAEECTKEEYVKLILPELGPVFKQQEPIQILLIFLQKMDLLLTKTP PDEIKNSVLPMVYRALEAPSIQIQELCLNIIPTFANLIDYPSMKNALIPRIKNACLQTSS LAVRVNSLVCLGKILEYLDKWFVLDDILPFLQQIPSKEPAVLMGILGIYKCTFTHKKLGI TKEQLAGKVLPHLIPLSIENNLNLNQFNSFISVIKEMLNRLESEHKTKLEQLHIMQEQQK SLDIGNQMNVSEEMKVTNIGNQQIDKVFNNIGADLLTGSESENKEDGLQNKHKRASLTLE EKQKLAKEQEQAQKLKSQQPLKPQVHTPVATVKQTKDLTDTLMDNMSSLTSLSVSTPKSS ASSTFTSVPSMGIGMMFSTPTDNTKRNLTNGLNANMGFQTSGFNMPVNTNQNFYSSPSTV GVTKMTLGTPPTLPNFNALSVPPAGAKQTQQRPTDMSALNNLFGPQKPKVSMNQLSQQKP NQWLNQFVPPQGSPTMGSSVMGTQMNVIGQSAFGMQGNPFFNPQNFAQPPTTMTNSSSAS NDLKDLFG >gi568815586f:100257392_100520156|GENSCAN_predicted_CDS_3|2367_bp atgaaagagcacttcttcagtgcctctttcagtgatacaaagttaaaaccaggtactgtt tctgaaggattgtcattcttgcatagcagtgtgaaaatggtgcatggaaatatcactcct gaaaatataattttgaataaaagtggagcctggaaaataatgggttttgatttttgtgta tcatcaaccaatccttctgaacaagagcctaaatttccttgtaaagaatgggacccaaat ttaccttcattgtgtcttccaaatcctgaatatttggctcctgaatacatactttctgtg agctgtgaaacagccagtgatatgtattctttaggaactgttatgtatgctgtatttaat aaagggaaacctatatttgaagtcaacaagcaagatatttacaagagtttcagtaggcag ttggatcagttgagtcgtttaggatctagttcacttacaaatatacctgaggaagttcgt gaacatgtaaagctactgttaaatgtaactccgactgtaagaccagatgcagatcaaatg acaaagattcccttctttgatgatgttggtgcagtaacactgcaatattttgatacctta ttccaaagagataatcttcagaaatcacagtttttcaaaggactgccaaaggttctacca aaactgcccaagcgtgtcattgtgcagagaattttgccttgtttgacttcagaatttgta aaccctgacatggtaccttttgttttgcccaatgttctacttattgctgaggaatgcacc aaagaagaatatgtcaaattaattcttcctgaacttggccctgtgtttaagcagcaggag ccaatccagattttgttaattttcctacaaaaaatggatttgctactaaccaaaacccct cctgatgagataaagaacagtgttctacccatggtttacagagcactagaagctccttcc attcagatccaggagctctgtctaaacatcattccaacctttgcaaatcttatagactac ccatccatgaaaaacgctttgataccaagaattaaaaatgcttgtctacaaacatcttcc cttgcggttcgtgtaaattcattagtgtgcttaggaaagattttggaatacttggataag tggtttgtacttgatgatatcctacccttcttacaacaaattccatccaaggaacctgcg gtcctcatgggaattttaggtatttacaaatgtacttttactcataagaagttgggaatc accaaagagcagctggccggaaaagtgttgcctcatcttattcccctgagtattgaaaac aatcttaatcttaatcagttcaattctttcatttccgtcataaaagaaatgcttaataga ttggagtctgaacataagactaaactggagcaacttcatataatgcaagaacagcagaaa tctttggatataggaaatcaaatgaatgtttctgaggagatgaaagttacaaatattggg aatcagcaaattgacaaagtttttaacaacattggagcagaccttctgactggcagtgag tccgaaaataaagaggacgggttacagaataaacataaaagagcatcacttacacttgaa gaaaaacaaaaattagcaaaagaacaagagcaggcacagaagctgaaaagccagcagcct cttaaaccccaagtgcacacacctgttgctactgttaaacagactaaggacttgacagac acactgatggataatatgtcatccttgaccagcctttctgttagtacccctaaatcttct gcttcaagtactttcacttctgttccttccatgggcattggtatgatgttttctacacca actgataatacaaagagaaatttgacaaatggcctaaatgccaatatgggctttcagact tcaggattcaacatgcccgttaatacaaaccagaacttctacagtagtccaagcacagtt ggagtgaccaagatgactctgggaacacctcccactttgccaaacttcaatgctttgagt gttcctcctgctggtgcaaagcagacccaacaaagacccacagatatgtctgcccttaat aatctctttggccctcagaaacccaaagttagcatgaaccagttatcacaacagaaacca aatcagtggcttaatcagtttgtacctcctcaaggttctccaactatgggcagttcagta atggggacacagatgaacgtgataggacaatctgcttttggtatgcagggtaatcctttc tttaacccacagaactttgcacagccaccaactactatgaccaatagcagttcagctagc aatgatttaaaagatctttttgggtga >gi568815586f:100257392_100520156|GENSCAN_predicted_peptide_4|598_aa MPFKAFDTFKEKILKPGKEGVKNAVGDSLGILQSIQECARYALPAPSTKKRICEIQRGQV ICPQVTQLAYQWQSWKSNPGYLTAQKPGVFHDTGKIDGTTEEEDNIELNEEGRPVQTSRP SPPLCDCHCCGLPKRYIIAIMSGLGFCISFGIRCNLGVAIVEMVNNSTVYVDGKPEIQTA QFNWDPETVGLIHGSFFWGYIMTQIPGGFISNKFAANRVFGAAIFLTSTLNMFIPSAARV HYGCVMCVRILQGLVEESINNRTTTAHAAAINTVVNVSGEGAHEGSYAGAVVAMPLAGVL VQYIGWSSVFYIYGMFGIIWYMFWLLQAYECPAAHPTISNEEKTYIETSIGEGANVVSLS VGLLSAVPHMVMTIVVPIGGQLADYLRSRQILTTTAVRKIMNCGGFGMEATLLLVVGFSH TKGVAISFLVLAVGFSGFAISGFNVNHLDIAPRYASILMGISNGVGTLSGMVCPLIVGAM TRHKTREEWQNVFLIAALVHYSGVIFYGVFASGEKQEWADPENLSEEKCGIIDQDELAEE IELNHESFASPKKKMSYGATSQNCEVQKKEWKGQRGATLDEEELTSYQNEERNFSTIS >gi568815586f:100257392_100520156|GENSCAN_predicted_CDS_4|1797_bp atgccttttaaagcatttgataccttcaaagaaaaaattctgaaacctgggaaggaagga gtgaagaacgccgtgggagattctttgggaattttacaaagcattcaggagtgtgccaga tatgcacttcctgctccatcaacaaagaagaggatatgtgagattcagagaggccaggtt atttgcccccaagtcacacagctcgcgtatcagtggcagagctggaaatcaaatccaggt tatctgactgcccagaagcctggtgtgttccatgatacaggaaaaatcgatgggacaact gaggaagaagataacattgagctgaatgaagaaggaaggccggtgcagacgtccaggcca agccccccactctgcgactgccactgctgcggcctccccaagcgttacatcattgctatc atgagtgggctgggattctgcatttcctttgggatccggtgcaatcttggagttgccatt gtggaaatggtcaacaatagcaccgtatatgttgatggaaaaccggaaattcagacagca cagtttaactgggatccagaaacagtgggccttatccatggatcttttttctggggctat attatgacacaaattccaggtggtttcatttcaaacaagtttgctgctaacagggtcttt ggagctgccatcttcttaacatcgactctgaacatgtttattccctctgcagccagagtg cattacggatgcgtcatgtgtgtcagaattctgcaaggtttagtggaggaatcaatcaac aacagaacaacaacagcacatgccgctgccatcaacacagtggtaaatgtgtcgggggaa ggggcccatgaaggttcctatgcaggggcagtggttgccatgcccctggctggggtgttg gtgcagtacattggatggtcctctgtcttttatatttatggcatgtttgggattatttgg tacatgttttggctgttgcaggcctatgagtgcccagcagctcatccaacaatatccaat gaggagaagacctatatagagacaagcataggagagggggccaacgtggttagtctaagt gtgggtctcttgtcagcagtcccacacatggttatgacaatcgttgtacctattggagga caattggctgattatttaagaagcagacaaattttaaccacaactgctgtcagaaaaatc atgaactgtggaggttttggcatggaggcaaccttactcctggtggttggcttttcgcat accaaaggggtggctatctcctttctggtacttgctgtaggatttagtggcttcgctatt tcaggttttaatgtcaaccacctggacattgccccacgctatgccagcattctcatgggg atctcaaacggagtgggaaccctctctggaatggtctgtcccctcattgtcggtgcaatg accaggcacaagacccgtgaagaatggcagaatgtgttcctcatagctgccctggtgcat tacagtggtgtgatcttctatggggtctttgcttctggggagaaacaggagtgggctgac ccagagaatctctctgaggagaaatgtggaatcattgaccaggacgaattagctgaggag atagaactcaaccatgagagttttgcgagtcccaaaaagaagatgtcttatggagccacc tcccagaattgtgaagtccagaagaaggaatggaaaggacagagaggagcgacccttgat gaggaagagctgacatcctaccagaatgaagagagaaacttctcaactatatcctaa >gi568815586f:100257392_100520156|GENSCAN_predicted_peptide_5|135_aa MKPRTLAVSVTALKVARLEFVPSDVRMCSAFLPSGAQLASPSGSRTGAAGGAACQSRAVC PHSSALGWLMGLGAVEQGAALIGEAQAAQEPMEGVGGSSMAGCRSRALSRGKAAKARSRV NCQSSFHHFDHTSLK >gi568815586f:100257392_100520156|GENSCAN_predicted_CDS_5|408_bp atgaagccgcggaccctcgcggtgagtgttacagctcttaaggtggcgcgtctggagttt gttccttccgatgttcggatgtgttcggcgtttcttccttctggagcccagctggcttca cccagtggatcccgcaccggggctgcaggtggagctgcctgccagtcccgtgccgtgtgc ccacactcctcagcccttgggtggttgatgggactgggtgccgtggagcagggggcggcg ctcatcggggaggctcaggccgcacaggagcccatggagggggtgggaggctcaagcatg gcgggctgcaggtcccgagccctgtcccgcgggaaggcagctaaggcccggtctagagta aactgccagagctcattccatcactttgatcatacttcactgaagtaa >gi568815586f:100257392_100520156|GENSCAN_predicted_peptide_6|153_aa MNESNEPHIQNRAAECDWAEYKGLTTNGYSIAIGPILKRWKQEEGQGFHDFPSPAVRRGQ RNESETEEPTKYLTTGANQSLHTEQGFIKHLPLKGFEHPLQNSLIRHGSSFDTCLSSQAG AIPEKAHISAAATIASYPNVSNFHSLLRDEDWT >gi568815586f:100257392_100520156|GENSCAN_predicted_CDS_6|462_bp atgaatgaaagtaatgagccccatattcagaatagagctgcagaatgtgactgggcagag tacaaaggcctcacaaccaatggatattccattgcaattggcccaatcttgaagaggtgg aaacaagaggaaggacaaggattccatgactttccctctccagctgtgaggagagggcag agaaatgagtctgaaacagaggagccaaccaagtacttaaccacaggagccaatcagtct ttacacacagaacaagggtttatcaagcatcttccactcaaaggctttgaacaccctttg cagaattccctaatcagacatggttcatcctttgacacttgcctttcttcccaggctgga gcaatcccagagaaggctcacatctcagctgctgccaccatcgcctcatatcctaatgtc tccaattttcatagcctcctccgggacgaggactggacttag >gi568815586f:100257392_100520156|GENSCAN_predicted_peptide_7|212_aa MREWRRGSGRCNREGGKTKEVSCQDPWCGGGQLDATGSSTENCQQRVAVNKLRMVMQFQG LENPIQISPHCSCTPSGFFMEMMSMKPAKGVLTEQVAGPLGQNLEVEPYSQYSNVQFPQV QPQISSSSYYSNLGFYPQQPEEWYSPGIYELRRMPAETLYQGETEVAEMPVTKKPRMGAS AGRIKGDELCVVCGDRASGYHYNALTCEGCKX >gi568815586f:100257392_100520156|GENSCAN_predicted_CDS_7|636_bp atgagggagtggaggcgagggagtgggcgatgcaacagggaaggagggaaaaccaaagag gtcagttgtcaagatccctggtgtgggggaggccagcttgatgccactgggagcagcaca gagaactgtcagcagagggtggctgtgaataagctaagaatggtaatgcagtttcagggg ttagaaaatccaattcaaattagtcctcactgcagctgtacgccgtcaggatttttcatg gaaatgatgagtatgaagcccgcgaaaggtgttttaacagaacaagtggcaggtcctctg ggacagaacctggaagtggaaccatactcgcaatacagcaatgttcagtttccccaagtt caaccacagatttcctcgtcatcctattattccaacctgggtttctacccccagcagcct gaagagtggtactctcctggaatatatgaactcaggcgtatgccagctgagactctctac cagggagaaactgaggtagcagagatgcctgtaacaaagaagccccgcatgggcgcgtca gcagggaggatcaaaggggatgagctgtgtgttgtttgtggagacagagcctctggatac cactataatgcactgacctgtgaggggtgtaaagnn