GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:31:19 Sequence gi568815590f:94925009_95134069 : 209061 bp : 42.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 985 980 6 1.05 1.05 Term - 5720 5471 250 1 1 105 38 157 0.893 6.59 1.04 Intr - 11545 11351 195 0 0 49 81 112 0.352 4.21 1.03 Intr - 15212 14852 361 0 1 -4 98 238 0.372 8.95 1.02 Intr - 15924 15822 103 0 1 48 82 148 0.057 9.03 1.01 Init - 19627 19571 57 1 0 60 81 55 0.212 1.37 1.00 Prom - 22423 22384 40 -7.55 2.09 PlyA - 23241 23236 6 1.05 2.08 Term - 23818 23544 275 2 2 29 46 252 0.982 9.75 2.07 Intr - 24577 24442 136 1 1 115 36 39 0.549 0.62 2.06 Intr - 25026 24965 62 1 2 81 75 47 0.318 0.23 2.05 Intr - 33120 32921 200 2 2 64 89 118 0.411 7.57 2.04 Intr - 35722 35632 91 2 1 68 81 51 0.121 0.53 2.03 Intr - 38741 38558 184 2 1 79 37 108 0.276 3.24 2.02 Intr - 59131 59016 116 2 2 38 100 86 0.005 4.05 2.01 Init - 78679 78595 85 1 1 20 90 68 0.126 1.23 2.00 Prom - 80526 80487 40 -5.15 3.00 Prom + 95449 95488 40 -4.05 3.01 Init + 100001 100197 197 1 2 109 89 198 0.043 18.45 3.02 Intr + 106987 107090 104 1 2 112 27 86 0.007 3.80 3.03 Intr + 109383 109399 17 1 2 69 72 17 0.028 -7.46 3.04 Intr + 110446 110568 123 0 0 65 93 122 0.974 10.26 3.05 Intr + 116562 116618 57 2 0 101 70 53 0.837 2.96 3.06 Intr + 123449 123550 102 1 0 67 103 79 0.966 6.75 3.07 Intr + 127166 127222 57 1 0 120 127 23 0.959 7.56 3.08 Term + 132056 132157 102 1 0 79 42 78 0.478 -0.40 3.09 PlyA + 132178 132183 6 1.05 4.11 PlyA - 133037 133032 6 1.05 4.10 Term - 138020 137799 222 2 0 106 39 39 0.066 -3.27 4.09 Intr - 142251 142157 95 1 2 110 84 43 0.115 4.86 4.08 Intr - 145028 144966 63 0 0 110 32 57 0.080 0.07 4.07 Intr - 146597 146476 122 1 2 55 89 58 0.276 1.82 4.06 Intr - 146899 146738 162 0 0 72 38 102 0.259 1.77 4.05 Intr - 148320 147709 612 2 0 80 94 417 0.304 32.07 4.04 Intr - 159832 159786 47 1 2 50 80 53 0.016 -3.01 4.03 Intr - 161155 161027 129 1 0 68 94 15 0.429 0.07 4.02 Intr - 161270 161232 39 2 0 78 97 42 0.684 1.60 4.01 Init - 163234 162956 279 1 0 93 57 235 0.604 16.21 4.00 Prom - 166019 165980 40 -6.85 5.00 Prom + 166328 166367 40 -5.45 5.01 Init + 170283 170500 218 2 2 86 66 138 0.454 9.61 5.02 Intr + 177526 177664 139 1 1 12 107 48 0.281 -1.35 5.03 Intr + 177968 178003 36 1 0 118 64 34 0.113 1.54 5.04 Intr + 184736 184845 110 1 2 95 109 54 0.948 6.36 5.05 Term + 189327 189507 181 0 1 50 32 158 0.548 2.50 5.06 PlyA + 191804 191809 6 1.05 6.03 PlyA - 193427 193422 6 1.05 6.02 Term - 198851 198726 126 2 0 98 44 115 0.783 5.40 6.01 Intr - 208951 208832 120 1 0 42 62 107 0.223 3.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 182325 182474 150 2 0 74 41 82 0.872 2.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:94925009_95134069|GENSCAN_predicted_peptide_1|321_aa MKDELLPHLAGLLSTLSPKRLNKMFVGEVSSSSNQEPEFNEKEDDEWILVDFIDTCTGFS AEEEEEEEDISEESPTEHPSVFSCLPASLECLADTSDSCFLQFESCPMEESWFITPPPCF TAGGLTTIKVETSPMENLLIEHPSMSVYAVHNSCPGLSEATRGTDELHSPSSPRKTACSS AHRADLFLPSLIKVASFQTRCCLFTLELPSTTKQLFVSQLRAIGSVESSSSHSSRVSPRV EAQNEMGQHIHCYVAALAAHTTFLEQPKSFRPSQWIKEHSERQPLNRNSLRRQNLTRDCH PRQVKHNGWVVHQPCPRQYNY >gi568815590f:94925009_95134069|GENSCAN_predicted_CDS_1|966_bp atgaaggatgagctcctccctcacttggctgggcttctgagcactctgtcgcccaagagg ctgaataaaatgtttgtgggtgaagtcagttcttcctccaaccaagaaccagaattcaat gagaaagaagatgatgaatggattcttgttgacttcatagatacttgcactggtttctca gcagaagaagaagaagaagaggaggacatcagtgaagagtcacctactgagcacccttca gtcttttcctgtttaccggcatctcttgagtgcttggctgatacaagtgattcctgcttt ctccagtttgagtcatgtccaatggaggagagctggtttatcaccccacccccatgtttt actgcaggtggattaaccactatcaaggtggaaacaagtcctatggaaaaccttctcatt gaacatcccagcatgtctgtctatgctgtgcataactcctgccctggtctcagtgaggcc acccgtgggactgatgaattacatagcccaagtagtcccaggaaaacagcgtgcagttca gcccacagagccgatttgtttttaccttctttgatcaaagtagcatccttccaaacaaga tgctgtctgttcacgttggaactgccatccacaaccaagcagctctttgtcagtcagttg agagctattggcagtgtggaatccagctcctcacacagttccagagtcagtccaagagtg gaagctcaaaatgaaatggggcagcatattcattgttatgttgcagctcttgctgctcat acaacttttctggaacaacccaagagctttcgcccttcccagtggataaaagaacacagt gaaagacagcctcttaacagaaatagccttcgtcgccaaaatcttaccagggattgccac cctcggcaagtcaagcacaatggctgggttgttcatcagccctgcccgcgtcagtacaat tactaa >gi568815590f:94925009_95134069|GENSCAN_predicted_peptide_2|382_aa MLRKVNKNSLARTRQWKDGSEARHKVLKGPLVSASARKELGSPFYLNSNYKAGQAEKSTT LLAPVRKTCCHGYLPTFTISHLVLWTFEFPEVGSAGSLGFGILYYLVRCLATALVPTRLQ LSQPLALKGGFHERKEDINIHGLKTNTLEDTEMEHLAESQDSMATNHSSRTACAEVTTDH HLLCWTPADSSRAGTMSERPKKRPGDNKRDIGFVGWIYKQGWSNGGRRNPERAASPAGSR WRFLQQAFTLSSRFLLPGLVSELSAPPAGRQVTCVGDNLSQGGPGAPAPPIAWLVFLAAA GPPPLTPPHSPPLLLLRLLSAVGWGRPFGTRRRGSRGHPRDASPKTAALGLVARRSVPAL RSRPLRLPSEFRRPCGYGEPAP >gi568815590f:94925009_95134069|GENSCAN_predicted_CDS_2|1149_bp atgttaagaaaggtgaacaagaactccttagcaagaactcgccaatggaaagatggtagt gaagccagacacaaagtgttgaagggacctctggtttctgcttctgcacgtaaggagctt ggaagtccattctatcttaacagcaactacaaagctggacaggctgaaaaatcaacaact cttcttgcacctgtaagaaagacctgctgtcatgggtatttgccaaccttcacaatttcc cacctggttctttggacttttgagtttcctgaagttggctctgcaggcagcttgggtttt ggaatcctctattacctagttagatgtctggctacggcccttgttccaacccgtctccag ctgagccagcctctggccctcaagggtggtttccatgagaggaaagaggacataaacata catggtctcaaaacaaacactttggaggatactgaaatggagcatttagcagaaagtcag gactccatggccaccaaccactcctcacgaactgcctgtgccgaagtcaccactgaccac catctgttgtgctggactcctgctgactccagtagggctggcaccatgtcagaaaggccg aagaagagacctggagacaacaaacgagacatagggtttgttggatggatttacaaacag ggatggtccaatggcggcaggcggaatccagagcgtgcagcaagcccggccggctctcgg tggcggtttctacagcaggccttcacgctcagctcccggtttttgttgcccgggcttgtt tcggagctgagcgcgccgccggccgggcgccaggtcacgtgcgttggtgacaacctctcg cagggcggccccggggcccccgcaccgccgattgcgtggcttgttttcttggccgcggcg ggacctcctcctctcacccctcctcactcccctccactcctcctcctccgcctgctctcg gccgttggatggggccgccccttcgggactcggcgtcggggctcccgcggccacccccgg gacgcatctccgaagacagcggcgcttgggcttgtggcccggcgctctgtccccgccctg cgatcccgtcccctgcgcctgccctccgagttccggaggccctgcggctatggggaacct gctccgtga >gi568815590f:94925009_95134069|GENSCAN_predicted_peptide_3|252_aa MAASAHGSVWGPLRLGIPGLCCRRPPLGLYARMRRLPGPEVSGRSVAAASGPGAWGTDHY CLELLRKRDYEGYLCSLLLPAESRSSVFALRAFNVELAQAVFSNDPVKDSVSEKTIGLMR MQFWKKTVEDIYCDNPPHQPVAIELWKAVKRHNLTKRWLMKIVDERHGVSQEDFLRRNQD KNVRDVIYDIASQAHLHLKHARSFHKTVPVKAFPAFLQTVLCRRTLGGAAYVDEEVSCRL ERWSTKLSDLPV >gi568815590f:94925009_95134069|GENSCAN_predicted_CDS_3|759_bp atggcggcctccgcgcacggctctgtctgggggccgttgcggcttggcatccccggcctg tgctgccgccggccgcctctgggtctgtacgcgcgcatgcggcggctgcccgggccggag gtgtctgggcggagcgtggctgcggccagcggaccgggcgcctggggcactgaccactac tgcctggagctgctgcggaaacgggattatgaaggttatttatgctccctgctgctccct gcagaatcccgaagctctgtttttgcactgagggcctttaatgtggaactggctcaggct gttttctcaaatgacccagttaaagactcagtctctgagaaaacaattggactgatgcga atgcagttttggaaaaaaactgtggaagatatatactgtgacaatccaccacatcagcct gtggccattgaactatggaaggctgttaaaagacataatctgactaaaagatggcttatg aaaatcgtcgatgaaagacatggtgtttcacaagaggactttctacggaggaaccaagat aaaaatgtgagagatgtaatatatgacattgccagtcaagcacacttgcacctaaagcat gctaggtcctttcacaaaactgttcctgtgaaagcatttcctgcttttcttcagacggtt ctgtgccgcagaactctagggggcgctgcttatgtggacgaggaggtctcgtgccgctta gaaaggtggagcacaaaactaagtgacctgcctgtttaa >gi568815590f:94925009_95134069|GENSCAN_predicted_peptide_4|589_aa MGIAVAEVRGSLWEAVSPGGHLVLIAMRFAGSASSITAAAVQAPGCGQPHGGGEQVAQGA RASMASAGSASLQCCQEEFMSCLSSATRKQPDGLALPVAILTARLKSPVLAMGRQDVILD SHHNPGWSRGSYENIAYRCKEVCRSSCCWFYLWESLVAKAKSYDSGFWHPQRKKRGDPLW PQEAQKHCPGCARKGENLGDPPKAPLGLSTELAQLRAATRDAASFRPVGALLALSASIPS FGPRSHISNDTLADSPASLAASLQRADASPEVVRKGGKAGQPRGSPQPWRSGAEEIVEVG LLPLTPRALLFPQVPTALPAVYHFLAACEGICPHSCAAPSMIELWDFLIQQHQPEKAVAC TTTHIPGLRISALNKHRAWGGTDDDVAASEMPLGAAGAEPWDWNDFCPLPSKFPQATPLP PLGVQVPEALREAAKVTGVGSGATADLLAPTAGVGTSRTACGGGTQFMSEGAPGVMNNGS PDWSSTQFMSEGAPGVMNNGSPDWSSNLSQPQPMHLLYLTTWFLERPMCCPDTTALTQTL PVPSFSYPVCTSHIRQNPHLHFLRAPSPHYTTACSFEPQRFLEQQQDLL >gi568815590f:94925009_95134069|GENSCAN_predicted_CDS_4|1770_bp atgggcatagctgtggcagaggtgagaggcagcctgtgggaggctgtgagcccaggtggc cacctggtccttatagccatgaggtttgcaggcagtgcctcaagcatcacagcagctgca gtgcaggcacctggctgtggacaacctcatgggggtggcgagcaggtggcccaaggtgcc agggccagcatggcatccgcagggtcagctagtcttcagtgctgccaagaagagtttatg agctgcctctcctctgctaccaggaaacagcctgatgggttggcattacctgtcgctatc ctcacagcccgattaaagtcccctgttttggccatgggtagacaggacgttattctggat tctcaccataacccaggctggtctagaggcagttatgagaacatagcctatagatgcaag gaagtttgtcgctcttcctgttgctggttctatttgtgggaaagtttggttgcaaaagct aaatcctacgacagtggcttttggcatccccagaggaaaaaacggggagacccgctttgg cctcaggaagcgcagaaacactgtcctggctgcgcgcgcaagggcgagaacctaggggac ccgcctaaagcgccgttggggctgagcaccgaactcgcacagctccgggcagcgacccgc gatgcggcatccttccgtccggtcggagccctcctagccctcagcgcgtctattcctagc ttcggtccccgcagccatataagcaatgacaccctcgccgactcgcctgccagcttggcc gcctccctgcagagggctgacgcttctccagaagttgtaagaaaaggagggaaagcaggc caacctcgaggatctccccagccttggcgttcaggtgctgaggagatcgtcgaggttggc ctgcttcccctcactcctcgggccttgctcttcccccaggtgcccactgccctccctgct gtgtaccactttcttgctgcctgtgaaggtatttgtcctcactcttgtgctgctccttca atgatagaactttgggactttctcatccagcagcaccagcctgaaaaagcagtggcttgc accaccacacacatacctggtctcaggatctcagccctaaacaagcacagggcgtggggt ggtacagatgatgatgtagctgcttctgagatgcccctgggcgctgctggagctgagccc tgggactggaatgacttttgcccgctgccctcaaaattcccacaggccacaccgctacca cctcttggggtgcaggtccctgaagccctcagggaagctgcaaaggttactggggtcggc tctggggcaacagctgacctcttagctcccactgcaggcgtaggaacctccaggacagcc tgtgggggtggtacacagttcatgagtgaaggagctcctggagtcatgaataatggaagt ccagattggtcaagtacacagttcatgagtgaaggagctcctggagtcatgaataatgga agtccagattggtcaagtaacttgtcccagccccagcccatgcatctgctgtacttaaca acctggtttcttgaacgtcccatgtgttgcccagacaccacagctttgacacagactctt ccagttccatctttctcatatccagtttgtacttcacacatcagacagaatccccattta catttcctgagagcccccagcccacattatactacagcatgttcattcgaaccacaaaga tttttggagcagcaacaggatctactttag >gi568815590f:94925009_95134069|GENSCAN_predicted_peptide_5|227_aa MGVKLAEKIVTLVLLEKEGCEGLVDTVIRMGSRSLILGRSLCPVASSQPRCKLKAIHPEP QGGVQGNPAHVGRGVAVMQCQWNLLEIYSLELAGNSSLGCSGKQFMGKYFPWVPEGGALA LQSHVPYDDKMDLCDYIGSPRLAPACFADKETEIQIGKVKMTYPRSCRQVEVNVREAHRT AKAAGAVSSPVSDFPVISPLIPKAMFAESQWHFLELPMEVYFTGTFK >gi568815590f:94925009_95134069|GENSCAN_predicted_CDS_5|684_bp atgggggtgaagctagctgagaagatagtcaccttagtactgctggagaaggaaggatgt gagggtttggtggacacagtcatccgaatgggatccaggtcattaatccttggccgctct ctgtgcccagtggcctcctctcaaccaaggtgcaaactaaaggcaattcaccctgaaccc cagggtggagtgcaggggaatccagcacacgtgggaagaggagttgctgttatgcagtgt cagtggaacttgctagaaatctattccctggaacttgctggaaattcatctctaggatgc tcaggaaagcaattcatgggaaagtatttcccatgggtacctgagggaggtgccctggct ctacagtctcatgtaccttatgacgacaagatggacctttgtgattacattgggtcacct agactagcccctgcctgctttgctgataaggaaactgaaattcagattggtaaagtgaaa atgacttatccgagatcatgcaggcaggtggaggtgaatgtgagggaggcccacaggacc gcaaaagcagctggtgctgtgagttcaccagtaagtgactttcctgtgatttccccgctt ataccaaaagccatgtttgctgaaagtcaatggcattttctggaactgcctatggaggtg tacttcacaggaacttttaaataa >gi568815590f:94925009_95134069|GENSCAN_predicted_peptide_6|81_aa GADSARSPPGPSRVCAVRSPAQPARSRRGGRAGRQRSRSQAEHHMEAATAWGFHPLKPQP ELYIGPFQPWLEWLEYKAPSP >gi568815590f:94925009_95134069|GENSCAN_predicted_CDS_6|246_bp ggcgcggactctgcccgttccccgccggggccttccagggtgtgtgctgtccgcagcccc gcgcagccggcgcgatccagacgcggtgggcgggctggccgccagcgcagccgctcgcag gctgaacaccacatggaagctgccacagcttggggcttccaccctctgaagccacagccc gagctctacattggcccctttcagccatggctggagtggctggaatacaaggcaccaagt ccctag