GENSCAN 1.0    Date run: 5-Nov-116    Time: 04:21:35

Sequence gi568815597f:19565848_19779365 : 213518 bp : 48.69% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  16198  16413  216  0  0   56   38   119 0.354   2.39
 1.02 Term +  23205  23375  171  2  0  109   55    49 0.679   1.53
 1.03 PlyA +  26642  26647    6                               1.05

 2.00 Prom +  30012  30051   40                              -3.86
 2.01 Sngl +  42267  42779  513  2  0   83   42   349 0.986  25.84
 2.02 PlyA +  43800  43805    6                               1.05

 3.00 Prom +  51811  51850   40                              -2.76
 3.01 Init +  81902  82003  102  1  0   88   70    85 0.009   5.30
 3.02 Intr +  85381  85469   89  0  2   43   68    31 0.002  -4.63
 3.03 Intr +  89165  89353  189  2  0  116   75   271 0.054  27.30
 3.04 Intr +  89477  89588  112  2  1  111   68    68 0.999   7.48
 3.05 Term +  91019  91282  264  1  0  128   54   228 0.999  18.91
 3.06 PlyA +  92587  92592    6                               1.05

 4.00 Prom +  93031  93070   40                              -1.76
 4.01 Init +  99907 100620  714  0  0   62   82  1284 0.422 118.08
 4.02 Intr + 112720 112878  159  0  0  134   95    99 0.990  15.38
 4.03 Term + 113072 113521  450  1  0  107   46   640 0.992  56.89
 4.04 PlyA + 115099 115104    6                               1.05

 5.19 PlyA - 115809 115804    6                               1.05
 5.18 Term - 117597 117193  405  0  0   87   47   420 0.905  32.99
 5.17 Intr - 128704 128587  118  0  1  118   89   210 0.989  24.57
 5.16 Intr - 135089 134921  169  0  1   44   62   222 0.591  14.20
 5.15 Intr - 138945 138711  235  0  1   63    6   163 0.297   2.86
 5.14 Intr - 153256 153108  149  2  2   25    9   166 0.000   2.35
 5.13 Intr - 171609 171525   85  0  1  111  105    39 0.972   7.29
 5.12 Intr - 174113 173977  137  0  2   96   82    91 0.817   9.59
 5.11 Intr - 175094 174930  165  0  0   67  119   197 0.968  20.63
 5.10 Intr - 179804 179685  120  0  0   50   56   171 0.758  10.57
 5.09 Intr - 180752 180609  144  0  0  134  110   131 0.999  20.15
 5.08 Intr - 181413 181316   98  2  2  100  109    94 0.400  12.35
 5.07 Intr - 186723 186580  144  2  0   70   55    60 0.394   0.30
 5.06 Intr - 189919 189787  133  2  1   43   43   177 0.650   8.40
 5.05 Intr - 204722 204695   28  2  1  128  115    25 0.573   6.89
 5.04 Intr - 205635 205461  175  2  1   34   86   126 0.175   6.94
 5.03 Intr - 206714 206326  389  2  2   84   43   153 0.149   3.99
 5.02 Intr - 207457 207386   72  1  0  147   65    -6 0.825   2.60
 5.01 Init - 210872 210774   99  2  0   78   75    49 0.609   2.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  31199  31262   64  1  1   80   82    87 0.873   8.61
S.002 Term -  47234  47167   68  0  2   94   49    67 0.850   1.50
S.003 Intr -  81882  81757  126  1  0  109   -1    99 0.855   3.75

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:19565848_19779365|GENSCAN_predicted_peptide_1|128_aa
MSSNQDLWINGSPGGLNMFLKEAREIKIDEGATKLGKIFVSGKNLPGPFRKLKGEGGLSQ
SHCLSAYASHGVGLFDLIGIPQSCYIIYSEYMTSSSLTGKSKHYPNEILADTYQTGNNTK
VSLQPNCW

>gi568815597f:19565848_19779365|GENSCAN_predicted_CDS_1|387_bp
atgtcttccaaccaggacctctggatcaatggtagtcctggtggtctaaacatgttcttg
aaagaagcaagagaaatcaagattgacgaaggagcaacaaagcttggaaagatatttgtc
tctgggaagaatcttccaggccctttcaggaagttgaagggagaaggtggtctttctcag
agccactgcttgagcgcttatgcatcccatggtgtgggcctctttgatttgattggaatc
ccccagtcctgctacattatatattcagagtacatgactagtagctcactcacaggaaag
agcaagcactaccccaatgagatactagcagacacctaccagactggcaacaatactaaa
gtatcattacagcctaactgttggtga

>gi568815597f:19565848_19779365|GENSCAN_predicted_peptide_2|170_aa
MAPGKGKEKKEEQVINLGPQVAEGENVFGVCHIFASFNDTFVHVTDLSGKQTICRVTGGM
KVKADRDESSPYAAMLTTQDVAQRCKELGIIALHIQLRATGGNRTKTLGPGAQSALRALA
CSGMKIGRIEDVTPIPSDSTLRKGVTVVAVCEQDSSKYFLLINCLHVKNK

>gi568815597f:19565848_19779365|GENSCAN_predicted_CDS_2|513_bp
atggcacctggaaaggggaaggaaaagaaggaagaacaggtcatcaacctcggacctcag
gtggctgaaggggagaatgtatttggtgtctgccatatctttgcatccttcaatgacact
tttgtccatgtcacggatctttctggcaaacaaaccatctgccgtgtgactggtgggatg
aaggtgaaggcagaccgagatgaatcctcgccatatgctgctatgttgaccacccaggat
gtggcccagaggtgcaaggagctgggtatcattgccctacacatccaactccgggccaca
ggaggaaataggaccaagacccttggacctggggcccagtcggccctcagagcccttgcc
tgctcgggtatgaagatcgggaggattgaggacgtcacccccatcccctctgacagcact
ctcaggaagggggtcaccgtggtcgccgtctgtgaacaagattcctcaaaatattttctg
ttaataaattgccttcatgtaaaaaataaataa

>gi568815597f:19565848_19779365|GENSCAN_predicted_peptide_3|251_aa
MGRESPALSTEDLSALPSTALHRVVAGTGNRKGLGEPGENAGAPRCWPGEAEAPAPCGEP
ARAWALEATGMMLRVLVGAVLPAMLLAAPPPINKLALFPDKSAWCEAKNITQIVGHSGCE
AKSIQNRACLGQCFSYSVPNTFPQSTESLVHCDSCMPAQSMWEIVTLECPGHEEVPRVDK
LVEKILHCSCQACGKEPSHEGLSVYVQGEDGPGSQPGTHPHPHPHPHPGGQTPEPEDPPG
APHTEEEGAED

>gi568815597f:19565848_19779365|GENSCAN_predicted_CDS_3|756_bp
atggggagggagagccctgccctttccaccgaggatttgtcagccctgccctccacggct
ctgcaccgagtggttgctggaaccggtaacaggaaaggcctgggggagcccggggaaaat
gctggagctccacgctgctggcccggggaggccgaggccccagcgccctgcggagagcct
gctagggcctgggctctggaggccacgggcatgatgcttcgggtcctggtgggggctgtc
ctccctgccatgctactggctgccccaccacccatcaacaagctggcactgttcccagat
aagagtgcctggtgcgaagccaagaacatcacccagatcgtgggccacagcggctgtgag
gccaagtccatccagaacagggcgtgcctaggacagtgcttcagctacagcgtccccaac
accttcccacagtccacagagtccctggttcactgtgactcctgcatgccagcccagtcc
atgtgggagattgtgacgctggagtgcccgggccacgaggaggtgcccagggtggacaag
ctggtggagaagatcctgcactgtagctgccaggcctgcggcaaggagcctagtcacgag
gggctgagcgtctatgtgcagggcgaggacgggccgggatcccagcccggcacccaccct
cacccccatccccacccccatcctggcgggcagacccctgagcccgaggacccccctggg
gccccccacacagaggaagagggggctgaggactga

>gi568815597f:19565848_19779365|GENSCAN_predicted_peptide_4|440_aa
MVPEPGPTANSTPAWGAGPPSAPGGSGWVAAALCVVIALTAAANSLLIALICTQPALRNT
SNFFLVSLFTSDLMVGLVVMPPAMLNALYGRWVLARGLCLLWTAFDVMCCSASILNLCLI
SLDRYLLILSPLRYKLRMTPLRALALVLGAWSLAALASFLPLLLGWHELGHARPPVPGQC
RLLASLPFVLVASGLTFFLPSGAICFTYCRILLAARKQAVQVASLTTGMASQASETLQVP
RTPRPGVESADSRRLATKHSRKALKASLTLGILLGMFFVTWLPFFVANIVQAVCDCISPG
LFDVLTWLGYCNSTMNPIIYPLFMRDFKRALGRFLPCPRCPRERQASLASPSLRTSHSGP
RPGLSLQQVLPLPLPPDSDSDSDAGSGGSSGLRLTAQLLLPGEATQDPPLPTRAAAAVNF
FNIDPAEPELRPHPLGIPTN

>gi568815597f:19565848_19779365|GENSCAN_predicted_CDS_4|1323_bp
atggtcccagagccgggcccaaccgccaatagcaccccggcctggggggcagggccgccg
tcggccccggggggcagcggctgggtggcggccgcgctgtgcgtggtcatcgcgctgacg
gcggcggccaactcgctgctgatcgcgctcatctgcactcagcccgcgctgcgcaacacg
tccaacttcttcctggtgtcgctcttcacgtctgacctgatggtggggctggtggtgatg
ccgccggccatgctgaacgcgctgtacgggcgctgggtgctggcgcgcggcctctgcctg
ctctggaccgccttcgacgtgatgtgctgcagcgcctccatcctcaacctctgcctcatc
agcctggaccgctacctgctcatcctctcgccgctgcgctacaagctgcgcatgacgccc
ctgcgtgccctggccctagtcctgggcgcctggagcctcgccgctctcgcctccttcctg
cccctgctgctgggctggcacgagctgggccacgcacggccacccgtccctggccagtgc
cgcctgctggccagcctgccttttgtccttgtggcgtcgggcctcaccttcttcctgccc
tcgggtgccatatgcttcacctactgcaggatcctgctagctgcccgcaagcaggccgtg
caggtggcctccctcaccaccggcatggccagtcaggcctcggagacgctgcaggtgccc
aggaccccacgcccaggggtggagtctgctgacagcaggcgtctagccacgaagcacagc
aggaaggccctgaaggccagcctgacgctgggcatcctgctgggcatgttctttgtgacc
tggttgcccttctttgtggccaacatagtccaggccgtgtgcgactgcatctccccaggc
ctcttcgatgtcctcacatggctgggttactgtaacagcaccatgaaccccatcatctac
ccactcttcatgcgggacttcaagcgggcgctgggcaggttcctgccatgtccacgctgt
ccccgggagcgccaggccagcctggcctcgccatcactgcgcacctctcacagcggcccc
cggcccggccttagcctacagcaggtgctgccgctgcccctgccgccggactcagattcg
gactcagacgcaggctcaggcggctcctcgggcctgcggctcacggcccagctgctgctt
cctggcgaggccacccaggaccccccgctgcccaccagggccgctgccgccgtcaatttc
ttcaacatcgaccccgcggagcccgagctgcggccgcatccacttggcatccccacgaac
tga

>gi568815597f:19565848_19779365|GENSCAN_predicted_peptide_5|954_aa
MGDEFCQPGRRGWGITVSEKLLATPSKRVIVRRGVDPKGSPNHLISGQLPRNPTVTVRWL
CQKSGVDSAIIPPNALGSPPLCLSSKASFSPRAAPSHPPLGTPVGLHSDAHYGQRLAVAV
HVEGTLPFTSSTFAAASVFQKARPWQHPGEQMAFSGKPFLVAAVTSAGGSGAPSVGCTEL
ISTSVLGSFCTEFMAGLVQWLELSEAVLPTMTAFASGLGGEGADVFVQILLKDPILKDDP
TVITQDLLSFSLKDGHYDARARVLVCHMTSLLQVPLEELDVLEEMFLESLKEIKEEESDT
KKSDLKTEGGGEDKDRKHPLMGDKMTESKTDLWIYEMGNAGVLLFRRMAEASRKKKENRR
KWKRYLLIGLATVGGGTVIGVTGGLAAPLVAAGAATIIGSAGAAALGSAAGIAIMTSLFG
AAGAGLTGYKMKKRVGAIEEFTFLPLTEGRQLHITIAVTGWLASGKYRTFSAPWAALAHS
REQYCLAWEAKYLMELGNALETILSGLANMVAQEALKYTVLSGIVAALTWPASLLSVANV
IDNPWGVCLHRSAEVGKHLAHILLSRQQGRRPVTLIGFSLGARVIYFCLQEMAQEKEMCQ
EGCSSDFFRTQIGANDLFSFDEYRGTWKLQLGGESQVGFSFEWFYVVFLKLSCPWILTVA
QECCYYHRWPQGVPALQAAAPRLEKQSQLLAGQTLPGREGALNGRPPEKAAELMVIPGST
TIWHSPNVARDTDGLFWGLLPDCQGIIEDVILLGAPVEGEAKHWEPFRKVVSGRIINGYC
RGDWLLSFVYRTSSVQLRVAGLQPVLLQDRRVENVDLTSVVSGHLDYAKQMDAILKAVGI
RTKPGWDEKGLLLAPGCLPSEEPRQAAAAASSGETPHQVGQTQGPISGDTSKLAMSTDPS
QAQVPVGLDQSEGASLPAAASPERPPICSHGMDPNPLGCPDCACKTQGPSTGLD

>gi568815597f:19565848_19779365|GENSCAN_predicted_CDS_5|2865_bp
atgggagatgagttctgccagccaggaagaaggggctggggaattactgtttcagagaag
ctgttggcaacaccatccaagagggtgattgtaaggaggggtgttgatcccaaggggagc
cctaatcatctcatttcagggcagcttcccaggaacccgaccgtaacagtgaggtggctg
tgtcagaagtctggtgtggactctgccatcatccccccaaatgctctgggctccccacct
ctgtgtctcagctcaaaggcttccttctcccctagagcggccccatcccatcctcccctg
gggactcctgtaggactgcactcagatgcccactatggacagagactggcggtggcagtg
cacgtggagggcactcttcccttcacttcttccacttttgcagcagccagcgttttccag
aaagcacgtccttggcagcatccaggagagcagatggccttctctgggaagccttttctg
gtggctgccgtcacttctgcaggtggatccggggctccctctgtgggctgcactgaactg
atctccacctctgtcctaggctccttctgcacagagttcatggcaggcctggtgcagtgg
ctggagttgtctgaagctgtcttgccaaccatgactgcttttgcgagcggcctgggaggt
gaaggagcagatgtgtttgttcaaattttactgaaggaccccatcttgaaggacgacccg
acggtgatcactcaggaccttctgagcttctcactcaaggatgggcactatgacgcccgg
gccagagtcctcgtttgccacatgacctccctgctccaagtgcccttggaggagctggat
gtccttgaagagatgttcctggagagcctgaaggaaatcaaagaagaggaatctgacaca
aaaaaaagtgatttaaaaacagaaggtggtggtgaggataaggacagaaaacaccctctt
atgggagataagatgacagagagtaagaccgatttatggatctatgagatgggcaatgcg
ggcgtcttgctcttcaggagaatggccgaggcatcccgaaagaagaaagaaaaccggagg
aaatggaagcgttatctcctgataggcctggcgactgtcggaggcggaacggtgatcggt
gtgactggaggtctagctgcaccccttgttgccgctggagcagcgacgattattggcagc
gccggggcagcggctctgggctcagcagccggcatagccatcatgacctcgctgtttggt
gcagctggagctggcctgacaggatacaagatgaagaagcgagtgggagccattgaagag
ttcacgtttctgcctctgacggagggcaggcagctgcacatcaccatcgccgtcacgggg
tggctcgcttctggcaaataccgcaccttcagtgccccgtgggctgccctggcccacagc
cgtgagcagtactgcctggcctgggaagccaagtacctgatggagctcggcaatgccctg
gagaccatcctcagtggtctcgccaacatggtggcccaggaggccctaaagtacacagtg
ttgtctggcattgtggctgccctgacctggccagcctcactcctcagtgtcgccaatgtc
atcgacaacccctggggggtgtgtctccatcgatcagcagaggttggcaagcacctggcc
cacatcctgctctcccggcagcaggggcgacgacctgtcaccttgattggcttcagcctg
ggagccagagtcatctacttctgtctgcaggagatggctcaagagaaagagatgtgtcag
gaaggctgcagcagcgacttcttcagaacccagattggagctaatgatctcttcagcttt
gatgagtaccggggtacttggaagttacaactaggtggcgagtctcaagttggattttcg
tttgagtggttttacgttgttttcctgaagctcagctgtccgtggatcctgactgtggct
caggaatgctgctattaccaccgctggccacaaggggtcccagccctgcaggcagccgca
ccaaggctcgaaaagcagtcccagctcttagcagggcagactctgccaggcagagaaggc
gccctgaatggccggcccccagagaaagctgctgagctcatggttatccctgggtccaca
accatttggcactcacccaatgtggccagggacactgacggccttttctggggtcttttg
ccagattgccaaggaatcatcgaggacgtcatcctgctgggtgcgcctgtggagggagaa
gccaagcattgggagcctttccggaaggtggtgtccgggaggatcatcaacggctactgc
aggggagactggctgctgagtttcgtgtaccgcacatcctcggtgcagctccgtgtcgcc
ggcctacagcccgtgctgctgcaggacaggagggtggagaacgtggacctgacctctgtg
gtcagcggccacctggactatgccaagcagatggatgccatcctgaaggccgtgggcatc
cgcaccaagccaggctgggacgagaaggggctcttgctggccccaggctgcctgccctcc
gaggagcctcgccaggcagcagctgccgcctcatcaggcgagaccccccaccaggttggg
caaacccagggtcccatatccggagacacctccaaattggccatgtccacagaccccagc
caagcccaggtgccagtagggctggaccagtctgaaggggcctcccttcctgctgctgcc
agccctgaaaggccccccatctgcagccatggcatggaccccaacccactgggctgcccc
gattgtgcctgcaagacccagggccccagcacggggctggactga