GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:44:56 Sequence gi568815587f:121203376_121407509 : 204134 bp : 40.92% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14622 14672 51 2 0 68 33 77 0.158 1.41 1.02 Intr + 16760 16886 127 1 1 30 86 88 0.167 2.13 1.03 Term + 17619 17752 134 1 2 121 39 112 0.572 6.87 1.04 PlyA + 20853 20858 6 1.05 2.00 Prom + 32583 32622 40 -2.85 2.01 Init + 33938 34094 157 1 1 17 95 197 0.846 13.42 2.02 Term + 35744 35898 155 0 2 108 32 75 0.682 1.00 2.03 PlyA + 36957 36962 6 1.05 3.06 PlyA - 37620 37615 6 1.05 3.05 Term - 38634 37778 857 1 2 9 48 397 0.201 19.76 3.04 Intr - 48055 47984 72 2 0 51 92 71 0.092 2.26 3.03 Intr - 50359 50252 108 2 0 55 108 23 0.053 0.34 3.02 Intr - 62071 61937 135 2 0 83 73 120 0.354 9.62 3.01 Init - 62627 62393 235 2 1 83 42 152 0.459 8.45 3.00 Prom - 72066 72027 40 -4.45 4.05 PlyA - 72396 72391 6 1.05 4.04 Term - 73834 73462 373 0 1 36 38 208 0.680 3.78 4.03 Intr - 77182 77032 151 2 1 77 88 136 0.831 10.80 4.02 Intr - 78206 78001 206 1 2 3 74 120 0.222 0.02 4.01 Init - 81459 81287 173 0 2 77 60 142 0.418 9.26 4.00 Prom - 91889 91850 40 -4.05 5.00 Prom + 96743 96782 40 -2.85 5.01 Init + 96998 97004 7 1 1 49 87 0 0.092 -2.78 5.02 Intr + 100059 100210 152 1 2 38 116 75 0.755 4.26 5.03 Intr + 100986 101118 133 2 1 56 115 92 0.935 8.00 5.04 Intr + 103011 103111 101 1 2 131 98 -21 0.846 2.11 5.05 Intr + 111486 111620 135 2 0 34 46 118 0.011 2.04 5.06 Intr + 115093 115129 37 0 1 92 71 24 0.063 -1.98 5.07 Intr + 122961 123190 230 1 2 92 68 138 0.616 8.87 5.08 Term + 127490 127735 246 1 0 38 39 155 0.712 0.41 5.09 PlyA + 128779 128784 6 1.05 6.03 PlyA - 128806 128801 6 1.05 6.02 Term - 131149 131067 83 2 2 146 49 30 0.558 1.78 6.01 Init - 134128 133837 292 1 1 81 90 166 0.743 13.46 6.00 Prom - 137264 137225 40 -9.25 7.00 Prom + 138570 138609 40 -6.65 7.01 Init + 144261 144443 183 2 0 47 102 152 0.833 11.66 7.02 Term + 155938 156069 132 0 0 59 44 177 0.571 7.51 7.03 PlyA + 156158 156163 6 -1.75 8.10 PlyA - 157029 157024 6 1.05 8.09 Term - 159392 158475 918 2 0 27 45 982 0.493 79.02 8.08 Intr - 161204 161164 41 0 2 96 84 26 0.434 0.02 8.07 Intr - 162435 162291 145 0 1 55 61 138 0.490 6.73 8.06 Intr - 164890 164726 165 1 0 74 52 63 0.398 0.54 8.05 Intr - 168196 168029 168 1 0 57 68 110 0.001 5.12 8.04 Intr - 169385 169242 144 2 0 72 15 102 0.000 0.86 8.03 Intr - 178828 178739 90 1 0 118 70 16 0.042 2.07 8.02 Intr - 178905 178878 28 2 1 108 79 -2 0.043 -1.90 8.01 Intr - 190356 190209 148 1 1 75 28 159 0.057 6.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 173163 173370 208 0 1 63 41 160 0.850 4.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_1|103_aa MESTLAAISGRLDEENVDSYKREAGGLSQRQCDNRNKGHLGRRSCEDAMQLTLKMEEGAL FGHHAPCTMVIEKSMLQLSAFPLPVAGYDSFLQAYAPDPNQAH >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_1|312_bp atggaatcaactttggcggccatcagtggtagactggatgaagaaaatgtggattcttat aagagggaagcaggagggttgagccagaggcaatgtgataacagaaacaaaggacaccta ggcagaaggagctgtgaagatgccatgcagctgactttgaagatggaggaaggggccctg tttggacatcatgctccttgtactatggttatagagaagagcatgctccaactttccgcc ttcccgctgcctgttgctggctatgattcatttttgcaggcttatgcacctgatccaaat caagcccattag >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_2|103_aa MWALIKAALEPFQTDHEADSDEEEEGECKKLTSDSECEEQKPEEIKERETEKGPFGPLNN QRADALVSAVFADAQAFHSLTHLNAAGLRKRYGHIQKAGKKER >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_2|312_bp atgtgggcactaataaaagcagctcttgagccatttcaaacagatcatgaggcagattca gatgaggaagaggagggcgagtgtaaaaaactaacttcagattctgaatgtgaggaacag aaaccggaggaaatcaaagaaagggaaaccgaaaaaggaccttttggtcctttaaataat caaagggcagatgcactggtgtctgcagtctttgctgatgcacaagcattccattcttta actcatcttaatgctgcaggccttagaaaaagatatggtcacatacaaaaagctggcaaa aaggaaagataa >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_3|468_aa MAPTDLPHGTQGKIPQHGWAQLLRSKGPAWGLLTHQSASGAAAITRGRGTMATPPVGSAS AVGGACMGLVIHSSASWSNCHLPITDCHSTITDCHLADCCSLTVDYHDAVFFLHPMLSTK LMQVHKRTKETEVNDIFPRKLLSPYPVRILTIPSSVKAPTSWFIVVDRVAATVSCDSKLE SLERRSPTLLDRAAAAQTAAVYLSLPVLLEGLGAGRISALLGAAAAARPPAAGLGLLLRK ASRRESGASGSPTSSELAGQELPSAASRGSSQVQDSGISSLHPRGHRKVPPTQPLQAQVC LLTLPGLSPLPALLQSGSRVKAEPQGHEWQWQADRFLGGRGRVFSKAPPSGQGCQSCTDW SGDLWCLFQAPMDQSIGTSSRLKYIKALGPARAGERTARAEGRGQRNNQTTSCREELSSP LCESFRDLQRGVNYLPAEKILSLQGLLFLLSTACPPLSSLPFPGPSLC >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_3|1407_bp atggcaccaactgatcttccacatgggacccaggggaagatccctcagcatgggtgggct cagctgctgaggtcaaaggggcctgcatgggggctgctaactcaccagtctgccagcgga gcagcagctatcacaagaggcaggggcactatggccaccccaccagtgggctcagcttcc gcagtgggaggagcctgcatggggctggtaattcactcttctgccagctggagcaactgc cacttgccaattactgactgccactcaacgatcactgactgccaccttgcagactgctgc tcactgactgtggactatcacgatgctgtcttctttcttcaccccatgttgagcaccaag ctgatgcaggtacataaaagaacaaaagagactgaagtaaatgatattttccccagaaaa ttactcagcccttatcctgtcagaattctaacaatccctagctctgtaaaagcaccaact tcctggtttatcgttgttgacagggtagcagcaacagtctcttgtgattcgaaattggaa agcctggaacgcagaagccccaccctcctggacagagctgcagctgcccaaactgcagct gtctatctgagcctccctgtgctcttggaggggctgggagcaggcaggatctctgccctc ctgggtgcagctgcagctgcccgacccccggctgcaggcctgggcctcctgctccgcaaa gcgagcaggcgagagtctggagcaagtgggagccccacctcttctgagttggcggggcag gagctccccagtgcagcttcacgggggtcctcccaggtgcaggactcaggcatctccagc ttgcaccctcgggggcacaggaaggtgccccccacccaacccctgcaggctcaagtgtgc ctgctcaccctgcctggcctctctccactccctgccctgctccaatctggaagcagggtt aaggctgagccccagggccatgagtggcagtggcaggcagacagattcttggggggaagg gggcgggtcttcagtaaggccccaccttcaggccagggctgccagtcttgcacggactgg agtggggacttgtggtgccttttccaggcccctatggaccaatctataggcacttcctcc cggctgaagtacataaaagccctgggccctgccagagcaggggagaggacagccagagca gagggcagaggacagagaaacaaccagaccaccagctgcagagaggagctatcctctcct ctctgcgagagcttcagagacctgcagagaggtgtgaactacctgcctgcagagaagatt ctctctctccagggcctcctatttctcctctccacagcctgccccccactctcctctctc ccctttccagggccttctctctgctaa >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_4|300_aa MACSSPAETQAYPRPHVSKSTKTEARVFVVVAARHATAEASALQAATQPLPLWLTAAGQT QHQAVGATKSGIVKGMRQDKLRVHKVAPGSERYYGGCEVPELWKHTLFIGDQTKKQVVKM WGLKGSEATQMRRNQKTNPGNMTEQGSSTPHNNHTSSPAMDPNQEEIPDLPEKESRSQHN TEWGKVEAFPLRTGTRQGCPLLLNIVLAVLARAIRQQKEIKGIQIDKEEVKLSLFTDGMI IYLENPKDSSRKLLELIKEFSEVSRYRINIHKSVALLYTNSDQAENQINNSTPFTIAARK >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_4|903_bp atggcttgttcatctcctgcagaaacacaagcataccctcgtccccatgttagtaaatct accaaaacagaagccagagtttttgtggttgtagctgcaaggcatgccactgccgaagca tctgctctacaagctgcaactcagcctctgcctctttggttaactgctgcaggccagacc caacaccaggccgtgggggctacaaagtctggcatagtcaaaggaatgagacaagacaag ttaagagtacataaagtggctccagggagtgaacgctactatggaggctgcgaagtcccc gagctctggaagcacacactatttattggtgatcaaacaaagaagcaggtggtgaagatg tgggggttgaaaggtagcgaggctacccaaatgagaaggaaccagaaaacaaaccctggt aatatgacagaacaaggctcatcaacaccccacaacaatcacactagttcaccagcaatg gatccaaaccaagaagaaatccctgatttacctgaaaaagaatccaggagccaacataat actgaatggggaaaagttgaagcattccctctgagaacaggaacaagacaaggatgccca ctcctcctcaacatagtactggcagtcctagccagagcaatcagacaacagaaagaaata aagggcatccaaattgataaagaggaagtcaaactgtccctgtttactgacggtatgatc atttaccttgaaaaccctaaggactcctccagaaagctcctagaactgataaaagaattc agcgaagtttccagatacaggattaatatacataaatcagtagctcttctatacaccaac agcgaccaagcagagaatcaaatcaataactcaaccccttttacaatagctgcaagaaaa taa >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_5|346_aa MEATWPEDDIFRQAISLLIVTNVGAYILYFFCATLSYYFVFDHALMKHPQFLKNQVRREI KFTVQALPWISILTVALFLLEIRGYSKLHDDLGEFPYGLFELVVSIISFLFFTDMFIYWI HRGLHHRLVYKAQCSSLNPIPRGYSIQGAILEVGTGPLLDTQPAGALILDYPAFRTSFSP KAIRLGQANYNRTPSLPPTSTSEEGYTRKPQKLNNTARKFYALSLSPLPCPRQRSCHGDE QPALFLPSKKNCGVQHLYSKEYSGLEVHDIALLFPAGDLNQLSNMTRRPHMKDQHTHFIT DEVNTAENIFLHTRALPHCELQNKISTCSLSAAHSHQQEESGIRHT >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_5|1041_bp atggaagccacatggccagaagatgacatcttccgacaagctattagtcttctgattgta acaaatgttggtgcttacatcctttatttcttctgtgcaacactgagctattattttgtc ttcgatcatgcattaatgaaacatccacaatttttaaagaatcaagtccgtcgagagatt aagtttactgtccaggcattgccatggataagtattcttactgttgcactgttcttgctg gagataagaggttacagcaaattacatgatgacctaggagagtttccatatggattgttt gaacttgtcgttagtataatatctttcctctttttcactgacatgttcatctactggatt cacagaggccttcatcatagactggtatataaggcacagtgttcatccctcaaccccatc cccaggggatacagcattcaaggtgccatcttggaagtaggtactgggcccttgctagac acccaaccagctggtgccttgatcttggactacccagccttcagaacttcattcagtcct aaagccatcaggttgggtcaggcaaattacaatcgcacaccctctctccctcccacctct acctctgaagaagggtacactcgcaaaccacagaaactgaacaacactgcccgaaagttc tatgccctttccctgtcacccctaccctgtcccaggcaacgttcctgccatggtgatgag cagcctgctctgtttctgccctcaaaaaagaactgtggtgtccagcatttatacagtaag gaatactcagggctggaggttcatgacatcgcacttctcttccctgctggagatctgaat caattaagtaacatgacaaggcgacctcacatgaaagatcaacatacccactttattact gatgaagtcaatactgcagaaaacatctttctgcacacaagggcacttcctcactgtgaa ctccagaacaagataagcacatgttccctgtcagcagcccattcccatcaacaagaggag agcggaatcaggcacacttag >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_6|124_aa MQMPPTQPSSRTDINPLIGNNISGLWLQEKHPSIHYLHNTHISVKEAHHLFVKRMNTLQV KRESGAVVDHKPENTQQCNPVPMTDMLYGTCLSQSDPGSSSQKLEFQRGWQSCNVPETLG TKGK >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_6|375_bp atgcagatgcctccaacccagccttccagcagaacagatatcaatcctttgattggaaac aacatttctgggctctggttgcaagagaaacacccctccatccactacctccacaataca cacatctctgtgaaggaagctcaccacttatttgtaaaaaggatgaacactcttcaagtt aagagagaatcaggtgctgtggtggaccacaagcctgagaatactcagcagtgcaaccca gttccaatgactgacatgctttatgggacctgtttgagtcagtcagacccaggaagcagc agtcagaaattggaatttcagaggggctggcagagttgcaatgtacctgagacactaggt accaaggggaagtaa >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_7|104_aa MEDHVKTRRMLPDDTAEIGVTQLQAMEATKGWKRQGRILPQVSEGARFCQPLDFELPASR KLSWVTLVDNDAESMSSGNCRPLRRGRKQQPAGKIGMKATCPGH >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_7|315_bp atggaagaccacgtgaagacacggagaatgttacctgatgacacagcagagattggtgtt acgcagctgcaggctatggaggctactaaaggctggaagagacaaggaaggatcctccca caggtttcagagggagcacggttctgccaacccctcgactttgaacttcccgcttccaga aagctcagttgggtaacactggtggacaatgatgcagaaagtatgtcatcaggcaactgc cgacccctcagaagaggaagaaaacagcagcctgcaggcaagattggaatgaaggccact tgtccaggacactga >gi568815587f:121203376_121407509|GENSCAN_predicted_peptide_8|615_aa XQSTRAHPTPAPSTVMVAHKCARRSWGIVARVSGDECLKHRQGQQPTRACPSKLPSLEQA TFVFSHFSGSSTSPKILFPFVPNSIRQELSTIPPLPCLPLTQQKVVGGLTPKTLLTMGSL VTLVAASHLEDVEKEDTVSLAEMLLYVGNKENTPTTNGQTVEEIWGGYLENIVTTLLYDK TQTGMPPIGYGNQELNLGPSTQMPFCLAPQDTGGREKFYLSVYSKPRSDNAVFTSITILC SVLKSFVQSSYWMAGIGPSIFNANLIFTTVQRPSYCYFYLTNEEAEVQLPHPMEQTACSL AQPIMGPSDHRTIAKQIQMVRQVGKGRYGEVWMGKWRGEKVAVKVFFTTEEASWFRETEI YQTVLMRHENILGFIAADIKGTGSWTQLYLITDYHENGSLYDFLKCATLDTRALLKSAYS AACGLCHLHTEIYGTQGKPAIAHRDLKSKNILIKKNGSCCIADLGLAVKFNSDTNEVDVP LNTRVGTKRYMAPEVLDESLNKNHFQPYIMADIYSFGLIIWEMARRCITGGIVEEYQLPY YNMVPSDPSYEDMREVVCVKRLRPIVSNRWNSDECLRAVLKLMSECWAHNPASRLTALRI KKMLAKMVESQDVKI >gi568815587f:121203376_121407509|GENSCAN_predicted_CDS_8|1848_bp nctcagagtacacgcgcacatcctacgccagcaccaagcacagttatggtggcccacaag tgtgcaagaaggtcctggggaattgtggccagagtgtctggagatgagtgcttgaagcac agacagggccagcagcccactcgtgcctgcccttcaaagctcccttccctagagcaggct acctttgtcttttctcatttctccggctcttccacatctcctaaaatcctgttcccattc gttcccaactcaataagacaagagctttccaccatccccccacttccttgtctgccactc actcaacagaaggtagtagggggtctaactcccaagaccctacttactatgggttcccta gtaaccctggtggcagcctctcatctggaagatgtggagaaggaggacactgtcagctta gccgaaatgctcctttatgttgggaacaaggaaaacacaccaacgaccaatggccaaaca gtagaagagatatggggtggatacctggagaacattgtgaccacattattatatgacaag acacagactggcatgccaccaattgggtacggcaaccaggaattaaatctaggaccttca actcagatgcccttttgtttagctcctcaggatactggagggagagaaaaattttacctg agcgtttactcaaaaccaaggtcagacaatgctgtatttacctctatcaccatcctctgc tcagtcctcaagagctttgtgcaaagctcctactggatggcaggcattgggccaagcatt tttaacgccaatttgatcttcacaactgttcagcggccaagttattgctatttctacctt actaatgaggaagctgaagttcagctacctcatccaatggagcagacagcttgctcttta gctcagcctattatgggaccttctgatcatcgaactattgccaaacagattcagatggtc cggcaagttggtaaaggccgatatggagaagtatggatgggcaaatggcgtggcgaaaaa gtggcggtgaaagtattctttaccactgaagaagccagctggtttcgagaaacagaaatc taccaaactgtgctaatgcgccatgaaaacatacttggtttcatagcagcagacattaaa ggtacaggttcctggactcagctctatttgattactgattaccatgaaaatggatctctc tatgacttcctgaaatgtgctacactggacaccagagccctgcttaaatcggcttattca gctgcctgtggtctgtgccacctgcacacagaaatttatggcacccaaggaaagcccgca attgctcatcgagacctaaagagcaaaaacatcctcatcaagaaaaatgggagttgctgc attgctgacctgggccttgctgttaaattcaacagtgacacaaatgaagttgatgtgccc ttgaataccagggtgggcaccaaacgctatatggctccagaagtgctggacgaaagcctg aacaaaaaccacttccagccctacatcatggctgacatctacagcttcggcctaatcatt tgggagatggctcgtcgttgtatcacaggagggattgtggaagagtaccaattgccatat tacaacatggtaccgagtgatccgtcatacgaagatatgcgtgaggttgtgtgtgtcaaa cgtttgcggccaattgtgtctaatcggtggaacagtgatgaatgtctacgagcagttttg aagctaatgtcagaatgctgggcccacaatccagcctccagactcacagcgttgagaatt aagaagatgcttgccaagatggttgaatcccaagatgtaaaaatctga