GENSCAN 1.0    Date run: 5-Nov-116    Time: 06:13:59

Sequence gi568815592f:12414110_12614721 : 200612 bp : 39.01% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  18112  18292  181  0  1   28   58   171 0.588   7.49
 1.02 Term +  27374  27492  119  0  2   55   44   126 0.228   2.62
 1.03 PlyA +  28232  28237    6                               1.05

 2.00 Prom +  35279  35318   40                              -4.05
 2.01 Sngl +  35589  35765  177  2  0   83   43   189 0.751   8.60
 2.02 PlyA +  37796  37801    6                               1.05

 3.00 Prom +  40522  40561   40                              -5.85
 3.01 Init +  57637  57692   56  0  2   60   85    32 0.661   0.91
 3.02 Intr +  59890  60094  205  1  1   59   58   171 0.324   9.48
 3.03 Intr +  79556  79760  205  1  1   76   84   104 0.843   6.65
 3.04 Term +  79977  80590  614  1  2   62   53   176 0.833   5.15
 3.05 PlyA +  82591  82596    6                               1.05

 4.00 Prom +  97286  97325   40                              -4.25
 4.01 Sngl + 100001 100615  615  1  0   99   34   656 0.999  57.24
 4.02 PlyA + 100645 100650    6                               1.05

 5.04 PlyA - 100708 100703    6                               1.05
 5.03 Term - 101909 101810  100  1  1   66   38    56 0.222  -4.98
 5.02 Intr - 102458 102352  107  2  2   53   68   135 0.226   6.09
 5.01 Init - 107929 107867   63  1  0   67   72    32 0.247   0.70
 5.00 Prom - 108863 108824   40                              -2.25

 6.00 Prom + 115067 115106   40                              -3.45
 6.01 Init + 117156 117180   25  2  1   79  111    15 0.680   2.64
 6.02 Intr + 120130 120284  155  2  2  110   23    81 0.512   2.67
 6.03 Intr + 120733 120937  205  0  1   68   35   143 0.554   4.85
 6.04 Term + 121559 121719  161  0  2   70   41   136 0.561   4.22
 6.05 PlyA + 123191 123196    6                               1.05

 7.04 PlyA - 123238 123233    6                               1.05
 7.03 Term - 123610 123605    6  1  0  108   41     0 0.489  -5.81
 7.02 Intr - 125021 124853  169  1  1   71   77   164 0.730  12.63
 7.01 Init - 125773 125679   95  1  2   73   28    94 0.364   1.80
 7.00 Prom - 128560 128521   40                              -2.95

 8.00 Prom + 133963 134002   40                              -8.85
 8.01 Sngl + 136472 137146  675  1  0   59   43   323 0.486  20.83
 8.02 PlyA + 137374 137379    6                               1.05

 9.00 Prom + 137543 137582   40                              -6.15
 9.01 Init + 138490 138653  164  0  2   70   72   131 0.272   8.95
 9.02 Intr + 139318 140774 1457  1  2   19   50   267 0.050   4.39
 9.03 Intr + 142400 142515  116  0  2   55   92    94 0.315   5.75
 9.04 Intr + 145942 146112  171  0  0   61   60   145 0.438   8.22
 9.05 Intr + 154045 154155  111  0  0   55    2   133 0.018   1.06
 9.06 Intr + 163404 163653  250  2  1   73   92   134 0.155   8.39
 9.07 Intr + 178716 178854  139  1  1   79   65   113 0.363   6.70
 9.08 Intr + 179707 179793   87  1  0   12   76   110 0.148   0.37
 9.09 Intr + 187919 187971   53  2  2   90   59    68 0.040   1.73
 9.10 Term + 196912 197009   98  2  2   55   40   134 0.029   2.45
 9.11 PlyA + 197228 197233    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_1|99_aa
MLHIVHQTYSEEHSAGNGLGGELAIWLQEAQQSHPELHDTSYTEEQRCPRVHVIHLTSQH
GGGEFGHRHTGKAMRIRRRDRSDASISQVAPRIAGSSRN

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_1|300_bp
atgctccacattgtacaccagacttactcagaggagcacagtgctgggaatggcttgggt
ggtgaacttgcaatatggcttcaagaggcccagcagtcccatccagagttgcatgacact
tcttatacagaagagcagagatgcccaagagttcatgtcattcacttaacaagtcagcat
ggaggaggagaatttggacacagacatacggggaaggccatgagaattagaagaagagat
cggagtgatgcatctataagccaagtagcaccaaggattgctggcagcagcagaaactag

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_2|58_aa
MGSQPRNAGAIRYRRRQRMDSPPEPLQTLDFSLVKLVLSSYPPELRDLESLLFKAPSL

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_2|177_bp
atggggtcacagccaaggaatgctggagccatcagataccggaggaggcaaagaatggat
tctcccccagaacctctgcagactcttgattttagcctagtaaaactggtgctgtcctcc
taccctccagaactacgagatcttgagtctctgttgtttaaagcaccaagtttgtga

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_3|359_aa
MESVCADLFMCPPSGEGDWSAAGEAHKIKTLAVDPGTDRHKQAIVGITCTSVEEQGGLLL
STWEDHQGNPTNNWERSGVVSEVLVVQMQLEGTSLSKRLDIGKTGTLLADLQREGIENSW
REDTEAGLEREEAENPTQSCHALGLFLAPNNSCGGGDFNPRKTVCSVCCRAILPMRRSQS
DLSTPQSAGVFWSPSLAMPACNAAPRYLLGSHIIVPALVDCVPDQQSTPVEWPQWTGTSL
PAPRPHCSLLHATLPTDTRPRPPPILLSQHTCVGGPTFPFPASVWVHVHPAKPLLQVRMH
PALLPLAIPPLSWAHWQSWSPLAPPLPALHSALELLVGNQARRTVDLPPAQSGHYHSCE

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_3|1080_bp
atggaaagcgtctgtgcagacttgttcatgtgtccgccctctggagaaggggactggtct
gcagctggggaagcacataagatcaaaactctggcagtggatccaggtactgacaggcac
aagcaggcaatagtgggcatcacatgtaccagtgtggaagagcagggagggctgctgctg
agcacatgggaggaccaccaagggaaccctacaaataactgggagagatctggggtggtg
agcgaggtcctcgttgtccaaatgcagttggaaggaacatctctctccaagagactagac
attgggaagactggcacactcctagcagatcttcagagggaaggcattgagaacagttgg
agggaagacacagaagctgggctggagagggaggaagctgagaaccccacacagagctgc
catgcactgggactattcctagcccccaacaattcctgcggagggggagactttaaccct
aggaaaactgtctgctctgtgtgctgcagggcgatcttgcccatgagaagaagccagtct
gatctgagcacccctcagtctgctggtgtcttctggagccccagcctggccatgcctgct
tgcaatgcagcccccagatacctcctggggtcccacatcatagttcctgccctggtggac
tgtgtccccgaccagcagagtactccagtagagtggccccagtggacaggaaccagccta
cctgcgccccgcccccactgcagcctccttcatgccaccttgcctacagacactcgccca
aggccaccccctatattgctttcccagcacacatgtgtgggtggccctacctttcccttc
cctgccagtgtgtgggtgcatgtgcaccctgccaagccactgctgcaagtgcgaatgcac
cccgccctccttcctctcgccataccaccattgtcatgggcacattggcagtcatggagc
cctctagccccacccttgccagcacttcactctgcgctggagctgctagtgggaaatcag
gcaaggagaacagtggacctgcccccagcccagagtggccactaccactcatgtgaatga

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_4|204_aa
MGAYKYIQELWRKKQSDVMRFLLRVRCWQYRQLSALHRAPRPTRPDKARRLGYKAKQGYV
IYRIRVRRGGRKRPVPKGATYGKPVHHGVNQLKFARSLQSVAEERVGRHCGALRVLNSYW
VGEDSTYKFFEVILIDPFHKAIRRNPDTQWITKPVHKHREMRGLTSAGRKSRGLGKGHKF
HHTIGGSRRAAWRRRNTLQLHRYR

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_4|615_bp
atgggtgcatacaagtacatccaggagctatggagaaagaagcagtctgatgtcatgcgc
tttcttctgagggtccgctgctggcagtaccgccagctctctgctctccacagggctccc
cgccccacccggcctgataaagcgcgccgactgggctacaaggccaagcaaggttacgtt
atatataggattcgtgttcgccgtggtggccgaaaacgcccagttcctaagggtgcaact
tacggcaagcctgtccatcatggtgttaaccagctaaagtttgctcgaagccttcagtcc
gttgcagaggagcgagttggacgccactgtggggctctgagggtcctgaattcttactgg
gttggtgaagattccacatacaaattttttgaggttatcctcattgatccattccataaa
gctatcagaagaaatcctgacacccagtggatcaccaaaccagtccacaagcacagggag
atgcgtgggctgacatctgcaggccgaaagagccgtggccttggaaagggccataagttc
caccacactattggtggctctcgccgggcagcttggagaaggcgcaatactctccagctc
caccgttaccgctaa

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_5|89_aa
MDTKKGAIVTGSYLKMEGERRSRAQIVTAMFGYRSTWPELRSGDEGDTVRLTGSKASLAD
CQKIHDNRHAPFQENSDSLIEWLVYYRII

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_5|270_bp
atggacacaaagaagggagcaatagtcactgggtcctacttgaagatggagggtgagagg
aggtccagggctcagattgtcactgctatgtttggctataggagcacatggcctgagctg
cgatccggggatgagggagacaccgttagactgactggctccaaagccagtttagcagac
tgccaaaagattcatgacaacagacatgcaccttttcaagagaattcagatagccttata
gaatggctggtttattacagaatcatctaa

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_6|181_aa
MDESSRNIERLESITAILALQGWVGAFSHYPGGSWNPLPLLLEKAGGSLKENSSKLATEP
QHLHPEGDGSLAFVSWCHGEPLSGACEFLEALQCMEDPLKITAAKNGSRFPTTGGIMLQI
TCLSRMMDEGSCRKILQAYPDKSPRTVTYVHFSITLEGIQFYVCRAAVARETEVLVRMQL
N

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_6|546_bp
atggacgaatcttcaaggaatattgaaaggctggagtctatcacggcgattctggccctg
cagggctgggtgggtgcattctctcattaccctggaggttcctggaatccactaccactt
ctcctagagaaagctggtgggtctttgaaagaaaactcttctaagctagctactgagcct
cagcatttacatcctgaaggtgatggaagccttgcatttgtgtcttggtgccatggggag
cctctgtctggagcctgtgaatttctagaggcccttcaatgcatggaagaccctctaaaa
attacagctgctaaaaatgggagcagattccctaccactgggggcatcatgctgcaaatt
acatgtctatccagaatgatggatgagggaagctgcaggaagattttgcaagcatacccg
gataaaagcccccgtacagtcacttatgtgcacttctctataacactggagggaattcag
ttttatgtctgcagagctgctgttgccagggaaacagaggtcctagtcagaatgcaactg
aattaa

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_7|89_aa
MKDCTRLRIFAGYLVPKIIPSRPSHWAVPQIGLNFPPRCQTTENSRFLEIGQHRNDLQPV
LPWRPLQCVGTSHGAACLRADLTEGKGKA

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_7|270_bp
atgaaggattgtaccaggttaaggatctttgcaggttacctggtgcccaagatcatccct
tcaaggccatcacactgggctgtccctcagattgggttaaactttccaccccgatgccag
accacagagaacagccgcttcctggaaattgggcagcaccgaaatgaccttcagccagtg
ttgccctggagaccgttgcagtgtgttggaaccagccacggggctgcgtgtctccgtgct
gacctcactgaagggaagggcaaggcataa

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_8|224_aa
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD
IIQENFPNLARQANIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAGEKGPV
IHKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISFPAKLSFINEGVIKYFTDF
TDFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKL

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_8|675_bp
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacattcagattcaggaaata
cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagagcagccggagagaaaggtccggtt
atccacaaagggaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatttccagccaaactaagcttcataaatgaaggagtaataaaatactttacagacttt
acagactttacagacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaa
gagctcctgaaggaagcactaaacatggaaaggaacaaccgataccagccactgcaaaat
catgccaaattgtaa

>gi568815592f:12414110_12614721|GENSCAN_predicted_peptide_9|881_aa
MDKFLDTYTLPRLNQEEVDSQNRPITGSEIVAIINSLPTKKSPGPDGFTAKFYQRKPHVS
AQNLRKLISNFSKVSGYKINVQKSQAFLYTNNRQAESQIMSELPFTIASKRIKYLGIQLT
RDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPM
TFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNR
DIDQWNRTEPSEIMLHIYNYLIFDKPEKNKKWGKDSLFNKWCWENWIAICRKLKLDPFLT
PYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSRTPKAMATQAKIDKWDLI
KLKSFCTAKETTIRVNRQPTKWEKIFTTYSCDKGLISRIYNELKQIYKKKTNNPIKKWAK
DMNRHFSKENIYAARKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGC
GEIGIFLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPVIPLLGIYPKEYKSCCYKDTCT
SMQEWPNTTSVCQTLACIGINWNLTKVHIANATPKVSDAESLETAASGIEEEPQEEGDFQ
LNFIVISNERAFPGPNPGMANGKCRYKPRNHGRQRGAADEERKVIAMGKDVTECRPADLI
LSSFTRGSDQLSKQLPVLAFRGCLPQVMCGFWGKEPTDTPQSQRACIPGTAAQNDAGPSQ
QGMACGSQGYARTLRCNLTGSFSDLKEFTKNVLPLLELPQVGNPTSVTTSSGTLVVHVNS
EQWKLAFHPYGNGTQRQSPRSGVEGNPRGEATEFATQLTHKKQLEKDSKSSRGFEVCKAL
AEIGKKQLRERQKLKAMTTPSVSKDMQHLELLDLAGGDVIQ

>gi568815592f:12414110_12614721|GENSCAN_predicted_CDS_9|2646_bp
atggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgactct
cagaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaaccaaa
aagagtccaggaccagatggattcacagcaaaattctaccagagaaaaccccatgtctca
gcccaaaatctccgtaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat
gtacaaaaatcacaagcattcttatacaccaataacagacaagcagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca
agggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagag
gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgttacctgac
ttcaaactgtactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagatcaatggaacagaacagagccctcagaaataatgctgcatatctacaactat
ctgatctttgacaaacctgagaaaaacaagaaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggatagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaaccata
aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc
atgtctagaacaccaaaagcaatggcaacacaagccaaaattgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca
aaatgggagaaaattttcacaacctactcatgtgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaacaaacaaccctatcaaaaagtgggcgaag
gacatgaacagacacttctcaaaagaaaacatttatgcagccagaaaacacatgaaaaaa
tgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc
acaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtgctggagaggatgt
ggagaaataggaatatttttacactgttggtgggactgtaaactagttcaaccattgtgg
aagtcagtgtggcgattcctcagggatctagaactagaaataccatttgacccagtcatc
cccttactgggtatatacccaaaggagtataaatcatgctgctataaagacacatgcaca
tccatgcaagaatggcctaatacaaccagtgtttgtcaaactttagcatgtattggaatc
aactggaatcttactaaagtacacattgccaatgccacccccaaagtttctgatgcagag
tccttggagacagcagccagtggaattgaggaagagccacaggaagaaggagatttccag
ctaaacttcatagtaatttcgaatgagcgtgcctttcctgggccgaatccagggatggca
aatggaaagtgcagatacaagcccagaaaccatggcaggcaaagaggggcagccgatgaa
gaaaggaaggtcattgccatggggaaggatgtgactgaatgtcgtcctgcagacctaata
ctttcctcatttacacgtggctcagatcagctttctaagcagctcccagtcttagctttc
agaggctgtctgccccaggtcatgtgtggcttctggggaaaggagcctactgacactcct
caaagtcagcgggcctgcattccaggcactgctgctcagaatgacgcaggaccctcgcag
caagggatggcctgcggctcccagggatatgctagaacactcagatgcaatctgacaggc
agcttcagtgatttgaaggaattcacaaagaacgttttgcccctcctggaattgccacaa
gttggaaatcccacatcagtcaccacttcctcggggacattagttgttcatgtaaattcc
gaacaatggaagctggcttttcatccatacggaaatggaacacaaagacagagccctaga
agtggagtggaggggaatcctagaggagaggccactgagtttgccacccagttgacgcat
aagaagcaacttgagaaggactccaaaagcagcaggggatttgaggtatgcaaagctttg
gctgagattggaaagaaacagctcagagaacggcaaaaattgaaagcaatgacaacacca
agtgtcagcaaggacatgcagcatctggagctcttggaccttgctggtggtgatgtcata
cagtaa