GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:50:29 Sequence gi568815595r:93960951_94161511 : 200561 bp : 37.75% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1785 1858 74 1 2 112 54 89 0.728 4.99 1.02 PlyA + 1902 1907 6 1.05 2.03 PlyA - 3900 3895 6 1.05 2.02 Term - 4469 4300 170 0 2 -31 52 224 0.067 3.96 2.01 Init - 12799 12724 76 1 1 63 74 133 0.130 8.70 2.00 Prom - 15301 15262 40 -6.35 3.00 Prom + 17193 17232 40 -3.25 3.01 Init + 17317 17354 38 0 2 -4 64 74 0.018 -4.67 3.02 Intr + 19380 19532 153 0 0 69 96 96 0.023 6.77 3.03 Intr + 35770 35908 139 1 1 38 99 70 0.062 2.65 3.04 Intr + 36297 36351 55 2 1 65 90 73 0.423 2.83 3.05 Term + 42709 42962 254 2 2 58 42 263 0.846 13.52 3.06 PlyA + 44234 44239 6 1.05 4.02 PlyA - 44789 44784 6 1.05 4.01 Sngl - 54319 53435 885 1 0 71 43 491 0.862 38.70 4.00 Prom - 64817 64778 40 -5.15 5.00 Prom + 65526 65565 40 -6.55 5.01 Init + 72228 72265 38 2 2 74 75 38 0.969 0.63 5.02 Intr + 74381 74486 106 2 1 75 64 111 0.996 6.70 5.03 Intr + 75602 75804 203 1 2 67 115 279 0.988 25.66 5.04 Intr + 78930 79038 109 0 1 27 64 125 0.457 3.37 5.05 Intr + 82065 82290 226 2 1 36 90 128 0.387 4.44 5.06 Intr + 88456 88572 117 2 0 60 105 109 0.431 9.32 5.07 Term + 92237 92313 77 0 2 86 41 50 0.694 -2.88 5.08 PlyA + 92499 92504 6 1.05 6.02 PlyA - 93011 93006 6 1.05 6.01 Sngl - 100561 99998 564 1 0 50 34 490 0.582 35.69 6.00 Prom - 106193 106154 40 -7.45 7.03 PlyA - 107697 107692 6 1.05 7.02 Term - 108258 108176 83 1 2 58 53 112 0.569 1.58 7.01 Init - 112926 112686 241 0 1 71 61 156 0.845 9.28 7.00 Prom - 112980 112941 40 -7.05 8.05 PlyA - 113097 113092 6 1.05 8.04 Term - 113886 113132 755 1 2 63 48 179 0.350 4.01 8.03 Intr - 115199 114965 235 2 1 -5 -9 215 0.431 -1.16 8.02 Intr - 116060 115380 681 2 0 26 73 550 0.780 38.15 8.01 Init - 117734 116988 747 2 0 44 54 372 0.772 24.16 8.00 Prom - 117988 117949 40 -12.72 9.03 PlyA - 118038 118033 6 1.05 9.02 Term - 119294 118767 528 2 0 55 41 334 0.913 18.67 9.01 Init - 121484 121338 147 2 0 51 64 98 0.834 3.84 9.00 Prom - 124489 124450 40 -2.75 10.00 Prom + 130316 130355 40 -5.35 10.01 Init + 132710 132767 58 1 1 68 82 100 0.954 8.92 10.02 Intr + 133190 133344 155 0 2 79 75 31 0.951 -0.23 10.03 Intr + 134083 134204 122 0 2 113 116 56 0.933 9.27 10.04 Term + 165261 165540 280 0 1 73 44 149 0.280 2.93 10.05 PlyA + 167300 167305 6 1.05 11.00 Prom + 177669 177708 40 -6.35 11.01 Init + 183735 183915 181 2 1 60 94 66 0.425 3.79 11.02 Term + 191324 191469 146 0 2 15 42 170 0.787 2.19 11.03 PlyA + 192096 192101 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 12685 13002 318 0 0 65 44 202 0.803 9.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_1|24_aa XLRAVDSQWTLQQEFGHVGTFLQK >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_1|75_bp ngactgagagcagtggactctcagtggactcttcagcaagagtttggtcatgttggaaca ttcctgcagaagtga >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_2|81_aa MRVLGGRCGALLACLLLVLPVSEANSMQEQWQAFSQIRSGNGRLAGSGAQRTPFRIRRDG SQQRVCDGGKQQLWTASKSSA >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_2|246_bp atgagggtcctgggtgggcgctgcggggcgctgctggcgtgtctcctcctagtgcttccc gtctcagaggcaaactccatgcaggaacaatggcaagcctttagccagatcaggagcggc aatgggcgcctcgctggatcaggagcacagcggacacccttccggatccggagggatggg agtcagcagcgggtctgcgacggcggcaaacagcagttgtggacggcgagcaaaagctca gcttga >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_3|212_aa MSVGCDCYDEKKRGVSECRRPPGKSGVPVSAPGSDGKWWEERPGMFSLMASCCGWFKRWR EPVSSIYASDFVGHLECITSQASKVSPQPRGRGPISVCGLVLRNQPTKQESMEKLSSAKL VPGAADLKEYPEDVAPTVGFSKINLRQGKFEVTIFDLGGGIRIRGIWKNYYAESYGVIFV VDSSDEERMEETKEAMSEMLRHPRISGKPILV >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_3|639_bp atgagtgttggctgtgactgttatgatgagaagaaaaggggcgtctcggagtgccggagg cccccggggaagagcggggtgccggtgtccgctccgggctcggatgggaagtggtgggag gagcgacccgggatgttcagtctgatggccagttgctgcggctggttcaagcggtggcgg gagcctgtcagcagtatctatgctagtgactttgttgggcacctagaatgtatcaccagt caggcatctaaagtgagtccccaaccccggggccgtggaccaatatctgtctgtgggctg gttcttaggaaccagcccacaaagcaggagtccatggaaaaattgtcttctgcaaaactg gtccctggtgctgctgatctaaaggaataccctgaagatgtagctcctactgttggattt tcaaaaattaaccttagacaaggaaagtttgaagtcaccatctttgacttgggaggtgga ataagaattcggggaatctggaagaattactatgctgaatcctatggggtaatatttgtt gtggattccagtgatgaagagagaatggaagagacaaaagaggctatgtcagaaatgcta agacatcctaggatatcgggaaagcctatattggtgtaa >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_4|294_aa MKDRLQELKQRTKEIELSRDSHVSTTETEEQGVFLQQAVIYEREPVAERHLHEIQKLQES INNLADNVQKFGQQQKSLVASMRRFSLLKRESTITKEIKIQAEYINRSLNDLVKEVKKSE VENGPSSVVTRILKSQHAAMFRHFQQIMFIYNDTIAAKQEKCKTFILRQLEVAGKEMSEE DVNDMLHQGKWEVFNESLLTEINITKAQLSEIEQRHKELVNLENQIKDLRDLFIQISLLV EEQGESINNIEMTVNSTKEYVNNTKEKFGLAVKYKKRNPCRVLCCWCCPCCSSK >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_4|885_bp atgaaagaccgacttcaagaactaaagcagagaacaaaggaaattgaactctctagagac agtcatgtatcaactacagaaacagaggaacaaggggtgtttctacagcaagctgttatt tatgaaagagagcctgtagctgagagacacctacatgaaatccaaaaactacaggaaagt attaacaatttggcagataatgttcaaaaatttgggcagcaacagaaaagtctggtggct tcaatgagaaggtttagtctacttaagagagagtctaccattacaaaggagataaaaatt caggcagaatacatcaacagaagtttgaatgatttagttaaagaagttaaaaagtcagag gttgaaaatggtccatcttcagtggtcacaaggatacttaaatctcagcatgctgcaatg ttccgccattttcagcaaatcatgtttatatacaatgacacaatagcagcaaagcaagag aagtgcaagacatttattttacgtcagcttgaagttgctggaaaagagatgtctgaagaa gatgtaaatgatatgcttcatcaaggaaaatgggaagtttttaatgaaagcttacttaca gaaatcaatatcactaaagcacaactttcagagattgaacagagacacaaggaacttgtt aatttggagaaccaaataaaggatttaagggatcttttcattcagatatctcttttagta gaggaacaaggagagagcatcaacaatattgaaatgacagtgaatagtacaaaagagtat gttaacaatactaaagagaaatttggactagctgtaaaatacaaaaaaagaaatccttgc agagtactgtgttgttggtgctgtccatgctgtagctcaaaataa >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_5|291_aa MECGLIDIGDLDRLANKQDKEGALGEADVIECLSLEKLVNEHKCLCQIEPCSAISGYGKK IDKSIKKGLYWLLHVIARDFDALNERIQKETTEQRALEEQEKQERAERVRKLREERKQNE QEQAELDGTSGLAELDPEPTNPFQPIASVIIENEGKLEREKKNQKMEKDSDGCHLKHKME HEQIETQGQVNHNGQKNNEFGLVENYKEALTQQLKNEDETDRPSLESANGKKKTKKLRMK RNHRVEPLNIDDCAPESPTPPPPPPPDFYRKPLPPLAVPQRPNSDAHDVIS >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_5|876_bp atggagtgtggattaatagacattggagacttggacaggttggcaaataaacaagataaa gaaggagctttaggagaagctgatgtcattgaatgtctatctctggaaaaattggtcaat gagcacaagtgcctgtgtcagatagaaccatgttcagcaatctcggggtatggaaagaaa attgacaagtccattaaaaaaggcctttattggctgctacatgttattgcaagagacttt gatgccttaaatgaacgcatccaaaaagagacaacagagcagcgtgctcttgaggaacaa gagaaacaagaaagagctgaacgagtgcgaaaattacgagaagaaagaaaacaaaatgaa caggagcaggctgaactcgatggaaccagtggtctggctgagttggacccagaaccaacg aatcctttccagccaatagcatctgtaatcattgagaatgaaggaaaacttgaaagagag aaaaaaaaccaaaaaatggagaaagacagtgatggctgccacctgaaacataaaatggag catgagcaaatagagacacaaggccaggttaatcacaatggccaaaaaaataatgaattt ggactagtagaaaattataaggaggcattaacacagcagttaaagaatgaagatgagaca gaccggccatcattggaatcagctaatggtaaaaagaaaactaagaaactaagaatgaaa aggaaccaccgggtagaaccacttaatatagatgactgtgctcctgagagtccaacgcca cccccaccccctcctcctgatttctataggaagccactgcctcccctggctgtgccacag cgacctaacagtgatgctcatgatgtgatctcataa >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_6|187_aa MFLLLNCIVAVSQNMGIGKNGDLPRPPLRNEFRYFQRMTTTSSVEGKQNLVIMGRKTWFS IPEKNRPLKDRINLVLSRELKEPPQGAHFLARSLDDALKLTERPELANKVDMIWIVGGSS VYKEAMNHLGHLKLFVTRIMQDFESDTFFSEIDLEKYKLLPEYPGVLSDVQEGKHIKYKF EVCEKDD >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_6|564_bp atgtttcttttgctaaactgcatcgtcgctgtgtcccaaaacatgggcatcggcaagaac ggggacctgcccaggccgccgctcaggaatgaattcaggtatttccagagaatgaccaca acttcttcagtagagggtaaacagaatctggtgattatgggtaggaagacctggttctcc attcctgagaagaatcgacctttaaaggatagaattaatttagttctcagcagagaactc aaggaacctccacaaggagctcattttcttgccagaagtttggatgatgccttaaaactt actgaacgaccagaattagcaaataaagtagacatgatttggatagttggtggcagttct gtttataaggaagccatgaatcacctaggccatcttaaactatttgtgacaaggatcatg caggactttgaaagtgacacgtttttttcagaaattgacttggagaaatataaacttctg cctgaatacccaggtgttctctctgatgtccaggaggggaaacacatcaagtacaaattt gaagtatgtgagaaggatgattaa >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_7|107_aa MGKDFMSKTPKATAAKAKIDKWDLIKRKSFCTVKETTIRVNRQPTEWEKIFAIYPTDKGL ISRIYKEFKQIYEKKTTSSKNRKSFAFLVEHQKAKNPEHKSPSYYGI >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_7|324_bp atgggcaaagacttcatgtctaaaacaccaaaagcaacggcagcaaaagccaaaattgac aaatgggatctaattaaacgaaagagcttctgcacagtaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctatccaactgacaaagggcta atatccagaatctacaaagaatttaaacaaatttacgagaaaaaaacaacctcatcaaaa aataggaaatcctttgcgtttctggtggaacatcaaaaggctaaaaatcctgaacataag agtccttcatactacgggatctga >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_8|805_aa MVKGSMQQEELTILNIYAPNTGASRFMKQVLRDLQRDLDSHTIIVGDFNTPLSILDRSMR QKINKDIQDLNSALDQADLIDIYRTLHPKSTEYTLFSASHHTYSKTDHIIGSKLLSKCKR MEIITNSLSEHSEIKLELRIKKLTQNRTTTQKLNNLLLNDYWVNNEMQAEIKMFFETNEN KDTTYQNLWDPFKAVCKGRFTALNAHKRKQERSKIDNLTSKLKELEKQQQTNSKASRRQE ITKIRAELKTRRERFQHSSTCPGYTHTLDEQKHLVIASVQLPNDDAHFDASHYDSEKGEF GGFGSVSGKIEIEIKINHEGEVNRARYMPQNPCIITTKTPSSDVLILDYTKHPKPDLSGD CNSDLRLHGHQKKGYGLSWPNLSGHLLSASDDHTIYPWDISAVPKEGNVVDAKTIFTGHT VVVEDVSWHLFHKSLFGSVADHQKLMIWDTCSNNTSKPSHSVDAHTAEVNCLSILIDLSK IGEEQSPEDAEDGSPELLFIHGGHTAKISDFSWNPNEPWVICSVSEDNIMQVWQMAENIY NDEDPEGIWIQKDKVLEFLARAIRHEKEIKGIQIGREEVKLSLFADDMIVYLESPIVSAQ NLLKLISNFSKISRNKINVQKSQAFLYTNNRQTDSQIMSELPSTIATKRIKYLGIQLTRD VKDLFKENYKPLLEEIREDTNKWKILPCSWIRRINTVKMAILPRVTYRFNAIPIQLPLTF FTELEKQNTLNFIWNQKRTHIAKTILRKKNKAGGIMLPDFKLYYKATVTKTAWNWYQNRY IQPMEQNRALRNNTTHLQLSDLWQT >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_8|2418_bp atggtaaagggatcaatgcaacaagaagagctaactatcctaaatatatatgcacccaac acaggagcatccagattcatgaagcaagttcttagagacctacaaagagacttagactcc cacacaataatagtgggagactttaacaccccgctgtcaatattagacagatcaatgaga cagaaaattaacaaggatattcaggacttgaactcagctctggaccaagcagacctaata gacatctacagaactctccacccaaaatcaacagaatatacattattctcagcatctcat cacacttattctaaaactgaccacataattggaagtaaactcctcagcaaatgtaaaaga atggaaatcataacaaacagtctctcagaacacagtgaaatcaaattagaactcaggatt aagaaactcactcaaaaccgcacaactacacagaaactgaacaacctgctcctgaatgac tactgggtaaataatgaaatgcaggcagaaataaagatgttctttgaaaccaatgagaac aaagacacaacataccagaatctctgggacccatttaaagcagtgtgtaaagggagattt acagcactaaatgctcacaagagaaagcaggaaagatctaaaattgacaacctaacatca aaattaaaagaactagagaagcaacagcaaacaaattcaaaagctagcagaagacaagaa ataactaagatcagagcagaactgaagaccagaagggaaagatttcagcattcatcgact tgtcctgggtacacacacacattggatgaacaaaaacatcttgttatagccagtgtgcaa ctccctaatgatgatgctcattttgatgcatcacactacgacagtgagaaaggagaattt ggaggttttggttcagttagtggaaaaattgaaatagaaatcaagatcaaccatgaagga gaagtaaacagggcccgttatatgccccagaatccttgtattatcacaacaaagactcct tccagtgatgttcttatcttggactatacaaaacatcctaaaccagatctttctggagac tgcaactcagacttgcgtctccatggacatcagaaaaaaggctatgggctttcttggcca aatctcagtgggcacttacttagtgcttcagatgatcacaccatctacccgtgggacatc agtgctgttccaaaggagggaaatgtggtggatgcaaagaccatctttacagggcataca gtagtagtagaagatgtttcctggcatctgttccataagtctctgtttgggtcagttgct gatcatcagaaacttatgatttgggatacttgttcaaacaatacttccaaaccaagccac tcagttgacgctcacactgctgaagtgaactgcctttcaatccttatagatttaagtaaa attggagaggaacaatccccagaagatgcagaagacgggtcaccagagttgttgtttatt catggtggtcacactgccaagatatctgatttctcctggaatcccaatgaaccttgggtg atttgttctgtatcagaagacaatatcatgcaagtgtggcaaatggcagagaacatttat aatgatgaagaccctgaaggaatatggatccagaaggacaaggtattggaatttctagcc agggcaatcaggcacgagaaggaaataaagggtattcaaataggaagagaggaagtcaaa ttgtctctgtttgcagatgacatgattgtatatttagaaagccccatcgtctcagcccaa aatctccttaagctaataagcaacttcagcaaaatctcaaggaacaaaatcaatgtgcaa aaatcacaagcattcctatataccaataacagacaaacagacagccaaatcatgagtgaa ctcccatccacaattgctactaagagaataaaatacctgggaatacaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcgaggaaataagagaggacaca aacaaatggaaaatacttccatgctcatggataagaagaatcaatacagtgaaaatggcc atactgcccagagtaacttatagattcaatgctatccccatccagctaccattgactttc ttcacagaattggaaaaacaaaatactttaaacttcatatggaatcaaaaaagaacccac atagccaagacaatcctaaggaaaaagaacaaagctggaggcatcatgctacctgacttc aaactatactacaaggctacagtaaccaaaacggcatggaactggtaccaaaacagatat atacaaccaatggaacagaacagagccctcagaaataataccacacatctacaactatct gatctttggcaaacctga >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_9|224_aa MTTSFLKKEVRKLKCASRVCKTDCRDLARVSDSVGLKESLKFVALASFQPPLVIPRQTRS GVDLQQTPADLQQRGPTVRRKTNKQKGIAHPLKDPTSQRSPTSKTKGDISTKMGRNQCKK AENSKNQNASSPPKDHNSSPAREQNWMENEFDKLTEVGFRRWVITNSSKLKEHILTQCKE AKNLEKSLDKLLTRITSLEKNINDLMELKNTARELREAYTSINS >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_9|675_bp atgacaacctctttccttaagaaagaggttcgcaaacttaagtgtgcatcaagggtctgt aaaacagactgcagggacctagccagagtttctgattcagtaggtctgaaagagagcctg aaattcgtagctctagcaagttttcagcctccgctggtgatacccaggcaaacaaggtct ggagtggacctccagcaaactccagcagacctgcagcagaggggcccaactgttagaagg aaaactaacaaacagaaaggaatagcacatccactcaaagaccccaccagccaaaggtca ccaacatcaaagaccaaaggggatatatccacaaagatggggagaaaccagtgcaaaaag gctgaaaattccaaaaaccagaatgcctcttctcctccaaaggatcacaactcctcgccg gcaagggaacaaaactggatggagaatgagtttgacaaactgacagaagtaggcttcaga aggtgggttataacaaactcttccaagctaaaggagcatattctaacccaatgcaaggaa gctaagaaccttgaaaaaagtttagacaaattgctaactagaataaccagtttagagaag aacataaatgacctgatggagctgaaaaacacagcacgagaacttcgtgaagcatacaca agtatcaatagctga >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_10|204_aa MVPVGQEAFKPGQFLRLPQCYLHCNEYDSLRLRWLRQTLESFIPQPLINVIKVSELDGRK MGDAQPEMFDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNLPLLQIELLRSAIKALRP GGILVYSTCTLSKAENQDVISEILNSHGNIMPMDIKGIARTCSHDFTFAPTGQECGLLVI PDKGKAWGPMYVAKLKKSWSTGKW >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_10|615_bp atggttcctgtgggacaggaagcttttaagccaggacagttcctgcgtctgccgcaatgt tatcttcattgtaatgaatatgatagtctgagattgaggtggctaaggcagacgttggaa tctttcatcccacagcctttgataaatgtaattaaagtgtctgaattggatggcagaaaa atgggagatgcccagcctgaaatgtttgacaaggtgttagtggatgctccgtgttcaaat gatcgaagctggttgttttcttctgactctcagaaggcatcctgtaggataagtcaaagg aggaatttgcctcttctacagatagagctgttaaggtctgcaattaaggccttacgtcct ggagggatacttgtatactctacatgcacgctttccaaggcagaaaatcaagatgtgatc agtgaaattttaaactcccacggtaacatcatgcctatggacattaaaggaatagcaagg acttgctcccacgacttcacatttgctcccactggccaggaatgtgggctcttagtgatt ccagataagggcaaagcctggggcccaatgtatgtagccaaattgaagaaatcatggagc acaggaaaatggtga >gi568815595r:93960951_94161511|GENSCAN_predicted_peptide_11|108_aa MEKKQESAILAKERTNFKKEQSITRIKCYREVREKENNHSSKQIGFVNFGCSMVMRWYQR ALYKTLNLNSKKKDTSIENRRLGVSANSQMGDVFNRKFNTPTGCCGSQ >gi568815595r:93960951_94161511|GENSCAN_predicted_CDS_11|327_bp atggaaaagaagcaggagagtgccatacttgccaaggaaagaactaactttaagaaggag cagtcgattactagaatcaagtgctacagagaggtcagagaaaaggagaacaatcatagc agcaaacagattggatttgttaattttggatgctcaatggtaatgagatggtatcagaga gcactatataagaccttgaacctaaattcgaaaaagaaggacaccagtatagagaaccga agattaggtgtcagcgcaaatagccaaatgggagatgtattcaacaggaaatttaataca ccaacagggtgttgtggcagccaataa