GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:22:20 Sequence gi568815595f:93964435_94226487 : 262053 bp : 37.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15896 16048 153 1 0 69 96 141 0.549 12.35 1.02 Intr + 32813 32867 55 1 1 65 90 73 0.423 2.83 1.03 Term + 39225 39478 254 1 2 58 42 263 0.846 13.52 1.04 PlyA + 40750 40755 6 1.05 2.02 PlyA - 41305 41300 6 1.05 2.01 Sngl - 50835 49951 885 0 0 71 43 491 0.862 38.70 2.00 Prom - 61333 61294 40 -5.15 3.00 Prom + 62042 62081 40 -6.55 3.01 Init + 68744 68781 38 1 2 74 75 38 0.969 0.63 3.02 Intr + 70897 71002 106 1 1 75 64 111 0.996 6.70 3.03 Intr + 72118 72320 203 0 2 67 115 279 0.988 25.66 3.04 Intr + 75446 75554 109 2 1 27 64 125 0.457 3.37 3.05 Intr + 78581 78806 226 1 1 36 90 128 0.387 4.44 3.06 Intr + 84972 85088 117 1 0 60 105 109 0.431 9.32 3.07 Term + 88753 88829 77 2 2 86 41 50 0.694 -2.88 3.08 PlyA + 89015 89020 6 1.05 4.02 PlyA - 89527 89522 6 1.05 4.01 Sngl - 97077 96514 564 0 0 50 34 490 0.582 35.69 4.00 Prom - 102709 102670 40 -7.45 5.03 PlyA - 104213 104208 6 1.05 5.02 Term - 104774 104692 83 0 2 58 53 112 0.569 1.58 5.01 Init - 109442 109202 241 2 1 71 61 156 0.845 9.28 5.00 Prom - 109496 109457 40 -7.05 6.05 PlyA - 109613 109608 6 1.05 6.04 Term - 110402 109648 755 0 2 63 48 179 0.350 4.01 6.03 Intr - 111715 111481 235 1 1 -5 -9 215 0.431 -1.16 6.02 Intr - 112576 111896 681 1 0 26 73 550 0.780 38.15 6.01 Init - 114250 113504 747 1 0 44 54 372 0.772 24.16 6.00 Prom - 114504 114465 40 -12.72 7.03 PlyA - 114554 114549 6 1.05 7.02 Term - 115810 115283 528 1 0 55 41 334 0.913 18.67 7.01 Init - 118000 117854 147 1 0 51 64 98 0.834 3.84 7.00 Prom - 121005 120966 40 -2.75 8.00 Prom + 126832 126871 40 -5.35 8.01 Init + 129226 129283 58 0 1 68 82 100 0.954 8.92 8.02 Intr + 129706 129860 155 2 2 79 75 31 0.951 -0.23 8.03 Intr + 130599 130720 122 2 2 113 116 56 0.933 9.27 8.04 Term + 161777 162056 280 2 1 73 44 149 0.280 2.93 8.05 PlyA + 163816 163821 6 1.05 9.00 Prom + 174185 174224 40 -6.35 9.01 Init + 180251 180431 181 1 1 60 94 66 0.426 3.79 9.02 Intr + 198937 199110 174 2 0 60 79 85 0.017 3.79 9.03 Term + 215171 215253 83 0 2 81 42 85 0.017 0.08 9.04 PlyA + 215782 215787 6 1.05 10.03 PlyA - 217117 217112 6 1.05 10.02 Term - 231463 231335 129 1 0 118 49 52 0.681 1.50 10.01 Init - 240938 240894 45 2 0 93 75 67 0.509 6.63 10.00 Prom - 243507 243468 40 -6.35 11.03 PlyA - 243556 243551 6 1.05 11.02 Term - 249985 249875 111 1 0 75 42 71 0.519 -1.22 11.01 Init - 251529 251470 60 0 0 69 66 96 0.855 6.80 11.00 Prom - 259124 259085 40 -3.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 9201 9518 318 2 0 65 44 202 0.821 9.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_1|153_aa GRLGVPEAPGEERGAGVRSGLGWEVVGGATRDVQSDGQLLRLVQAVAGACQSMEKLSSAK LVPGAADLKEYPEDVAPTVGFSKINLRQGKFEVTIFDLGGGIRIRGIWKNYYAESYGVIF VVDSSDEERMEETKEAMSEMLRHPRISGKPILV >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_1|462_bp gggcgtctcggagtgccggaggcccccggggaagagcggggtgccggtgtccgctccggg ctcggatgggaagtggtgggaggagcgacccgggatgttcagtctgatggccagttgctg cggctggttcaagcggtggcgggagcctgtcagtccatggaaaaattgtcttctgcaaaa ctggtccctggtgctgctgatctaaaggaataccctgaagatgtagctcctactgttgga ttttcaaaaattaaccttagacaaggaaagtttgaagtcaccatctttgacttgggaggt ggaataagaattcggggaatctggaagaattactatgctgaatcctatggggtaatattt gttgtggattccagtgatgaagagagaatggaagagacaaaagaggctatgtcagaaatg ctaagacatcctaggatatcgggaaagcctatattggtgtaa >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_2|294_aa MKDRLQELKQRTKEIELSRDSHVSTTETEEQGVFLQQAVIYEREPVAERHLHEIQKLQES INNLADNVQKFGQQQKSLVASMRRFSLLKRESTITKEIKIQAEYINRSLNDLVKEVKKSE VENGPSSVVTRILKSQHAAMFRHFQQIMFIYNDTIAAKQEKCKTFILRQLEVAGKEMSEE DVNDMLHQGKWEVFNESLLTEINITKAQLSEIEQRHKELVNLENQIKDLRDLFIQISLLV EEQGESINNIEMTVNSTKEYVNNTKEKFGLAVKYKKRNPCRVLCCWCCPCCSSK >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_2|885_bp atgaaagaccgacttcaagaactaaagcagagaacaaaggaaattgaactctctagagac agtcatgtatcaactacagaaacagaggaacaaggggtgtttctacagcaagctgttatt tatgaaagagagcctgtagctgagagacacctacatgaaatccaaaaactacaggaaagt attaacaatttggcagataatgttcaaaaatttgggcagcaacagaaaagtctggtggct tcaatgagaaggtttagtctacttaagagagagtctaccattacaaaggagataaaaatt caggcagaatacatcaacagaagtttgaatgatttagttaaagaagttaaaaagtcagag gttgaaaatggtccatcttcagtggtcacaaggatacttaaatctcagcatgctgcaatg ttccgccattttcagcaaatcatgtttatatacaatgacacaatagcagcaaagcaagag aagtgcaagacatttattttacgtcagcttgaagttgctggaaaagagatgtctgaagaa gatgtaaatgatatgcttcatcaaggaaaatgggaagtttttaatgaaagcttacttaca gaaatcaatatcactaaagcacaactttcagagattgaacagagacacaaggaacttgtt aatttggagaaccaaataaaggatttaagggatcttttcattcagatatctcttttagta gaggaacaaggagagagcatcaacaatattgaaatgacagtgaatagtacaaaagagtat gttaacaatactaaagagaaatttggactagctgtaaaatacaaaaaaagaaatccttgc agagtactgtgttgttggtgctgtccatgctgtagctcaaaataa >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_3|291_aa MECGLIDIGDLDRLANKQDKEGALGEADVIECLSLEKLVNEHKCLCQIEPCSAISGYGKK IDKSIKKGLYWLLHVIARDFDALNERIQKETTEQRALEEQEKQERAERVRKLREERKQNE QEQAELDGTSGLAELDPEPTNPFQPIASVIIENEGKLEREKKNQKMEKDSDGCHLKHKME HEQIETQGQVNHNGQKNNEFGLVENYKEALTQQLKNEDETDRPSLESANGKKKTKKLRMK RNHRVEPLNIDDCAPESPTPPPPPPPDFYRKPLPPLAVPQRPNSDAHDVIS >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_3|876_bp atggagtgtggattaatagacattggagacttggacaggttggcaaataaacaagataaa gaaggagctttaggagaagctgatgtcattgaatgtctatctctggaaaaattggtcaat gagcacaagtgcctgtgtcagatagaaccatgttcagcaatctcggggtatggaaagaaa attgacaagtccattaaaaaaggcctttattggctgctacatgttattgcaagagacttt gatgccttaaatgaacgcatccaaaaagagacaacagagcagcgtgctcttgaggaacaa gagaaacaagaaagagctgaacgagtgcgaaaattacgagaagaaagaaaacaaaatgaa caggagcaggctgaactcgatggaaccagtggtctggctgagttggacccagaaccaacg aatcctttccagccaatagcatctgtaatcattgagaatgaaggaaaacttgaaagagag aaaaaaaaccaaaaaatggagaaagacagtgatggctgccacctgaaacataaaatggag catgagcaaatagagacacaaggccaggttaatcacaatggccaaaaaaataatgaattt ggactagtagaaaattataaggaggcattaacacagcagttaaagaatgaagatgagaca gaccggccatcattggaatcagctaatggtaaaaagaaaactaagaaactaagaatgaaa aggaaccaccgggtagaaccacttaatatagatgactgtgctcctgagagtccaacgcca cccccaccccctcctcctgatttctataggaagccactgcctcccctggctgtgccacag cgacctaacagtgatgctcatgatgtgatctcataa >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_4|187_aa MFLLLNCIVAVSQNMGIGKNGDLPRPPLRNEFRYFQRMTTTSSVEGKQNLVIMGRKTWFS IPEKNRPLKDRINLVLSRELKEPPQGAHFLARSLDDALKLTERPELANKVDMIWIVGGSS VYKEAMNHLGHLKLFVTRIMQDFESDTFFSEIDLEKYKLLPEYPGVLSDVQEGKHIKYKF EVCEKDD >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_4|564_bp atgtttcttttgctaaactgcatcgtcgctgtgtcccaaaacatgggcatcggcaagaac ggggacctgcccaggccgccgctcaggaatgaattcaggtatttccagagaatgaccaca acttcttcagtagagggtaaacagaatctggtgattatgggtaggaagacctggttctcc attcctgagaagaatcgacctttaaaggatagaattaatttagttctcagcagagaactc aaggaacctccacaaggagctcattttcttgccagaagtttggatgatgccttaaaactt actgaacgaccagaattagcaaataaagtagacatgatttggatagttggtggcagttct gtttataaggaagccatgaatcacctaggccatcttaaactatttgtgacaaggatcatg caggactttgaaagtgacacgtttttttcagaaattgacttggagaaatataaacttctg cctgaatacccaggtgttctctctgatgtccaggaggggaaacacatcaagtacaaattt gaagtatgtgagaaggatgattaa >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_5|107_aa MGKDFMSKTPKATAAKAKIDKWDLIKRKSFCTVKETTIRVNRQPTEWEKIFAIYPTDKGL ISRIYKEFKQIYEKKTTSSKNRKSFAFLVEHQKAKNPEHKSPSYYGI >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_5|324_bp atgggcaaagacttcatgtctaaaacaccaaaagcaacggcagcaaaagccaaaattgac aaatgggatctaattaaacgaaagagcttctgcacagtaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctatccaactgacaaagggcta atatccagaatctacaaagaatttaaacaaatttacgagaaaaaaacaacctcatcaaaa aataggaaatcctttgcgtttctggtggaacatcaaaaggctaaaaatcctgaacataag agtccttcatactacgggatctga >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_6|805_aa MVKGSMQQEELTILNIYAPNTGASRFMKQVLRDLQRDLDSHTIIVGDFNTPLSILDRSMR QKINKDIQDLNSALDQADLIDIYRTLHPKSTEYTLFSASHHTYSKTDHIIGSKLLSKCKR MEIITNSLSEHSEIKLELRIKKLTQNRTTTQKLNNLLLNDYWVNNEMQAEIKMFFETNEN KDTTYQNLWDPFKAVCKGRFTALNAHKRKQERSKIDNLTSKLKELEKQQQTNSKASRRQE ITKIRAELKTRRERFQHSSTCPGYTHTLDEQKHLVIASVQLPNDDAHFDASHYDSEKGEF GGFGSVSGKIEIEIKINHEGEVNRARYMPQNPCIITTKTPSSDVLILDYTKHPKPDLSGD CNSDLRLHGHQKKGYGLSWPNLSGHLLSASDDHTIYPWDISAVPKEGNVVDAKTIFTGHT VVVEDVSWHLFHKSLFGSVADHQKLMIWDTCSNNTSKPSHSVDAHTAEVNCLSILIDLSK IGEEQSPEDAEDGSPELLFIHGGHTAKISDFSWNPNEPWVICSVSEDNIMQVWQMAENIY NDEDPEGIWIQKDKVLEFLARAIRHEKEIKGIQIGREEVKLSLFADDMIVYLESPIVSAQ NLLKLISNFSKISRNKINVQKSQAFLYTNNRQTDSQIMSELPSTIATKRIKYLGIQLTRD VKDLFKENYKPLLEEIREDTNKWKILPCSWIRRINTVKMAILPRVTYRFNAIPIQLPLTF FTELEKQNTLNFIWNQKRTHIAKTILRKKNKAGGIMLPDFKLYYKATVTKTAWNWYQNRY IQPMEQNRALRNNTTHLQLSDLWQT >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_6|2418_bp atggtaaagggatcaatgcaacaagaagagctaactatcctaaatatatatgcacccaac acaggagcatccagattcatgaagcaagttcttagagacctacaaagagacttagactcc cacacaataatagtgggagactttaacaccccgctgtcaatattagacagatcaatgaga cagaaaattaacaaggatattcaggacttgaactcagctctggaccaagcagacctaata gacatctacagaactctccacccaaaatcaacagaatatacattattctcagcatctcat cacacttattctaaaactgaccacataattggaagtaaactcctcagcaaatgtaaaaga atggaaatcataacaaacagtctctcagaacacagtgaaatcaaattagaactcaggatt aagaaactcactcaaaaccgcacaactacacagaaactgaacaacctgctcctgaatgac tactgggtaaataatgaaatgcaggcagaaataaagatgttctttgaaaccaatgagaac aaagacacaacataccagaatctctgggacccatttaaagcagtgtgtaaagggagattt acagcactaaatgctcacaagagaaagcaggaaagatctaaaattgacaacctaacatca aaattaaaagaactagagaagcaacagcaaacaaattcaaaagctagcagaagacaagaa ataactaagatcagagcagaactgaagaccagaagggaaagatttcagcattcatcgact tgtcctgggtacacacacacattggatgaacaaaaacatcttgttatagccagtgtgcaa ctccctaatgatgatgctcattttgatgcatcacactacgacagtgagaaaggagaattt ggaggttttggttcagttagtggaaaaattgaaatagaaatcaagatcaaccatgaagga gaagtaaacagggcccgttatatgccccagaatccttgtattatcacaacaaagactcct tccagtgatgttcttatcttggactatacaaaacatcctaaaccagatctttctggagac tgcaactcagacttgcgtctccatggacatcagaaaaaaggctatgggctttcttggcca aatctcagtgggcacttacttagtgcttcagatgatcacaccatctacccgtgggacatc agtgctgttccaaaggagggaaatgtggtggatgcaaagaccatctttacagggcataca gtagtagtagaagatgtttcctggcatctgttccataagtctctgtttgggtcagttgct gatcatcagaaacttatgatttgggatacttgttcaaacaatacttccaaaccaagccac tcagttgacgctcacactgctgaagtgaactgcctttcaatccttatagatttaagtaaa attggagaggaacaatccccagaagatgcagaagacgggtcaccagagttgttgtttatt catggtggtcacactgccaagatatctgatttctcctggaatcccaatgaaccttgggtg atttgttctgtatcagaagacaatatcatgcaagtgtggcaaatggcagagaacatttat aatgatgaagaccctgaaggaatatggatccagaaggacaaggtattggaatttctagcc agggcaatcaggcacgagaaggaaataaagggtattcaaataggaagagaggaagtcaaa ttgtctctgtttgcagatgacatgattgtatatttagaaagccccatcgtctcagcccaa aatctccttaagctaataagcaacttcagcaaaatctcaaggaacaaaatcaatgtgcaa aaatcacaagcattcctatataccaataacagacaaacagacagccaaatcatgagtgaa ctcccatccacaattgctactaagagaataaaatacctgggaatacaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcgaggaaataagagaggacaca aacaaatggaaaatacttccatgctcatggataagaagaatcaatacagtgaaaatggcc atactgcccagagtaacttatagattcaatgctatccccatccagctaccattgactttc ttcacagaattggaaaaacaaaatactttaaacttcatatggaatcaaaaaagaacccac atagccaagacaatcctaaggaaaaagaacaaagctggaggcatcatgctacctgacttc aaactatactacaaggctacagtaaccaaaacggcatggaactggtaccaaaacagatat atacaaccaatggaacagaacagagccctcagaaataataccacacatctacaactatct gatctttggcaaacctga >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_7|224_aa MTTSFLKKEVRKLKCASRVCKTDCRDLARVSDSVGLKESLKFVALASFQPPLVIPRQTRS GVDLQQTPADLQQRGPTVRRKTNKQKGIAHPLKDPTSQRSPTSKTKGDISTKMGRNQCKK AENSKNQNASSPPKDHNSSPAREQNWMENEFDKLTEVGFRRWVITNSSKLKEHILTQCKE AKNLEKSLDKLLTRITSLEKNINDLMELKNTARELREAYTSINS >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_7|675_bp atgacaacctctttccttaagaaagaggttcgcaaacttaagtgtgcatcaagggtctgt aaaacagactgcagggacctagccagagtttctgattcagtaggtctgaaagagagcctg aaattcgtagctctagcaagttttcagcctccgctggtgatacccaggcaaacaaggtct ggagtggacctccagcaaactccagcagacctgcagcagaggggcccaactgttagaagg aaaactaacaaacagaaaggaatagcacatccactcaaagaccccaccagccaaaggtca ccaacatcaaagaccaaaggggatatatccacaaagatggggagaaaccagtgcaaaaag gctgaaaattccaaaaaccagaatgcctcttctcctccaaaggatcacaactcctcgccg gcaagggaacaaaactggatggagaatgagtttgacaaactgacagaagtaggcttcaga aggtgggttataacaaactcttccaagctaaaggagcatattctaacccaatgcaaggaa gctaagaaccttgaaaaaagtttagacaaattgctaactagaataaccagtttagagaag aacataaatgacctgatggagctgaaaaacacagcacgagaacttcgtgaagcatacaca agtatcaatagctga >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_8|204_aa MVPVGQEAFKPGQFLRLPQCYLHCNEYDSLRLRWLRQTLESFIPQPLINVIKVSELDGRK MGDAQPEMFDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNLPLLQIELLRSAIKALRP GGILVYSTCTLSKAENQDVISEILNSHGNIMPMDIKGIARTCSHDFTFAPTGQECGLLVI PDKGKAWGPMYVAKLKKSWSTGKW >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_8|615_bp atggttcctgtgggacaggaagcttttaagccaggacagttcctgcgtctgccgcaatgt tatcttcattgtaatgaatatgatagtctgagattgaggtggctaaggcagacgttggaa tctttcatcccacagcctttgataaatgtaattaaagtgtctgaattggatggcagaaaa atgggagatgcccagcctgaaatgtttgacaaggtgttagtggatgctccgtgttcaaat gatcgaagctggttgttttcttctgactctcagaaggcatcctgtaggataagtcaaagg aggaatttgcctcttctacagatagagctgttaaggtctgcaattaaggccttacgtcct ggagggatacttgtatactctacatgcacgctttccaaggcagaaaatcaagatgtgatc agtgaaattttaaactcccacggtaacatcatgcctatggacattaaaggaatagcaagg acttgctcccacgacttcacatttgctcccactggccaggaatgtgggctcttagtgatt ccagataagggcaaagcctggggcccaatgtatgtagccaaattgaagaaatcatggagc acaggaaaatggtga >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_9|145_aa MEKKQESAILAKERTNFKKEQSITRIKCYREVREKENNHSSKQIGFVNFGCSMVMRWYQR AWLKKQQTTYETRDLMNPRNLISGVSCTATGLILMNAKRNLKPFVTQKPSLDTRVRDTVC MMNASVLYLQGSRNIAKSKTPQTLR >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_9|438_bp atggaaaagaagcaggagagtgccatacttgccaaggaaagaactaactttaagaaggag cagtcgattactagaatcaagtgctacagagaggtcagagaaaaggagaacaatcatagc agcaaacagattggatttgttaattttggatgctcaatggtaatgagatggtatcagaga gcatggttaaagaaacaacaaacaacttatgagaccagagatcttatgaatccacgtaat ttaatttctggggtgagctgcacagccacaggcttgatattaatgaatgcaaaaaggaac ctgaaaccttttgttacacagaagccatctttggacaccagggtgagagacacagtgtgc atgatgaatgccagtgtcttatatctgcagggaagtcggaatattgccaaatctaaaact cctcaaactcttcggtaa >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_10|57_aa MTITEDLDKNSLRGELIRMQCWLQPQKLNQEEGLTKDKKVARQQDPDSLLWATTPVK >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_10|174_bp atgaccatcactgaggacctggacaagaacagtttaagaggagagctgatcagaatgcag tgttggctacagcctcagaagttgaaccaggaggaggggctcactaaagacaagaaagtt gcaagacagcaggacccagactccttattgtgggccaccacaccagtcaaatag >gi568815595f:93964435_94226487|GENSCAN_predicted_peptide_11|56_aa MQQEGPHQTPDVSDFQAFKTAPFQLHLLPTVQLGKTASSTMRTCFVNDHIKDAIPE >gi568815595f:93964435_94226487|GENSCAN_predicted_CDS_11|171_bp atgcagcaagaaggccctcatcagacaccagatgtcagtgacttccaagccttcaaaacc gctccctttcaactacatctgcttcccacagtgcaacttggaaaaactgcctcatctacc atgagaacatgctttgtcaatgaccatatcaaagatgctatacctgaatga