GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:09:08 Sequence gi568815586f:118881883_119256698 : 374816 bp : 43.66% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 976 971 6 1.05 1.03 Term - 7517 7271 247 1 1 65 36 193 0.808 7.06 1.02 Intr - 9599 9507 93 1 0 80 115 31 0.223 4.08 1.01 Init - 22865 22462 404 2 2 60 39 175 0.122 5.90 1.00 Prom - 25974 25935 40 -2.56 2.00 Prom + 34701 34740 40 -3.26 2.01 Init + 35073 35125 53 2 2 75 80 26 0.286 1.13 2.02 Intr + 38316 38365 50 0 2 119 84 43 0.653 5.32 2.03 Intr + 69277 69484 208 2 1 51 86 88 0.117 3.04 2.04 Intr + 70554 70575 22 0 1 111 83 -5 0.070 -1.15 2.05 Intr + 99956 100131 176 1 2 31 90 224 0.222 15.64 2.06 Intr + 108843 108879 37 0 1 74 77 6 0.047 -3.64 2.07 Intr + 110464 110551 88 0 1 82 99 57 0.132 5.74 2.08 Term + 123068 123180 113 0 2 70 41 104 0.325 2.62 2.09 PlyA + 123185 123190 6 1.05 3.04 PlyA - 125595 125590 6 1.05 3.03 Term - 144250 144067 184 0 1 96 45 115 0.837 5.02 3.02 Intr - 148009 147812 198 0 0 33 70 126 0.633 3.77 3.01 Init - 170368 170304 65 1 2 46 64 54 0.065 -0.58 3.00 Prom - 172736 172697 40 -0.76 4.04 PlyA - 173640 173635 6 1.05 4.03 Term - 194321 193428 894 2 0 -3 34 2689 0.997 246.81 4.02 Intr - 197781 197716 66 0 0 106 66 32 0.737 1.90 4.01 Init - 211258 211193 66 1 0 77 100 54 0.418 6.57 4.00 Prom - 223422 223383 40 -2.46 5.02 PlyA - 223706 223701 6 -0.45 5.01 Sngl - 224711 224226 486 2 0 60 39 210 0.931 9.37 5.00 Prom - 226641 226602 40 -5.26 6.00 Prom + 231799 231838 40 -3.46 6.01 Init + 232261 232268 8 0 2 85 40 0 0.574 -4.64 6.02 Intr + 232396 232482 87 1 0 109 94 85 0.977 10.29 6.03 Intr + 235055 235126 72 2 0 93 72 80 0.754 5.42 6.04 Intr + 237823 238029 207 1 0 -35 82 160 0.420 1.29 6.05 Intr + 238368 238394 27 0 0 142 110 13 0.786 6.13 6.06 Intr + 243499 243597 99 1 0 111 91 32 0.716 5.03 6.07 Intr + 248796 248952 157 0 1 98 86 52 0.158 6.01 6.08 Intr + 263409 263803 395 2 2 133 86 151 0.167 12.85 6.09 Intr + 269135 269338 204 2 0 84 113 61 0.046 6.52 6.10 Intr + 271678 271767 90 1 0 90 86 43 0.046 3.41 6.11 Intr + 272361 272501 141 0 0 106 94 123 0.997 14.17 6.12 Term + 274613 274916 304 2 1 128 42 239 0.994 17.94 6.13 PlyA + 276290 276295 6 1.05 7.00 Prom + 285532 285571 40 -4.26 7.01 Init + 297431 297797 367 1 1 121 119 166 0.970 20.19 7.02 Intr + 305143 305206 64 2 1 123 100 68 0.835 9.28 7.03 Term + 311817 311976 160 0 1 76 50 142 0.720 6.61 7.04 PlyA + 312261 312266 6 -3.44 8.00 Prom + 312503 312542 40 -5.76 8.01 Init + 312968 313680 713 1 2 97 -2 576 0.469 43.88 8.02 Term + 319307 319313 7 2 1 111 47 0 0.050 -4.46 8.03 PlyA + 319674 319679 6 1.05 9.04 PlyA - 319687 319682 6 1.05 9.03 Term - 320058 319880 179 1 2 82 37 97 0.047 1.85 9.02 Intr - 342933 342861 73 0 1 97 64 37 0.147 1.18 9.01 Init - 350123 349980 144 2 0 73 94 114 0.376 10.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 268848 268895 48 2 0 91 80 30 0.865 3.47 S.002 Intr + 269135 269302 168 1 0 84 86 133 0.940 12.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_1|247_aa MSELPFTIATRRIKHLGIQLTRDVKDLFKKNYKPLLNEIKKDTNKWKNIPCSWIGRINIM KMAILPKVIYRFNAIPIKLPKIFFTALEKTTLKFVWNQKRACIAKIILSKKNKAGDITLP DFKLYYKATVTKTACLPYGSVSYVWRTHVCLFADVSLVLRRGLGIWVGRPKALSNDAAND HNNNDGTKDSVHEFVSYYMPRIKNSIQTYFSQPGQPNQSEWMLAANNQHSYTNAYQLNIS PAWHICQ >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_1|744_bp atgagtgaactcccattcacaattgctacaaggagaataaaacacctaggaatacaactt acaagagatgtgaaggacctcttcaagaagaattacaaaccactgctcaatgaaataaag aaggacacaaacaagtggaagaatattccatgctcatggataggaagaatcaatatcatg aaaatggccatactgcccaaggtaatttatagattcaatgccattcccatcaagctacca aagattttcttcacagcattggagaaaactactttaaagttcgtatggaaccaaaaaaga gcctgcattgccaagataatcctaagcaaaaagaacaaggctggagacatcacgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatgtcttccctatggaagc gtaagctatgtatggaggacccatgtctgcttgttcgctgatgtatccctggtgcttaga agaggacttggaatatgggtggggagaccaaaggcgctcagcaatgatgctgcaaatgat cataataataatgatggcactaaagacagcgtccacgaatttgtctcttactacatgcca cgtatcaagaacagtattcaaacatatttttcacaaccaggacaacccaaccagtcagag tggatgctagctgcaaataaccagcacagctataccaatgcctaccagctgaacatcagt ccagcatggcatatctgtcaataa >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_2|248_aa MEAHSPTTLRELKEAKCGFCLIMDSETELLTFVALNMDIISVLKTFACFSLVQKSNSWKA FQNRQMSSKPLEMIFEGAQCQLSTLENSHPHALSTAASARTSVYSSLLDFYHFGQRPGRP GPFGLAMASVQQGEKQLFEKFWRGTFKAVATPRPESIIVASITARKPLPRPRRTSLKSTS DHGLLWHLLPTHSSGYPDKMSIRTQPVPSRAGAPNWFTQRRLKQAASSITVPVASDFGVS DTTKTYLK >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_2|747_bp atggaggcccacagtccaacaaccctcagagaactaaaggaagccaaatgtgggttctgc ctaataatggattcagaaacagaacttcttacctttgtggcattaaatatggacatcatt agtgtcctgaagacttttgcctgtttctcattagttcagaagagtaattcttggaaagca tttcagaacagacagatgagcagcaaacccttggaaatgatttttgagggtgcccagtgc cagctttctacccttgagaactcccaccctcatgctctgtctactgctgcttctgcaagg acatcagtgtattcctcattactagatttttaccactttggacagcgccccggacgcccc ggcccctttgggttggcgatggcgagcgttcagcaaggcgagaagcagctttttgagaag ttctggcgaggaaccttcaaagcggtggccaccccccgtcccgagagcatcattgtcgcc agtatcacggcccgcaagccgctgccaagacccagaagaacctctttgaaaagcacatca gaccatggtctcctgtggcacctgctgcccacccactcgtctgggtatcctgacaagatg tccatccgcacccagcctgtgcctagcagagcaggggcacccaactggttcacccagaga agattaaaacaggctgctagcagcatcacagttcctgtggcttcagatttcggtgtttct gataccacaaaaacctatttgaaataa >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_3|148_aa MGTVHIHESSHSCHGDYSKHSRIRSYQIYLVLSAYCVSDPAMSAFNPRLPLPGGEGDLCD VILAYSASSITIKTCFITAGRNKRVGEEIKPNPDVVAMRIYRPDLQPQRAPGPPMWSPDL VEDQTSHRLLPANGSVAVTLRQGHIGLH >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_3|447_bp atgggaacagtccacatccacgagagcagccacagttgccacggtgattattctaaacac agcagaattcgctcctaccagatttacctggtgctgtctgcctactgtgtctctgaccct gccatgtctgcatttaatccaaggctcccattacctggaggagaaggggatctgtgtgat gtgattttggcatattcggcaagttccatcaccatcaaaacatgttttatcactgcagga agaaacaaacgtgtaggagaagaaataaaaccaaacccagatgtggtggccatgagaatc taccgcccagatcttcaaccacagagggcaccaggacctccaatgtggagccctgacctt gtggaagatcaaacctcccacaggttgcttccagccaatggctcggtggccgtgactcta aggcagggacacataggactccactga >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_4|341_aa MEYYCEGLAILNWVIREALIEKYNSQLSRQSVRPQGKAKCAMRQLSLSSLSLSSSLSSSS SLFPLSSSSFIVTILITIVIVTIIIIIITITTITIITIIITITTIIIIATITTIISTIIT ITTTTIIITTIIITITTIIIIITIIATIIITTTIIITTTIINIIIIITITIITVIITIII TIIAIIITTIITIIITTIIITITTIINITIITIFIITIITIIITIIATIIIIITTIIITI TTIINIIIIIITIITIIITIIITIIATIITIITTIIITITTIIITISTIIITIITIIITI IIITVIITIIIIIIIILTIITIIIVIIIIQSRSHADAISSY >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_4|1026_bp atggaatactactgcgaggggcttgcaattttaaactgggtaatcagagaagccctcatt gagaagtacaactcacaactctcccgacaaagtgtgcgaccacagggcaaggctaagtgt gcaatgcggcagctgtcattgtcatcactgtcattatcatcatcattgtcatcatcatca tcattattcccattgtcatcatcatcattcattgtcactatcctcatcaccattgttatc gtcaccatcattattattatcattaccattactaccatcaccatcatcactatcatcatc accatcaccaccatcatcatcatcgccaccatcaccaccatcatcagcaccatcatcacc atcaccaccaccaccatcatcattaccaccatcatcatcaccatcaccaccatcatcatc atcatcaccatcatcgctaccatcatcatcactactaccatcatcatcaccaccaccatc atcaacatcatcatcatcatcaccatcaccatcatcaccgttatcatcaccatcatcatc accatcatcgctatcatcatcaccaccatcatcaccatcatcattaccaccatcatcatc accatcaccaccatcatcaacatcaccatcatcaccatcttcatcatcaccatcatcacc atcatcatcaccatcatcgctaccatcatcatcatcattaccactatcatcatcaccatc accaccatcatcaacatcatcatcatcatcatcaccatcatcaccattatcatcaccatc atcatcaccatcattgctaccattatcaccatcattaccaccatcatcatcaccatcacc accatcatcatcaccatcagcaccatcatcatcaccatcatcaccatcatcatcaccatc atcatcattactgtcatcatcaccatcattatcatcatcatcatcatcctcaccatcatt accatcataatcgtcatcatcatcatccagtccagatcccatgctgatgcaataagtagt tattaa >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_5|161_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIM KMAILPKVIYRFNAIPIKLPMTFSTELEKTTLKFIWNQKRARIAKTILSQKNKTGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITHTSTTI >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_5|486_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatatctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcatg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttttccacagaactggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagacaatcctaagccaaaagaacaaaactggaggcatcacgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagaccaatggaatagaacagagccctcagaaataacacacacgtctacaacc atttga >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_6|596_aa MRCASHDKDLTPPPSSRGKKKKKKSTRKKRRRSSSYSPSPVKKKKKKSSKKHKRRSKSST CGSWLSHDDDGDDEDEEEEECTTREREDEMENGGGERQKTLSTFSLSSLIQDRFTDSFGG GLEHRSFSKKRRHRSRSRPRKSHRHRHHRCPSRSQSSESRPSSCESRHRGRSPEEGQKSR RRHSRRCSKTLCKDSPEAQSSRPPSQPLQMLGYLSARGVCIMTVNISWLFRNSPWAQRSA VSNGSFCFQITGSGSAADLFTKTASPLTTSRGRSQEYDSGNDTSSPPSTQTSSARSRGQE KGSPSGGLSKSRELNSGNTSDSGNSFTTSSPQNKGAMLENLSPTSRGRESRGFQSPCLEC AEVKKSSLVPSTARSSPMKGCSRSSSYASTRSSSHSSRSPNPRASPRYTQSRSTSSEKSY SSKSGKRSPPSRSSRSRRSPSYSRYSPSRERDPKYSEKDSQQRERERARRRRRSYSPMRK RRRDSPSHLEARRITSARKRPIPYYRPSPSSSGSLSSTSSWYSSSSSRSASRSYSRSRSR SRSRRRSRTRTSSSSSSRSPSPGSRSRSRSRSRSRSRSRSQSRSYSSADSYSSTRR >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_6|1791_bp atgaggtgtgcctctcatgacaaagacttgacaccaccaccttcctccaggggaaagaag aaaaagaagaaatccactcggaagaagagaaggaggtcctcatcctatagcccatcgcct gtcaagaaaaagaagaagaaaagttccaagaaacacaagcgacgcagtaagagttccaca tgtggaagctggctgtctcatgatgatgacggtgatgatgaggatgaggaggaggaggaa tgcacaacaagggagagagaagatgagatggagaatggaggaggggaaaggcagaagact ctctctacgttctccttatccagcctcatccaggacagatttactgacagttttggggga ggattggagcaccggtcattctccaagaagagaaggcacagatctcgaagccggccccga aagtctcaccgccaccgccatcaccgctgcccctcgcggtcccagagctcggagtcccgc ccctcaagctgtgagagcaggcaccgcggccggtcccctgaggaagggcagaagtcccgc cgaaggcactcccgccgctgctccaagaccctctgcaaggacagccctgaggcccagtcc agtcgcccgcccagtcaacccctccagatgcttggctacctgtcagccaggggtgtatgc atcatgacggtaaacatttcttggttatttagaaacagcccatgggcccagcgctcagca gtgtccaacgggtctttttgctttcagatcactgggtcggggtctgctgctgacctcttt accaaaacagccagcccgctcaccacctcgcgaggacgttcccaggagtacgactcagga aatgacacgtcctcgccaccctccacgcaaaccagctcagccaggtctcggggccaggag aaggggagccccagtgggggcttgagcaagagccgggagctcaacagtggcaacacctct gattcagggaactccttcaccacctcctcaccccagaacaagggggccatgttggagaat ctctcccccaccagcaggggcagagagtcaaggggatttcagtcaccgtgtctggaatgt gccgaagtgaagaagtccagtttggtcccatccacagcccggagctcacccatgaaaggg tgttcccgcagctcctcctatgccagcacccgatcctccagtcactcgtcccgatcccca aatcccagggcttcccccaggtacacccaaagccgatccacctcttctgaaaaaagctat tcctccaagtctggcaagaggagcccgcccagcagaagctctaggtcccgccgcagccct agctactcccgctacagccccagcagggagcgggatcccaaatacagtgagaaggactcg cagcagcgggagcgcgagcgagcgcgtcggagacgtcggtcctactcgcctatgagaaag cgccggagagactccccgagccacctggaggcccggaggataaccagtgcccggaaacgc cccatcccctactatcggcccagcccctcctcatccggcagcctcagcagcacctcctcc tggtacagcagcagcagtagccgctcggccagccgcagctactcccggagccggagtcgg agccggagccggagacggagccggacccgcacgagcagcagctctagctcccgcagccct agtccgggctcccgcagccggagccggagcaggagccggagccggagccggagcaggagc cagagccggagctacagctcagcagacagctactccagcacgaggcgctaa >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_7|196_aa MADGQMPFSCHYPSRLRRDPFRDSPLSSRLLDDGFGMDPFPDDLTASWPDWALPRLSSAW PGTLRSGMVPRGPTATARFGVPAEGRTPPPFPGEPWKVCVNVHSFKPEELMVKTKDGYVE VSGKHEEKQQEGGIVSKNFTKKIQLPAEVDPVTVFASLSPEGLLIIEAPQVPPYSTFGES SFNNELPQDSQEVTCT >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_7|591_bp atggctgacggtcagatgcccttctcctgccactacccaagccgcctgcgccgagacccc ttccgggactctcccctctcctctcgcctgctggatgatggctttggcatggaccccttc ccagacgacttgacagcctcttggcccgactgggctctgcctcgtctctcctccgcctgg ccaggcaccctaaggtcgggcatggtgccccggggccccactgccaccgccaggtttggg gtgcctgccgagggcaggacccccccacccttccctggggagccctggaaagtgtgtgtg aatgtgcacagcttcaagccagaggagttgatggtgaagaccaaagatggatacgtggag gtgtctggcaaacatgaagagaaacagcaagaaggtggcattgtttctaagaacttcaca aagaaaatccagcttcctgcagaggtggatcctgtgacagtatttgcctcactttcccca gagggtctgctgatcatcgaagctccccaggtccctccttactcaacatttggagagagc agtttcaacaacgagcttccccaggacagccaggaagtcacctgtacctga >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_8|239_aa MPKGGRKGGHKGWARQYRSPEEIDVQLQAEKQKAREEEEQKEGGDGAAGDPKKEKKSLDS DESEDEEDDYQQNRKGVEGLFNIENPNQVAQTTKKVTQLYLDGPKELSRREREEIEKQKA KERYMKMHLAGKTEQAKADLARWPSSGNSGRRLPGRRKRKGKQKMMPHCQENECSRSPCI SNRDPWEEMPGTWAALPGPLLCLAHPVPWRRRNSPSWPGAPHGLGPPLHFGTEIVWGML >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_8|720_bp atgcctaaaggagggagaaagggaggccacaaaggctgggcgaggcagtataggagccct gaggagatcgacgtacagctgcaggctgagaagcagaaggccagggaagaagaggagcaa aaagaaggtggagatggggctgcaggtgaccccaaaaaggagaagaaatctctagactca gatgagagtgaggatgaagaagatgactaccagcaaaaccgcaaaggcgttgaagggctc ttcaacatcgagaaccccaaccaggtggcacagacaaccaaaaaggtcacacaactgtat ctggatgggccaaaggagctttcgaggagagaacgagaagagattgagaagcagaaggca aaagagcgttacatgaaaatgcacttggccgggaagacagagcaagccaaggctgacctg gcccgctggccatcatccggaaacagcgggaggaggctgcccggaagaaggaagaggaaa ggaaagcaaaagatgatgccacattgtcaggaaaacgaatgcagtcgctctccctgtata agtaaccgcgacccatgggaggagatgccggggacctgggccgcgctgccaggacctctg ctgtgtctcgcccaccctgtgccctggcgccgccgcaacagcccctcgtggccaggagcc ccccatggcctggggcctcctcttcattttggcacagaaattgtttgggggatgttgtga >gi568815586f:118881883_119256698|GENSCAN_predicted_peptide_9|131_aa MRRMERRERDREEEEEERRRKKEEENEKKKEDDKEKEGEEGGGRRERQQRQPVLGLKHAA KANAISSFPAQRGQIKSSLYITIGSQLARRQLGGGGGVLRIKLCQSWVKAGGCKKVLGIS SPSIIKVRAQE >gi568815586f:118881883_119256698|GENSCAN_predicted_CDS_9|396_bp atgaggaggatggagaggagggaaagggatagagaggaggaagaagaagaaagaaggagg aagaaggaggaggagaatgagaagaagaaggaagatgacaaggagaaggagggggaggag ggaggaggaaggagggagaggcagcagagacagccagtcctggggctcaaacatgcagcc aaggcaaacgccatctcctccttcccagcacagcgaggtcaaatcaagtcatctctatac atcaccattggcagccagctggctagaaggcagttgggagggggtggaggtgtgctgcgg ataaagctctgtcagagctgggtcaaggcagggggctgcaagaaggttctgggcatctcc tctcccagtatcatcaaagtgagagctcaggaataa