GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:20:31 Sequence gi568815582f:58150015_58379774 : 229760 bp : 42.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6212 6291 80 1 2 40 92 75 0.537 3.68 1.02 Term + 6537 6690 154 0 1 65 38 136 0.426 2.71 1.03 PlyA + 7581 7586 6 -0.45 2.19 PlyA - 7913 7908 6 1.05 2.18 Term - 8339 8159 181 1 1 147 42 177 0.772 15.10 2.17 Intr - 12064 11948 117 0 0 97 60 63 0.324 3.06 2.16 Intr - 14078 13973 106 0 1 44 -11 139 0.454 -2.05 2.15 Intr - 15694 15546 149 0 2 91 65 224 0.949 19.36 2.14 Intr - 16670 16570 101 2 2 73 98 154 0.999 12.89 2.13 Intr - 17294 17193 102 2 0 88 99 82 0.995 8.75 2.12 Intr - 17781 17671 111 0 0 75 97 134 0.998 12.66 2.11 Intr - 18679 18596 84 1 0 91 76 44 0.849 2.60 2.10 Intr - 20467 20351 117 1 0 -5 69 126 0.007 1.14 2.09 Intr - 24496 24437 60 1 0 64 106 53 0.014 2.71 2.08 Intr - 36842 36741 102 2 0 41 66 136 0.127 6.15 2.07 Intr - 46830 46719 112 2 1 93 113 140 0.978 16.46 2.06 Intr - 55923 55778 146 0 2 24 103 50 0.002 -1.74 2.05 Intr - 65753 65628 126 2 0 27 60 117 0.139 2.76 2.04 Intr - 72817 72646 172 0 1 84 87 102 0.347 8.72 2.03 Intr - 76632 76434 199 1 1 36 -24 168 0.005 -2.11 2.02 Intr - 86469 86260 210 1 0 81 61 98 0.449 4.36 2.01 Init - 92078 91949 130 2 1 85 92 58 0.589 6.26 2.00 Prom - 92936 92897 40 -8.85 3.05 PlyA - 93161 93156 6 1.05 3.04 Term - 93301 93212 90 1 0 101 45 77 0.835 1.44 3.03 Intr - 93510 93439 72 0 0 99 81 56 0.817 4.68 3.02 Intr - 99191 99079 113 0 2 53 91 51 0.930 1.08 3.01 Init - 100183 99916 268 1 1 45 -15 346 0.696 17.21 3.00 Prom - 100949 100910 40 -7.05 4.00 Prom + 102196 102235 40 -5.85 4.01 Init + 102747 102839 93 2 0 37 86 154 0.991 10.53 4.02 Intr + 103984 104256 273 0 0 78 91 249 0.900 21.01 4.03 Intr + 105063 105350 288 2 0 39 4 359 0.985 19.12 4.04 Intr + 108354 108509 156 2 0 71 110 200 0.999 19.79 4.05 Intr + 112371 112526 156 2 0 11 78 180 0.895 8.59 4.06 Intr + 113491 113559 69 0 0 97 80 69 0.554 5.46 4.07 Intr + 117465 117572 108 2 0 64 75 64 0.009 2.26 4.08 Intr + 128468 128623 156 1 0 79 83 175 0.364 15.39 4.09 Intr + 129677 129695 19 1 1 122 69 13 0.377 -1.93 4.10 Intr + 131688 131809 122 1 2 48 57 123 0.567 4.49 4.11 Term + 136049 136207 159 1 0 92 54 152 0.245 9.16 4.12 PlyA + 137265 137270 6 1.05 5.04 PlyA - 139954 139949 6 1.05 5.03 Term - 140782 140510 273 1 0 18 48 173 0.832 0.79 5.02 Intr - 141122 140905 218 0 2 90 60 243 0.891 19.00 5.01 Init - 143766 143718 49 0 1 76 81 100 0.890 7.27 5.00 Prom - 154069 154030 40 -6.35 6.00 Prom + 161726 161765 40 -3.55 6.01 Init + 163462 163672 211 0 1 69 42 139 0.371 6.40 6.02 Intr + 174720 174742 23 1 2 100 100 0 0.004 -1.26 6.03 Intr + 180465 180611 147 2 0 56 109 38 0.359 2.21 6.04 Intr + 186615 186905 291 2 0 29 93 175 0.235 8.51 6.05 Intr + 189681 189768 88 2 1 31 97 116 0.018 5.42 6.06 Intr + 202527 202575 49 1 1 40 81 80 0.009 -0.68 6.07 Term + 202821 203049 229 0 1 53 39 152 0.011 1.92 6.08 PlyA + 203351 203356 6 1.05 7.03 PlyA - 204148 204143 6 1.05 7.02 Term - 210853 210616 238 0 1 67 47 196 0.883 8.06 7.01 Init - 211845 211652 194 0 2 72 96 79 0.775 5.59 7.00 Prom - 220138 220099 40 -3.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 20455 20351 105 1 0 68 69 117 0.936 8.07 S.002 Init - 60328 60232 97 1 1 61 98 110 0.848 9.82 S.003 Init - 115979 115829 151 2 1 90 68 145 0.899 12.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:58150015_58379774|GENSCAN_predicted_peptide_1|77_aa MTSKDASQSNDSIRIRPSMSGEDLRWQWWPMGPHLLVSRGDAQSQAPTPEQQLRCHRSWS KGHHDAVVALKGACFPP >gi568815582f:58150015_58379774|GENSCAN_predicted_CDS_1|234_bp atgacatctaaagatgcaagccaatcaaatgattctattcgaataagaccctctatgtct ggagaagacctgagatggcagtggtggcctatgggtcctcacctgctggtgtccaggggg gatgcacagtctcaagctcccacacctgagcaacagcttagatgccacagaagctggtcc aaaggacaccacgatgctgtggtggccctgaaaggggcctgctttcctccatag >gi568815582f:58150015_58379774|GENSCAN_predicted_peptide_2|774_aa MARKDIQVLIIGTYEYYLVKGEKVLTILAIKLKVLKWSGYPGLFLAAHSLSLIIPRVWLT LTHNASPARLCFYHPSPSPSHLELPGMLLVPGCHNHLRRFPYSQSHVNVNSPINSSLKMI PENPHYPQGLLFTSISHGPDLQDLHADLTLVKHLYASVHQASVSVRTLSGANDRIPTPVK LAARELCAKAWIEMSLVSIDLEGTPPLGLLKQGTPQATAPQAAPVKGGEEPSPPSWKVGT EVLFQRREKRKKYPFIIPYPPVGGDDSRAGISAVKEQDLARLNKRGHPQEHQQKIPIPLA LLPRTGSGTFPPSPSGAFSGWRTEWVTRNQDDYQLVRKLGRGKYSEVFEAINITNNERVV VKILKPVKKKKIKREVKILENLRGGTNIIKLIDTVKDPVQLYQILTDFDIRFYMYELLKG NESMSGNNPNVHQLVVEYYSARKNNLDESQKQYTEGKKALDYCHSKGIMHRDVKPHNVMI DHQQKKLRLIDWGLAEFYHPAQEYNVRVASRYFKGPELLVDYQMYDYSLDMWSLGCMLAS MIFRREPFFHGQDNYDQLVRIAKVLGTEELYGYLKKYHIDLDPHFNDILGQHSRKRWENF IHSENRHLVSPEALDLLDKLLRYDHQQRLTAKEAMEHPYFLVSRQHDEDWKATGNAALML ANKTNQPNTNLEGKLHCLSGPGKRCLLEGLGEAVTPFLSLGSPKGNKSTWRRIVRSVAVL PLFHKQNKNQIKRLNAYREITFREQTQNGGRFGEHELDQAKGSPPPYIKPHFRM >gi568815582f:58150015_58379774|GENSCAN_predicted_CDS_2|2325_bp atggcccgcaaagatatccaggttctaatcattggaacctatgaatattaccttgtaaag ggggaaaaggtcttaacaattttagcgattaagttaaaggtcttaaaatggagtggttat cctggattatttttggcggcgcacagcctctccttgatcatacctcgtgtctggctgacc ctcacgcacaatgccagccctgctcgactctgcttttatcatccctccccttccccgtct cacctagaattacctggaatgctgctggtccctgggtgccacaaccatcttcgcaggttc ccttattcacagtcccatgtgaatgtaaacagtcccatcaattcatcgcttaaaatgatc cctgagaacccccactatccccagggattgttgttcacaagtatctctcatggcccagac ctgcaagacctccatgctgacctgacgttagtcaagcacctgtatgcgtctgtgcatcag gcatctgtgtcagtcaggacactttctggtgcaaatgacagaattccaacgcctgtcaag ctggctgcaagagaactctgtgcaaaggcctggattgaaatgtccctggtttctattgat ctggaagggactcctcctctaggattgctgaaacagggcaccccgcaggcgacagctccg caggcagctcctgtgaagggaggagaggagccatctccaccctcctggaaagtagggaca gaagtcctattccagaggagagaaaaaagaaaaaagtacccgttcatcattccttatccc ccagtgggtggggatgacagcagggcaggcatctctgctgtcaaggagcaggatttggca agactgaataagaggggtcacccacaggagcaccagcagaaaattcccattcctctggct ctcttgcccaggacaggctctggcacattcccaccatccccctcaggggcattctctggt tggaggacagagtgggtcacacgtaatcaagatgattaccaactggttcgaaaacttggt cggggaaaatatagtgaagtatttgaggccattaatatcaccaacaatgagagagtggtt gtaaaaatcctgaagccagtgaagaaaaagaagataaaacgagaggttaagattctggag aaccttcgtggtggaacaaatatcattaagctgattgacactgtaaaggaccccgtgcaa ctctaccagatcctgacagactttgatatccggttttatatgtatgaactacttaaagga aatgaaagcatgagtggaaacaacccaaatgtccatcagctagtagtggaatactactca gcaaggaaaaacaacttggatgaatctcagaaacagtatactgaaggaaagaaggctctg gattactgccacagcaagggaatcatgcacagggatgtgaaacctcacaatgtcatgata gatcaccaacagaaaaagctgcgactgatagattggggtctggcagaattctatcatcct gctcaggagtacaatgttcgtgtagcctcaaggtacttcaagggaccagagctcctcgtg gactatcagatgtatgattatagcttggacatgtggagtttgggctgtatgttagcaagc atgatctttcgaagggaaccattcttccatggacaggacaactatgaccagcttgttcgc attgccaaggttctgggtacagaagaactgtatgggtatctgaagaagtatcacatagac ctagatccacacttcaacgatatcctgggacaacattcacggaaacgctgggaaaacttt atccatagtgagaacagacaccttgtcagccctgaggccctagatcttctggacaaactt ctgcgatacgaccatcaacagagactgactgccaaagaggccatggagcacccatacttc ttggtctcacggcagcacgatgaagactggaaagcgacgggtaatgcggcattgatgctt gccaataaaaccaaccaaccaaacacaaaccttgaaggaaaactacattgcctgtctgga ccaggcaagaggtgcctactggagggtcttggtgaagcggtaactccattcctttccctt gggtcccccaaaggtaataaaagtacctggaggagaatagtcaggtctgttgcggttctc ccacttttccataagcagaacaagaaccaaatcaaacgtcttaacgcgtatagagagatc acgttccgtgagcagacacaaaacggtggcaggtttggcgagcacgaactagaccaagcg aagggcagcccaccaccgtatatcaaacctcacttccgaatgtaa >gi568815582f:58150015_58379774|GENSCAN_predicted_peptide_3|180_aa MLVPKSEAARRGGPRRKRGGPQARAAVPELLHQPAQLDNRQLQLRPFMGVGEDALGLIIS HHSAEAAPANAVCPGAPGAGGHASLRLRRDQKTKEWSSLGSIPLKNGIKLRWNPKSMLLT STLMWDMSFSYPSKAIPTKVLEEFIEITFNQCLSLILLVFRSEEETEKIFEEIMAEFFHI >gi568815582f:58150015_58379774|GENSCAN_predicted_CDS_3|543_bp atgctggtccccaagtccgaggccgcccggagaggagggcctaggaggaagaggggcggc ccccaggcgcgggcggcggtacctgagctcctccaccagcccgcacagctggataacagg cagctccagctccgacccttcatgggagtcggagaggacgctctcggactcatcatcagt catcactctgccgaagcggcccctgccaatgccgtgtgcccaggagctccgggtgccggc ggccacgcgtcactccggctgcgacgcgatcagaaaaccaaggaatggagtagcttaggt tccattccactcaagaatggaatcaagctccggtggaacccaaagtccatgctgttaact tctacgctgatgtgggacatgtccttctcctatccatccaaagccattcccactaaggtg ttggaagagttcattgagattacatttaaccagtgtctttccttgatcctgctcgtcttc agaagtgaggaggagacagaaaaaatatttgaagaaataatggctgaatttttccacatt tga >gi568815582f:58150015_58379774|GENSCAN_predicted_peptide_4|532_aa MFEKYYAKLEPRDQRPPRLSEIKISAADYAQFRGRRRSKSRTGMDRGVGLTADQKLELVQ KEVADMKDDLRHTRANAERDLQHHEVHLPASAACPAQAPHLYPTERYLEWVHVIVGALEE TQAVKREKPVVAQSKSESFKTREANGAAFGPWPKALEPPGSCWYKSEGLKAKNLESDVRG QEEQKQLSGMGRRGRARRLSTQASSTCFALAALAANWMAIIEEAEIRWSEVSREVHEFEK DILKAISKKKGSILATQKVMKYIEDMNRRRKEEVSEALHDVDFQQLKIENAQFLETIEAR NQELTQLKLSSGNTLQVLNAYKKSFADPSHSAQVAMVAVGLGSSQSKLHKAMEIYLNLDK EILLRKELLEKIEKETLQVEEDRAKAEAVNKRLRKQLAEFRAPQVMTYVREKILNADLEK SIRMWERKVEIAEMSLKGHLWTETWNGAAINNRKQSEQTEPRAAERQKSSRFSPLPSETW EGYVVAHGVVIKVLMDDDGIDWILCVSNLARIHVTYTHYNDNVLPGERARES >gi568815582f:58150015_58379774|GENSCAN_predicted_CDS_4|1599_bp atgtttgagaaatattacgctaaactggagcccagggatcagcgacctccacgattatca gaaattaaaatatcagcagcagattatgcacagtttcgaggcaggcgtagatccaaatcc cggacaggtatggaccgtggggtaggcctgactgccgaccaaaaacttgagctggtacaa aaagaggttgcggacatgaaggatgacttacgacacacaagggcaaatgcggaacgcgac ctgcagcatcacgaggtacaccttcctgccagcgcagcctgtcctgcccaggctccccac ctctaccccactgagaggtatctagagtgggttcatgtgattgttggggctttggaagaa acacaggctgtgaaaagagagaagccggtagtggctcagtccaagtctgaaagcttcaaa accagggaagccaatggtgcagccttcggtccgtggccgaaggccctagagcccccagga agctgctggtacaagtccgagggtctaaaggcaaagaacctggagtctgatgtccgaggg caggaggagcagaagcaattgtcgggcatgggaagaagagggagagccagaagactcagt acacaagcttcttccacctgctttgctctagctgctctggcagccaattggatggcgatc attgaggaggctgaaattcgatggagtgaagtttcgagagaagtgcatgagtttgaaaaa gatattctaaaagccatatccaagaagaaagggagtattttggccactcagaaagtgatg aaatacattgaggacatgaaccgccggaggaaggaagaggtgagtgaggcccttcacgat gttgattttcagcagttgaagatagagaacgctcaatttcttgagacaattgaagcaagg aatcaagaactgacccagctaaagctgtcatctggaaacactctgcaggttctcaatgcc tacaaaaaaagttttgccgacccctcgcatagtgctcaggtggcaatggtggctgtgggc ttaggtagctcgcagagcaagcttcacaaggcaatggaaatatacctcaatctggacaag gagatcttgctgagaaaagagctacttgaaaaaattgaaaaagaaacactacaagtagag gaggaccgggccaaagccgaggcagtgaataagaggctccggaagcagctggccgagttc cgggcaccacaggtgatgacttacgtccgggagaagatcttaaatgcggacctggagaag agcatcaggatgtgggaaaggaaagtggagatagcagagatgtccttaaaaggccatctt tggactgagacatggaatggggccgcaattaacaacaggaaacaatctgaacagactgaa ccacgagcagcagaaaggcagaagagcagccgcttcagccccttaccatccgagacctgg gagggctatgttgttgctcatggagttgttatcaaagtcctcatggatgatgatggtatt gactggatactctgtgtgagcaatcttgctaggatccatgttacttatacccactataac gacaatgtccttcctggagagagagcaagggaatcttga >gi568815582f:58150015_58379774|GENSCAN_predicted_peptide_5|179_aa MRGVLLVLLGLLYSSTSCGVQKASVFYGPDPKEGLVSSMEFPWVVSLQDSQYTHLAFGCI LSEFWVLSIASAIQNRPVPLPLGPATSPGLPSLISAVRTAGLATEKCNDESSIQCRWDIL SDFTAITEALAPNTRVPTTYQLFNQSIGALTLKHLGLTNVTATNTCTERKASEKIQKRF >gi568815582f:58150015_58379774|GENSCAN_predicted_CDS_5|540_bp atgcgaggggtgctcctggtgctgctcggccttctctattcttccaccagttgtggcgtc cagaaagcttccgttttctacggtcctgaccccaaggagggcttggtcagcagcatggag ttcccgtgggtggtgtcgctgcaggactcccagtacacacacctggctttcggctgcatc ctgagcgagttctgggtcctcagcatcgcatccgccattcagaacaggccagtgcccctg cctttggggcccgcaacatcgccggggttgccaagcctgatctctgctgtgaggactgca ggccttgcaacggaaaaatgcaatgatgagtcctccattcaatgcaggtgggatattctt tctgattttaccgccatcacggaggcactggctcctaacactagggtccctaccacttat caactgttcaaccagtcaataggagcccttaccttaaaacatttgggccttacaaacgtt actgccacaaatacatgtacagagagaaaagcctcggaaaaaatacagaagcgcttttaa >gi568815582f:58150015_58379774|GENSCAN_predicted_peptide_6|345_aa MGATIQDEIWVGTQPNHITALGEMFIGRAKWELVRLPEPTEYWVKAGVLVGTNPPLVALC GVFTGAYGKSYEPIGNKPGKSVPISLQFRQHLLFLDFLIIAFLTTPDQLTQTLGVRPRHQ YGFKAPQRYREFQDSCSRRGNIFFLNWVNVKLSFVSCGDHINVVTYASPKKVKYFRPPPS SPVNQMPLVLPCDDHFCRQGAILLAAFYIDASSSCLNVALNKIKVVKATMDNFGIGKDRA RMVQFPQFAEAGVGLGQSCFRGEEHRDYDRPGSATGDRGLFHPLSSTVKDGQSQSGHFRD LPPGNALSFSGLFLAEKKNSVIFLLFAFERREIWLHSAWLSGSQT >gi568815582f:58150015_58379774|GENSCAN_predicted_CDS_6|1038_bp atgggggctacaattcaagatgagatttgggtggggacacagccaaaccatatcactgcc cttggagaaatgttcattggacgtgctaaatgggaactagtaagattgcctgagcccaca gaatattgggtaaaagctggagtactggttgggacaaatcctccactggtggctctttgt ggagtgttcactggggcttatggcaaaagctatgagcccattggaaataagccaggtaaa agcgttcctatttctctgcagtttcgccagcatctgttgtttcttgactttttaataatt gcctttctgactacaccagaccaattgactcagacattgggtgtgagacccaggcatcag tatggctttaaagccccccagaggtacagagaattccaggacagttgcagtagacgtggg aacattttttttctcaactgggtgaatgtgaagctctcattcgtttcctgtggtgaccac ataaacgttgtcacctatgcatcgcccaagaaagttaaatatttccgtcctcccccaagc agcccagtcaaccaaatgccactggtgcttccgtgcgatgatcatttctgccggcagggg gcaatcttgctcgccgcattctatattgacgcttcctccagctgtctgaacgtggcgctc aacaaaattaaggtggtcaaggccacaatggataattttggtattggaaaggacagagct cggatggtccagttccctcagtttgcagaagcgggggttggactagggcagagctgtttt cgaggagaggagcaccgagattatgacaggcctgggagcgctaccggagaccggggctta tttcatcccttatcttcaaccgtaaaagacggacagtcccagagtggccatttcagagac ctaccccctgggaacgcattgtctttctcagggctgttccttgctgagaaaaagaattca gtgatatttctcctatttgcttttgaaagaagagaaatatggctccattctgcgtggctc tcaggaagccagacctaa >gi568815582f:58150015_58379774|GENSCAN_predicted_peptide_7|143_aa MGQYFCLRKAEEKIKGTLSCTLSTSSATLGWSTKQVLGVPDSRTWLLDDISGPLLGQREA HCPEGHRHTSTTIKMIQENMTSPNELNKPPETNSGEREKCDLSDREINIVILRKLNEIKD NTEKKLRILSDKFTKEIEIIKKN >gi568815582f:58150015_58379774|GENSCAN_predicted_CDS_7|432_bp atggggcaatacttctgcttgagaaaagcagaggaaaaaataaaggggactttgtcttgc actttaagtaccagctcagccacactgggttggagcaccaagcaggttcttggagtccct gattccaggacttggctcttggacgacatttctggacctttactgggccagagagaagcc cactgccctgaaggacacagacacacatctacaactatcaagatgatccaggaaaacatg acctcaccaaatgaactaaataagccaccagagaccaattctggagaaagagagaaatgt gacctttctgacagagaaatcaacatagttattttgaggaaactcaacgaaattaaagat aacactgagaagaaactcagaattctatcagataaatttaccaaagagattgaaataatt aaaaagaattaa