GENSCAN 1.0	Date run: 4-Nov-116	Time: 19:05:13

Sequence gi568815597f:33764192_33964749 : 200558 bp : 45.66% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 PlyA -    373    368    6                               1.05
 1.11 Term -   2465   2319  147  2  0   37   28   132 0.015   0.00
 1.10 Intr -   7187   7014  174  2  0   43   81    70 0.019   1.94
 1.09 Intr -   8560   8445  116  2  2  115   48    70 0.352   5.87
 1.08 Intr -   9029   8915  115  2  1   65   91    21 0.572   0.12
 1.07 Intr -  10380  10299   82  2  1   64   94    46 0.373   2.44
 1.06 Intr -  28335  28232  104  0  2  140  103   130 0.851  18.67
 1.05 Intr -  46673  46552  122  0  2  101  121   103 0.978  15.11
 1.04 Intr -  55646  55522  125  1  2   77   78   149 0.972  13.03
 1.03 Intr -  56365  56278   88  2  1   94  110    18 0.953   3.63
 1.02 Intr -  61583  61506   78  0  0   79   75   106 0.958   7.92
 1.01 Init -  65779  65674  106  1  1   80   52   114 0.663   5.39
 1.00 Prom -  67309  67270   40                             -11.53

 2.00 Prom +  67776  67815   40                              -4.96
 2.01 Init +  69688  70814 1127  0  2   60   53   391 0.746  26.47
 2.02 Term +  75136  75457  322  1  1   15   37   253 0.482   7.19
 2.03 PlyA +  75500  75505    6                              -0.45

 3.04 PlyA -  75772  75767    6                              -0.45
 3.03 Term -  76018  75993   26  2  2   93   40    40 0.044  -1.91
 3.02 Intr -  82805  82693  113  1  2  133  102   145 0.495  20.42
 3.01 Init -  91124  91054   71  2  2   70   18    65 0.110  -1.78
 3.00 Prom -  92982  92943   40                              -3.56

 4.00 Prom +  98973  99012   40                              -5.86
 4.01 Sngl + 100001 100561  561  1  0   82   45   407 0.845  31.94
 4.02 PlyA + 101726 101731    6                               1.05

 5.00 Prom + 103275 103314   40                              -5.66
 5.01 Init + 103876 103936   61  0  1   77   72    -1 0.610  -1.39
 5.02 Intr + 105012 105239  228  1  0   95   43   113 0.120   5.34
 5.03 Term + 126384 126736  353  1  2   90   41   116 0.089   1.75
 5.04 PlyA + 127874 127879    6                               1.05

 6.07 PlyA - 127924 127919    6                               1.05
 6.06 Term - 128962 128894   69  1  0  118   45    22 0.067  -1.16
 6.05 Intr - 129759 129604  156  0  0   74  115    15 0.049   3.01
 6.04 Intr - 132405 132366   40  2  1   74   94    14 0.002  -1.17
 6.03 Intr - 154110 153903  208  1  1  115  107   299 0.709  32.64
 6.02 Intr - 171763 171569  195  2  0  103   92   183 0.711  19.59
 6.01 Init - 174747 174741    7  0  1   50  108     0 0.140  -0.77
 6.00 Prom - 175594 175555   40                              -3.46

 7.03 PlyA - 177112 177107    6                               1.05
 7.02 Term - 178010 177946   65  0  2  105   45   115 0.971   6.95
 7.01 Init - 178078 178012   67  1  1   87   47     2 0.487  -2.76
 7.00 Prom - 180302 180263   40                              -2.76

 8.00 Prom + 184300 184339   40                              -1.76
 8.01 Init + 190399 190494   96  0  0   66  115    83 0.845   9.11
 8.02 Intr + 198501 198591   91  2  1  100   89    21 0.083   2.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +   7028   7191  164  0  2  108   41   113 0.802   6.70

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_1|418_aa
MAGAPPPALLLPCSLISDCCASNQRHSVGVGPSELVKKQIELKSRGVKLMPSKDNSQKTS
VLTQVGVSQGHNMCPDPGIPERGKRLGSDFRLGSSVQFTCNEGYDLQGSKRITCMKVSDM
FAAWSDHRPVCRARMCDAHLRGPSGIITSPNFPIQYDNNAHCVWIITALNPSKVIKLAFE
EFDLERGYDTLTVGDGGQDGDQKTVLYMSQNACSDSPHTPGSRIPESMSGDIWRQKWTVL
EICRDISSSDARSGSVRKSPKTSNAVELVAPGTEIEQGSCGDPGIPAYGRREGSRFHHGD
TLKFECQPAFELNIEPNSISHVACWEPQGPGGAGSSSGQANTKALPIAETKPRDRCTGQG
SAEAGPEKIKHTFVDYCEVPKVKCHEFRDELGTVRVLKELFSCGDRHHYINEVGPCGL

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_1|1257_bp
atggcgggcgcccctccccccgccttgctgctgccttgcagtttgatctcagactgctgt
gctagcaatcagcgacactccgtgggcgtaggaccctccgagctagtcaagaagcaaatt
gagttgaagtctcgaggtgtgaagctgatgcccagcaaagacaacagccagaagacgtct
gtgttaactcaggttggtgtgtcccaaggacataatatgtgtccagaccctggcataccc
gaaaggggcaaaagactaggctcggatttcaggttaggatccagcgtccagttcacctgc
aacgagggctatgacctgcaagggtccaagcggatcacctgtatgaaagtgagcgacatg
tttgcggcctggagcgaccacaggccagtctgccgagcccgcatgtgtgatgcccacctt
cgaggcccctcgggcatcatcacctcccccaatttccccattcagtatgacaacaatgca
cactgtgtgtggatcatcacagcactcaacccctccaaggtgatcaagctcgcctttgag
gagtttgatttggagaggggctatgacaccctgacggtcggtgatggtggtcaggatggg
gaccagaagacagttctctacatgtctcaaaatgcctgcagtgacagccctcacacccca
ggctctcgcatcccagagagcatgtctggggacatctggaggcagaaatggactgtactt
gagatctgtcgtgacattagcagttcagatgcaaggtcaggttcagtgaggaagtctcca
aagacttctaatgctgtggaacttgttgctcctgggacagagatcgagcagggcagttgc
ggtgaccctggcatacctgcatatggccggagggaaggctcccggtttcaccacggtgac
acactcaagtttgagtgccagcccgcctttgagctgaatatagagcccaacagcattagc
cacgtagcatgctgggagccacaaggacctggaggagcagggagcagctcggggcaggca
aacacaaaggctttgcccattgcagagaccaagcccagggacaggtgcacggggcagggc
tctgcagaagcagggcctgagaagataaagcacacctttgtggactactgtgaagtgccc
aaggttaagtgccacgaattcagagatgaattgggcacggtccgtgtcctcaaggagctc
ttctcatgtggagatagacaccattatataaatgaagtgggcccttgtggcttataa

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_2|482_aa
MSELPFTIASKRIKYLGIQLTRDVKELFKENYKPLLNEIKEDTTKWKNIPCSWVGRINIV
KMAILPKVIYRLNAIPMKLPMPFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLP
DFKLHYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFN
KWCWENWLAICRKLKLDHFLTPYTKINPRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD
FMSKTPKAMATKDKIDKWDLIKLKSFCTAKQTTIRVNRQPTKWEKIFATYSSDKGLISRI
YNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKNCSPSLAIREMQIKTTMRYH
LTPVRMAIIKKSGNNRIKVGIWATVVKTPGFGDTERPNKAIAIMAWCLEESLTLKLEDVQ
FHDLGKVGAVIVTKDDAMFLEGKGDKTQTEKNMQEIIEQLDVTISVYEKGKLEEHPAKFW
MQ

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_2|1449_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggaactcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaccaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg
aaaatggccatactgcccaaggtaatttatagactcaatgccatccccatgaagctacca
atgcctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccgcatcgccaaatcaatcctaagccaaaagaacaaagctggaggcatcacgctacct
gacttcaaactacactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac
agagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaac
tatctgatcttcgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat
aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcacttcctt
acaccttatacaaaaatcaatccaagatggattaaagacttaaacgttagacctaaaacc
ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaacaaactaccatcagagtgaacaggcaacct
acaaaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggca
aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa
aactgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccat
ctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacaggataaaagttggc
atttgggctacagtagtcaaaactccaggctttggtgacacagaaagaccaaacaaagct
attgctataatggcatggtgtttggaagagagcttgaccctaaagcttgaagatgttcag
tttcatgacttaggaaaagttggagcagtcattgtgaccaaagatgatgccatgttcttg
gaaggaaaaggtgacaagactcaaactgagaaaaatatgcaagaaataattgagcagtta
gatgttactattagtgtctatgaaaaaggaaaactagaagaacatccggcaaaattttgg
atgcaatag

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_3|69_aa
MTTRIQSAVAYGSYDVHGYPCQELFTGASLPAPVISSKNWLRLHFTSDGNHRQRGFSAQY
QAEFTEVSE

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_3|210_bp
atgaccaccaggatccaatcagctgtagcctatgggtcatatgatgtacatggctacccc
tgtcaggagctgttcaccggagccagcctcccagcccccgttatcagcagcaagaactgg
ctgcgactgcacttcacatcggatggcaaccaccggcagcgcggattcagtgcccaatac
caagctgaattcaccgaagttagtgaataa

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_4|186_aa
MGKEIQLKPKANVSSYVHFLLNYRNKFKEQQPNTYVGFKEFSRKCSEKWRSISKHEKAKY
EALAKLDKARYQEEMMNYVGKRKKRRKRDPQEPRRPPSSFLLFCQDHYAQLKRENPNWSV
VQVAKATGKMWSTATDLEKHPYEQRVALLRAKYFEELELYRKQCNARKKYRMSARNRCRG
KRVRQS

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_4|561_bp
atgggaaaagaaatccagctaaagcctaaggcaaatgtctcttcttacgttcactttttg
ctgaattacagaaacaaattcaaggagcagcagccaaatacctatgttggctttaaagag
ttctctagaaagtgttcggaaaaatggagatccatctcaaagcatgaaaaggccaaatat
gaagccctggccaaactcgacaaagcccgataccaggaagaaatgatgaattatgttggc
aagaggaagaaacggagaaagcgggatccccaggaacccagacggcctccatcatccttc
ctactcttctgccaagaccactatgctcagctgaagagggagaacccgaactggtcggtg
gtgcaggtggccaaggccacagggaagatgtggtcaacagcgacagacctggagaagcac
ccttatgagcaaagagtggctctcctgagagctaagtacttcgaggaacttgaactctac
cgtaaacaatgtaatgccaggaagaagtaccgaatgtcagctagaaaccggtgcagaggg
aaaagagtcaggcagagctga

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_5|213_aa
MEREGGKEEEKLGEPGPKIMGSSETIKCQQGDYPSAPLKDCKKEDTKSMAFFPLGKKGDN
EKPPLGIPWSSWKLKTKCGNCESYLWDMFSQEMEEMARIAKSILSQKNKAGGIMLPDFKL
YYKATVTKTAWYWYQNRDIDQWNRTEPSEITLHIYNYLIFDKPEKNKQWGKDSLFNKWYW
ENWLAICRKLKLDPFLTPYTKINSRWIKDLNKR

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_5|642_bp
atggagagagagggagggaaggaggaggagaagttgggggagcctggacctaagataatg
gggtcttcagagactatcaagtgccaacaaggggattacccctctgcccctctaaaggat
tgcaagaaagaagacaccaaatccatggcattttttcccttaggaaagaagggagataat
gagaagcctccacttggtattccttggagcagctggaagctaaaaaccaaatgtgggaac
tgtgaatcctatctctgggatatgttttctcaggaaatggaggaaatggcccgcatcgcc
aagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgacttcaaacta
tactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagat
caatggaacagaacagagccctcagaaataacgctgcatatctacaactatctgattttt
gacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtactgg
gaaaactggctagccatatgcagaaagctgaaactggatcccttccttacaccttataca
aaaattaattcaagatggattaaagacttaaataaacgttag

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_6|224_aa
MPILPSHTCGNPGRLPNGIQQGSTFNLGDKVRYSCNLGFFLEGHAVLTCHAGSENSATWD
FPLPSCRADDACGGTLRGQSGIISSPHFPSEYHNNADCTWTILAELGDTIALVFIDFQLE
DGYDFLEVTGTEGSSLCHHSPSPDEETEAQQAAASEQEDCLKVQYQPPQHPGKRHGCLLA
WQLTIEIWKIRAMKLTLYSHRMIVPGRAVTQMFLVMWELDGCKA

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_6|675_bp
atgccaattctccccagccacacatgtgggaacccagggaggctgcccaatggcatccag
cagggttcaaccttcaacctcggtgacaaggtccgctacagctgcaaccttggcttcttc
ctggagggccacgccgtgctcacctgccacgctggctctgagaacagcgccacgtgggac
ttccccctgccttcctgcagagctgatgatgcctgtggtgggaccctgcggggccagagt
ggcatcatctccagcccccacttcccctcggagtaccataacaatgccgactgcacatgg
accatcctggctgagctgggggacaccatcgccctggtgtttattgacttccagctggag
gatggttacgactttctggaagtcactgggacagaaggctcctccctctgccatcattct
ccttcgccagatgaagaaactgaggcacagcaggctgcagccagtgagcaagaggactgc
ctcaaagtccagtatcagccaccccagcatcctggaaaaaggcatggctgcctgctagct
tggcaattaaccattgagatttggaaaatcagggctatgaaacttactttgtatagccac
agaatgattgtaccaggcagagctgtgacccagatgtttctagtgatgtgggaattggat
ggatgcaaagcatga

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_7|43_aa
MAYCRQKCGIPDSDVKTCLCPCMSYVALDKCSNISGNYAYLNK

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_7|132_bp
atggcctattgcaggcagaaatgtggaatcccagacagtgatgtgaaaacctgcctctgc
ccctgcatgagctacgtggccttggataaatgctccaacatctctggcaactatgcttat
ctcaacaagtga

>gi568815597f:33764192_33964749|GENSCAN_predicted_peptide_8|63_aa
MVHPNDGMLFSLTKDGNSDTYYTAWMNCEDIMLPLPGIIFLPQISKIDSLISFRCVRKCL
LLX

>gi568815597f:33764192_33964749|GENSCAN_predicted_CDS_8|189_bp
atggtacatccgaacgacggaatgttattcagcctgacaaaagacgggaattctgacaca
tactacacagcatggatgaactgtgaggacatcatgctccctctgcctggaataatcttc
ctcccccaaatcagcaagattgactccctcatatccttcaggtgtgtccgcaaatgtctg
ctgctcgnn