GENSCAN 1.0 Date run: 2-Oct-118 Time: 18:31:03 Sequence gi568815582r:79494463_79699902 : 205440 bp : 42.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 462 457 6 1.05 1.10 Term - 12090 11932 159 0 0 94 51 58 0.193 -0.34 1.09 Intr - 15385 15189 197 2 2 32 92 148 0.553 7.81 1.08 Intr - 16778 16712 67 2 1 53 94 29 0.700 -2.44 1.07 Intr - 17258 17220 39 2 0 89 85 68 0.700 4.10 1.06 Intr - 17494 17416 79 0 1 30 84 73 0.644 -0.27 1.05 Intr - 17733 17548 186 2 0 77 66 119 0.525 6.48 1.04 Intr - 18156 18088 69 2 0 74 89 66 0.434 2.68 1.03 Intr - 29745 29647 99 2 0 66 81 108 0.024 6.21 1.02 Intr - 42804 42657 148 1 1 44 27 109 0.151 -1.23 1.01 Init - 43436 43322 115 2 1 52 105 124 0.809 10.92 1.00 Prom - 43905 43866 40 -6.95 2.00 Prom + 47508 47547 40 -7.75 2.01 Init + 47741 47915 175 1 1 24 80 199 0.716 12.26 2.02 Term + 48127 48230 104 2 2 47 45 89 0.523 -2.04 2.03 PlyA + 48581 48586 6 1.05 3.00 Prom + 53262 53301 40 -4.45 3.01 Init + 58699 58928 230 0 2 51 31 182 0.106 6.63 3.02 Intr + 68448 68728 281 0 2 75 103 196 0.273 16.00 3.03 Intr + 73750 73945 196 2 1 10 17 120 0.024 -5.35 3.04 Intr + 77670 77797 128 0 2 69 85 125 0.991 9.70 3.05 Term + 80727 80869 143 1 2 84 47 163 0.997 8.91 3.06 PlyA + 81685 81690 6 1.05 4.02 PlyA - 82067 82062 6 1.05 4.01 Sngl - 83185 83024 162 1 0 61 42 217 0.912 8.75 4.00 Prom - 83952 83913 40 -5.75 5.05 PlyA - 84015 84010 6 1.05 5.04 Term - 96045 95826 220 2 1 70 48 188 0.014 8.53 5.03 Intr - 105485 104323 1163 2 2 79 84 1465 0.012 131.88 5.02 Intr - 112078 111959 120 1 0 14 70 112 0.066 1.77 5.01 Init - 119377 119318 60 1 0 74 93 47 0.187 5.10 5.00 Prom - 121909 121870 40 -5.85 6.04 PlyA - 122017 122012 6 -0.45 6.03 Term - 122522 122412 111 2 0 99 54 105 0.803 5.78 6.02 Intr - 130158 130000 159 0 0 6 100 97 0.239 1.96 6.01 Init - 134182 133970 213 1 0 67 66 135 0.539 7.99 6.00 Prom - 140013 139974 40 -3.15 7.02 PlyA - 140888 140883 6 1.05 7.01 Sngl - 150053 149658 396 2 0 32 36 268 0.097 11.00 7.00 Prom - 154192 154153 40 -8.35 8.00 Prom + 154454 154493 40 -7.15 8.01 Init + 158025 158124 100 2 1 51 105 92 0.958 7.67 8.02 Intr + 160817 160928 112 0 1 63 3 148 0.460 2.32 8.03 Intr + 161863 161984 122 1 2 66 89 91 0.593 6.22 8.04 Intr + 162483 162664 182 1 2 7 72 80 0.646 -3.03 8.05 Intr + 164111 164365 255 1 0 33 96 227 0.390 14.72 8.06 Term + 166111 166314 204 0 0 21 37 136 0.259 -1.81 8.07 PlyA + 166862 166867 6 1.05 9.06 PlyA - 167135 167130 6 1.05 9.05 Term - 168317 168171 147 2 0 90 39 75 0.556 -0.28 9.04 Intr - 168642 168511 132 0 0 65 94 87 0.958 6.92 9.03 Intr - 173314 173209 106 0 1 58 110 48 0.081 3.30 9.02 Intr - 194688 194601 88 1 1 61 93 127 0.940 8.61 9.01 Init - 198172 198052 121 1 1 82 110 56 0.479 7.66 9.00 Prom - 200904 200865 40 -3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 151146 151060 87 1 0 93 91 53 0.867 5.12 S.002 Init - 151340 151220 121 2 1 43 87 80 0.882 3.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_1|385_aa MRTYIKPGIQGLKYRRCSINGDCHSEVQRFHILSVTSSASQRKTRKLRGWNRMFECDSVL SQSRYELRIDSPNDPLYHSIKCADLAVEPPALPLKLTRTGIRGFIKEEKQPGMLMGAVGC QIHNELPNCRAEGGFFSEESQLRCLDPRVEDERSVFQDKEKHCHSKSYGIMKQAVSTGSE FAITGNNQKESITRNIGEEVRALGVRAAVWPICDWYFAAANQGGLCLQTEAEEILQAQWA TECRLETPFRGWHDINKNNKGEGLERKDTEEALIRRGSSEPKKQNKFGAFVDNSGVISKP GLKEHASPEKLRERQCHARPWAPTVNELHAQAKPWFQVFPTFSLLCMIPVPMQESLTLYG CWAGDLQLLLCLPKTLRDIGHGHTL >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_1|1158_bp atgagaacctacatcaagccaggaattcagggcctgaaatatagaaggtgctcaatcaac ggcgactgtcattctgaggttcagaggtttcacattctatctgtgacctcgtctgcctca caaaggaaaaccaggaaacttcgaggttggaacaggatgttcgagtgtgactctgtgcta tctcaaagcagatatgaactgagaatagattcaccaaatgacccactttatcattcaata aaatgtgctgatctggcagttgagcctcctgctctcccgttgaaactgacccgaacaggg atcagagggtttatcaaagaggaaaagcaaccagggatgctgatgggcgcagttggatgc cagattcataatgagctccccaactgccgggcagaaggaggtttcttctcagaagagagt cagctcagatgcttagatccaagggtagaggatgaaagatctgtctttcaagataaagaa aaacattgtcattcaaagagctatgggattatgaagcaggctgtctctacaggtagtgag ttcgccatcacagggaataatcaaaaagaatccataacaaggaatattggagaagaagtt cgtgcactgggtgtcagagcagcagtctggccaatctgtgactggtattttgctgcagct aatcaaggtggcttatgtttgcagaccgaagcagaggaaattttgcaggcccaatgggct actgaatgccgattggaaactccttttcggggttggcatgacattaataaaaataataag ggagaaggtttggagagaaaagacacagaagaggcattgataagaagaggctcatcagag cctaagaaacaaaataagtttggtgcttttgtggataacagtggagtgatatcaaaacca ggcctaaaagagcacgcatccccagagaaattgagggagaggcaatgccatgccaggccc tgggcccccactgtgaatgaattacacgctcaggccaagccctggtttcaggtcttcccc acattctctctgctatgcatgattccagttcccatgcaggagagcctcaccctttatggg tgctgggctggggatcttcagctgctgctttgcttgccaaagacgctaagagatattggt catgggcacacactctga >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_2|92_aa MGTLEGGGNRKSKRAPRSLRAGTQTGKRFSASGGLSLARSGVPKAPLVRVIFGPCHVVKV STLGTVARSDDLSKAALGLENLASLDFLGLVH >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_2|279_bp atgggaactttggaaggaggaggaaacaggaagtccaagcgagctccacgaagcctccgg gcaggaacgcaaacaggaaagcggttctcagccagtggtggcctctcgctagctaggtct ggtgtccccaaggcccccctggtgcgtgttatttttggaccatgccacgtggtgaaggtc tccactctgggtacagtggccagatctgatgaccttagcaaagccgctcttggcctggag aatcttgcaagcttggactttctggggctcgtgcactga >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_3|325_aa MPAVMLMLCLGNSSGSLLRMAAVPQGTATPGKALAIATSPSWQRNLHSQQLAEWLPMEPG LSGPLAQRQRPIEEGLVSKSNSNKTKAKILQASISNSNVSTGQQVHLCLLLSIHPFSKYQ VSVTSRLALGGYTMETDSRTYAQEDLSYGLKAVPIKHWTFQTPKLNLNLADRSWKLCKGS LAGMGVCGWPLMVPGVFHWVPVVTHKMTHRLIKSPGVGIRQTWVQIPVLSGKWEGLILEQ FAGVKENSEISTTNFKAPKSCIGGGESGSNTTGAVFQRGVHMNLSTSFPLPYCEVVIAQA LPNAMTPLSEKQAADHEGPPLATAD >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_3|978_bp atgccagctgtcatgctgatgctctgcctcggcaacagctctgggtctcttttgagaatg gcagccgttcctcaaggcacagcaactccagggaaggccttggccattgccacctctcca tcatggcagagaaatctgcattcacagcagttggctgagtggcttccaatggagcctggt ctctctggacccctggcccagagacagagacccatagaggaggggctggtatccaaatca aattcaaacaaaacaaaagcaaaaatccttcaggcctcaatttcaaattcaaatgtgtct actggtcagcaggttcatttgtgcctcctcctgtccattcatccgttcagcaaatatcaa gtcagtgttacatctaggcttgcactaggtggttatactatggaaacagacagcagaact tacgctcaggaggatttatcgtacgggctgaaggctgtacccataaagcactggactttc cagacaccaaagctaaacctcaatctggcagataggagctggaagctctgcaaaggaagt ctggcagggatgggtgtctgtggatggcccctaatggtccctggtgtcttccactgggtt cctgttgtaactcacaagatgacccacaggcttattaagagcccaggtgttggaatcaga cagacctgggttcaaattcctgttctatctggaaaatgggaaggcctcattttggagcaa tttgctggagtcaaagaaaacagtgaaatttccacaaccaacttcaaagcccccaaatcc tgtatcggtggaggggagtctggtagcaacaccacaggggccgtttttcaaagaggggtg cacatgaatctcagtacaagtttccctctcccctattgtgaagtggtcattgcccaagcc ttacccaatgccatgaccccattatcagaaaagcaggcagctgaccatgaaggcccaccc ttggctactgctgattga >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_4|53_aa MSGVRDNFRGAEDSDVPEALPFGGHPKPPWAMMTTATMHRVSPGHQIYAIFDL >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_4|162_bp atgtcaggagttagagacaacttccgtggagcagaagacagtgatgttcccgaagccctg ccctttggaggtcacccaaaacctccatgggctatgatgacaacagctaccatgcaccga gtgtctcctggacaccagatctatgccatctttgacctgtaa >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_5|520_aa MTIIIITITLLPNHPNVIGKEHSIMMGDAPRMSRSMSGVFSAQKNTSERSATPTLPPRTK PIWRSGGGGGGGGRRMASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQ CGRLIAGGSLSSTPMSTPCSSVPPSPSFSAPSPGSGSEQKAHLEDYYWMTGYPQQLNPEA LGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGAGASLGGSGEEMGPAAAVVS AVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASAGGAGGAGGGGPASAG GGGGGGGGGGGGGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMSVRELNRQLRGVSKEE VIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHLKQEISRLVRERDAYK EKYEKLVSSGFRENGSSSDNPSSPEFFMAMKEPQKSLSSGQQVAEAEIWDPISHLRPTLS VDTDSAMQRVRGASLEAQLESLKLARAFGAMVLKPGMKEM >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_5|1563_bp atgaccataatcatcatcactattacattactaccaaatcatcccaatgttattggaaag gaacactccattatgatgggagatgctccgaggatgtcacggagcatgtctggtgttttc agtgcacagaaaaacacatcagagagatcagcaacacccacgttgccaccaagaactaag cccatctggcggagcggcggcggcggcggcggcggcggcaggagaatggcatcagaactg gcaatgagcaactccgacctgcccaccagtcccctggccatggaatatgttaatgacttc gatctgatgaagtttgaagtgaaaaaggaaccggtggagaccgaccgcatcatcagccag tgcggccgtctcatcgccgggggctcgctgtcctccacccccatgagcacgccgtgcagc tcggtgcccccttcccccagcttctcggcgcccagcccgggctcgggcagcgagcagaag gcgcacctggaagactactactggatgaccggctacccgcagcagctgaaccccgaggcg ctgggcttcagccccgaggacgcggtcgaggcgctcatcagcaacagccaccagctccag ggcggcttcgatggctacgcgcgcggggcgcagcagctggccgcggcggccggggccggt gccggcgcctccttgggcggcagcggcgaggagatgggccccgccgccgccgtggtgtcc gccgtgatcgccgcggccgccgcgcagagcggcgcgggcccgcactaccaccaccaccac caccacgccgccggccaccaccaccacccgacggccggcgcgcccggcgccgcgggcagc gcggccgcctcggccggtggcgctgggggcgcgggcggcggtggcccggccagcgctggg ggcggcggcggcggcggcggcggcggaggcggcgggggcgcggcgggggcggggggcgcc ctgcacccgcaccacgccgccggcggcctgcacttcgacgaccgcttctccgacgagcag ctggtgaccatgtctgtgcgcgagctgaaccggcagctgcgcggggtcagcaaggaggag gtgatccggctgaagcagaagaggcggaccctgaaaaaccgcggctatgcccagtcctgc cgcttcaagagggtgcagcagagacacgtcctggagtcggagaagaaccagctgctgcag caagtcgaccacctcaagcaggagatctccaggctggtgcgcgagagggacgcgtacaag gagaaatacgagaagttggtgagcagcggcttccgagaaaacggctcgagcagcgacaac ccgtcctctcccgagtttttcatggccatgaaagagcctcaaaagtcactgtcctcaggc caacaagtggcagaagctgagatctgggacccaataagtcacctccgccccactttaagt gtagacacagacagcgctatgcagagggtccggggagcatcactggaagctcagctggaa tctttaaagctggccagagcttttggtgcaatggtcttgaagcctggaatgaaggagatg tag >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_6|160_aa MSPFFTCDSLVVMRRPSCEEGETSGSGVAPRRGLAQQDTVKLPLGNLKTPEGNGSLHSGR ALPHWVLLQLQPRMRENLERDIGWIRHELRGSIPWVRDEKEVHHLRGGCEHQIDFRGKAL GDLKVSTWIPQLVQDTPLDLEESIQFHANQLVLMEQQGDQ >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_6|483_bp atgtctccattcttcacttgtgactcacttgtggtcatgcgtaggcccagctgtgaggaa ggtgaaacctccgggtcaggtgtagctcccaggagaggtttagcccagcaggacacagtg aaattgcccctggggaacctgaagaccccagaaggaaatggaagcctgcactcgggcagg gccttgccccactgggtccttctgcagctgcagccacggatgagggaaaatttggagaga gatattggatggatcagacatgagttgagaggcagtattccctgggtaagagatgagaaa gaagtccatcacctaagaggtggctgtgagcatcagatagacttcagaggcaaagcactg ggagatttaaaggtgagcacatggatcccccaactggttcaggatacacctttagacctt gaagagagtattcagttccatgcaaaccagttggttctcatggaacaacagggagaccaa tga >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_7|131_aa MTLRLSQVTAGSGPRASPLTIVEHVGGAFLKIDAQKGHMSVNPCTLSKPEILQVPFTSST SKPPSELAEMQFQIISVLFSKWRQRLHMLVKVLPGKLQSLGVQCSGVAGKVTAKFQTHKG AEVASGNQVME >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_7|396_bp atgactctcagactcagccaggtcactgccggttccgggccgcgtgcttccccactgacc atcgtagaacatgttggaggtgccttcttgaagatagatgctcagaaaggacatatgtca gtcaatccctgcacactctctaagccagagattctccaagtgccgttcacatcatctaca tccaaaccacctagtgagcttgctgaaatgcaattccaaatcatttcagtcctattctcg aagtggagacagagactccacatgcttgtcaaggtgctcccagggaaactccagagtttg ggggtccagtgctctggggttgctggaaaggtcacagccaaattccaaacccacaagggg gctgaagttgcaagtggcaatcaagtcatggagtaa >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_8|324_aa MGANLPGPECEKVHGEGDRTRSYSREGRKKKMGGAVLSTQNIVQSCPHGAYMLVLVHSWE AETDDDDDDDVKKVRPPGTNGTVTTTERRVSQYELYPDSNSQDLYRQGSGDALYEPGMWE TFSIAWALFIPCWGFAEFLLSSSLTIATLFVLSAVDLGVTDTRGKRAVFAVEGEFHAASR QDNFIKDAESDFREYSGQPWKALGSFQSYPGQLSAPLCEELMVLGSSGQLQCAQEGISVI IQRGQSWCGKWVFTCRQGLEQLTNMPTQREPQNNGKLTMERRVKEEMVNIRLVVHNPTTC NVHLSTRQVYLGQKKNLMWLHHSV >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_8|975_bp atgggagccaatcttcctggacctgaatgtgagaaagtccatggagagggggacagaact aggagttactctagagagggaaggaagaagaagatgggaggtgctgttctaagcactcag aatatagttcaaagctgtcctcatggagcttacatgctagttctggtccactcatgggaa gctgaaacggatgatgatgatgatgatgatgtgaaaaaagtgagaccaccagggaccaac ggaacagtcaccactactgaaagacgtgtctcacaatatgaactttacccagactcaaac tcccaggacctctacaggcaaggctctggtgatgcactgtatgaacctggcatgtgggaa actttcagcattgcctgggctctgtttatcccatgctggggttttgctgaatttcttctt tcttcctccctgacaatcgccacgctctttgttctgtccgctgtggatttgggtgttaca gatacaaggggtaagagagctgtgtttgctgtagaaggtgagttccacgcagccagccgt caagataactttatcaaggacgcggaatcagatttcagagaatacagtggacagccttgg aaggcacttggaagtttccagtcctatcctggccagctcagtgcacccctctgtgaagag cttatggtccttgggagcagtggccagttacagtgtgcacaggaaggcatttcagtaatc attcagcgaggtcagagctggtgcgggaaatgggtcttcacatgcaggcagggtttggag cagcttacaaacatgcccacgcaaagggagccacaaaataatggcaaactgacaatggaa cggagggtgaaggaagaaatggtgaatattagacttgtggtgcacaacccaacaacgtgc aacgttcacttatcaaccaggcaggtttacttaggacagaagaaaaacctgatgtggcta caccattctgtctga >gi568815582r:79494463_79699902|GENSCAN_predicted_peptide_9|197_aa MMWGNLEPQASMHSAFPPPAPQPNTKQKHLDHDQQSQHAPETETKIRGFRLLKAENKTKQ NNNDNNNKIRQAPRGDLPSYCWLSALYCNHMSLGYPGERQESINQVRQHIPDPIGLQAAG QGMGTLCSDMSTKESSENELLQAKQIKIKPFTKQLAVVLVCHTLCKEPLAKDLNLVLPSR SFQSSEGDKQVSIYNGI >gi568815582r:79494463_79699902|GENSCAN_predicted_CDS_9|594_bp atgatgtggggaaacctggagccacaggcatccatgcattctgcatttcccccaccggca ccacaaccaaacacaaaacaaaaacatttggaccatgaccagcaaagccagcatgcacca gagacagaaactaaaatccgtggcttcaggctgctaaaagccgaaaacaaaacaaaacag aacaacaacgacaacaacaacaaaataagacaggctcctcgtggtgacctcccatcctat tgctggctctctgctctttactgcaaccacatgtcacttggataccccggagagcggcag gaaagcatcaaccaggtaaggcagcacattccagatcccattggtctgcaagctgccggt cagggcatgggcactctatgcagtgacatgtctactaaagaatcgtctgagaatgaactc cttcaggcaaagcaaataaaaattaagccattcaccaaacaattagcagtggtcctggtg tgccacaccctctgcaaagaacccttggcaaaagacttgaacttggtcttgccctccagg agtttccagtccagtgagggggacaaacaagtctccatctacaatggaatatag