GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:47:19 Sequence gi568815592f:166899413_167139975 : 240563 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1975 2067 93 0 0 66 66 87 0.696 4.68 1.02 Intr + 2139 2361 223 2 1 -38 111 158 0.851 3.20 1.03 Intr + 4203 4372 170 1 2 111 29 88 0.335 4.87 1.04 Intr + 4766 4969 204 1 0 74 20 123 0.321 3.50 1.05 Intr + 9566 9646 81 1 0 133 23 35 0.131 1.33 1.06 Intr + 23313 23442 130 2 1 100 81 53 0.634 6.07 1.07 Term + 26169 26329 161 1 2 56 43 116 0.672 2.00 1.08 PlyA + 28088 28093 6 1.05 2.00 Prom + 28118 28157 40 -3.06 2.01 Sngl + 29687 29863 177 1 0 79 41 127 0.234 2.05 2.02 PlyA + 30081 30086 6 1.05 3.09 PlyA - 30112 30107 6 -0.45 3.08 Term - 30379 30176 204 1 0 73 44 185 0.985 9.97 3.07 Intr - 33487 33318 170 2 2 58 80 84 0.612 4.27 3.06 Intr - 33857 33763 95 1 2 32 64 93 0.427 0.91 3.05 Intr - 39596 39483 114 1 0 61 94 152 0.906 12.66 3.04 Intr - 43677 43607 71 0 2 58 119 47 0.133 2.78 3.03 Intr - 46935 46891 45 0 0 88 62 50 0.071 1.01 3.02 Intr - 51300 51198 103 2 1 74 58 26 0.065 -1.62 3.01 Init - 56770 56685 86 1 2 101 101 246 0.721 25.49 3.00 Prom - 83907 83868 40 -1.86 4.00 Prom + 98099 98138 40 -5.16 4.01 Init + 100001 100102 102 1 0 99 100 281 0.426 30.64 4.02 Intr + 103781 103835 55 1 1 108 107 8 0.521 3.25 4.03 Intr + 116040 116104 65 1 2 69 78 48 0.164 0.34 4.04 Intr + 122997 123223 227 2 2 111 59 174 0.798 13.58 4.05 Intr + 125370 125482 113 0 2 66 89 38 0.708 1.72 4.06 Intr + 127135 127203 69 2 0 127 59 3 0.347 0.45 4.07 Intr + 133191 133230 40 1 1 46 82 66 0.273 -0.92 4.08 Intr + 134463 134559 97 0 1 27 86 127 0.311 6.41 4.09 Intr + 146543 146641 99 1 0 86 131 27 0.058 7.11 4.10 Term + 168372 168479 108 2 0 17 41 167 0.631 3.41 4.11 PlyA + 172310 172315 6 -0.45 5.04 PlyA - 172436 172431 6 1.05 5.03 Term - 175775 175741 35 0 2 98 45 55 0.498 -0.05 5.02 Intr - 179361 179294 68 2 2 102 94 1 0.600 0.65 5.01 Init - 183459 182183 1277 0 2 58 53 406 0.496 25.80 5.00 Prom - 183816 183777 40 -7.36 6.02 PlyA - 183831 183826 6 1.05 6.01 Sngl - 186647 186354 294 2 0 88 54 225 0.788 14.80 6.00 Prom - 192797 192758 40 -5.86 7.04 PlyA - 193887 193882 6 1.05 7.03 Term - 198759 198531 229 2 1 34 43 248 0.932 11.00 7.02 Intr - 203575 203476 100 2 1 114 41 36 0.247 0.67 7.01 Init - 208969 208945 25 1 1 70 94 42 0.147 1.64 7.00 Prom - 222484 222445 40 -4.16 8.00 Prom + 225627 225666 40 -3.06 8.01 Init + 228896 228997 102 1 0 83 83 64 0.038 3.70 8.02 Term + 236828 237943 1116 1 0 129 44 774 0.988 69.14 8.03 PlyA + 238883 238888 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 60216 60337 122 1 2 58 50 106 0.846 2.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_1|353_aa MPGERTAGGENAGERTAGKTPCERTKEGERQGRERRENTEGGKCQVNAQGGKRLVNAQRR KPGERTSSRTPSERTRGECCANTHGKHNRASTQWGERHANAQQEERFALAAQQCPSAKGS RMLFSSSLPATAMGSRIPAGTPPHPRNVNTSGIRIGRCDPGTGGPPQPGSSHFTACSSLL GPGNMQTLSSKTAQDPTFAFLNSSQGTPMLRAEAHAGGTPASGQLWVNFRGVRFQWNEEL TVEDGRGLAGHVSTEGEVKYFQNTYYAQVAVFSTGAARAAHISGCSPRHMVIKTEEQMVT VLTYATLSASTLYGPPLCHPAGIFYVTQQELEEKRPWSLGVEAQSHPGLQKNS >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_1|1062_bp atgcctggtgaacgtacagcgggaggggagaacgctggtgaacgcacagcggggaaaacg ccatgtgaacgcacaaaggaaggagaacgccaggggagagaacgccgggagaacacagag ggaggaaaatgccaggtgaacgcacaagggggaaaacgcctggtgaacgcacaaaggaga aagccaggtgaacgcacaagcagcagaacgccaagtgaacgcacgaggggagaatgctgc gccaacacacacgggaaacacaaccgcgcgagcacgcagtggggagaacggcacgcgaac gctcagcaagaagaacgctttgccttggcagcgcagcagtgcccctctgcaaaggggtct cgcatgctgttcagcagctctcttcccgccacagcgatgggcagccggatcccagctgga actccgccacatcccaggaatgtgaataccagtggaatccggataggacggtgtgaccca ggcacaggtgggccaccccaacccggctcctcacacttcacggcatgcagcagcctcctg ggccctgggaacatgcagactctgagcagcaagactgcgcaggacccgacatttgcattt ctaaacagctcccaggggacgccaatgctgcgggctgaggcccacgctggaggaacaccg gccagtgggcagctttgggtgaacttcagaggtgttaggttccagtggaatgaggaactg acagtggaggatggcagaggcctggcagggcatgtcagcactgagggggaggttaaatat tttcaaaacacctactatgctcaagtagctgtgttcagtactggggcagcacgagcagcc cacatctcagggtgcagtccaagacatatggtcatcaagacagaagagcagatggtgaca gtcctcacctatgccacattgtccgcatctacactgtacggccctcccctgtgccatcca gctggcatcttctatgtaacacaacaagagctggaagagaagaggccctggtccctgggc gtggaggcgcagagccacccaggactgcaaaagaactcttaa >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_2|58_aa MSWGSGHMEQTQSELETPPRPTRHQRGLLPCVELFVLTGSLRQEGNRLRGENSSKRVP >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_2|177_bp atgtcctggggctctggtcacatggaacagacccagtctgagctagagacaccccccagg cccaccaggcatcagcgtgggctcctgccatgcgtggagctcttcgtcttgacagggtcc ctgcgccaagaaggcaacaggctccgaggggaaaactcgagcaagagagtcccctga >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_3|295_aa MRPAALRGALLGCLCLALLCLGGADKRLRASVSRPFSLEHPEPGHCLDLHLRAYPARCVK LTLAPGLLGQMTDVGIWSDLLPEMRAYWPDVIHSFPNRSRFWKHEWEKHGTCAAQVDALN SQKKYFGRSLELYRELDLNSCSRDRCVIHKSRDIYDVDLYGKCLQTLALEKGSRKMIACR ALTVLTWTHVRIDEGHVPAMFAQSSVFRELITGVAKATGATHLLSCFQDEEVQTIGQIEL CLTKQDQQLQNCTEPGEQPSPKQEVWLANGAAESRGLRVCEDGPVFYPPPKKTKH >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_3|888_bp atgcgccctgcagccctgcgcggggccctgctgggctgcctctgcctggcgttgctttgc ctgggcggtgcggacaagcgcctgcgtgcgtcagtgtccaggcccttctcattggagcac ccagagccaggacactgcctggatctgcacctcagggcgtaccctgctcggtgtgtcaag ctgaccctggccccaggacttctgggtcagatgaccgacgtgggaatctggtcggatctt ttgccagaaatgagggcatactggcctgacgtaattcactcgtttcccaatcgcagccgc ttctggaagcatgagtgggaaaagcatgggacctgcgccgcccaggtggatgcgctcaac tcccagaagaagtactttggcagaagcctggaactctacagggagctggacctcaacagc tgcagtagggaccgatgtgtcatccacaaatcccgggatatttacgacgtggacctttac ggaaaatgtttgcagactcttgccttagagaaaggaagccgtaagatgatcgcttgcagg gccctcaccgtcctcacctggactcatgtgcgaatagatgagggacatgtgcctgccatg tttgcccagagctcggtgttcagggaactgattacaggggtggcaaaagccacaggggcc acacatttgctgagctgcttccaggatgaggaagtacagacaattggtcagatagaactg tgcctcactaagcaagaccagcagctgcaaaactgcaccgagccgggggagcagccgtcc cccaagcaggaagtctggctggcaaatggggccgccgagagccggggtctgagagtctgt gaagatggcccagtcttctatcccccacctaaaaagaccaagcattga >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_4|324_aa MAATAAAVVAEEDTELRDLLVQTLENSGVLNRIKNKTPLVNESLKKFLNTKDDAMLHVVV EVVHSKTPDGAIRMKANDEANQSDTSVSLSEPKSKSSLHLLSHETKIGSFLSNRTLDGKD KAGLCPDEDDMEGDSFFDDPIPKPEKTYGLRKEPRKQAGSLASLSDAPPLKSGLSSLAGA PSLKDSESKRGNTVLKDLKLISDKIGSLGLGTGEDDDYVDDFNSTSHRSEKSEISIGEEI EEDLSVEIDDINTSDKHPYCPSARPFLAALMGSSLLRALLHQGAVLSLPNHKQTISKDKE EANSITIQLDHIDNCGPLHPTTAE >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_4|975_bp atggcggcgacggcggccgcagtggtggccgaggaggacacggagctgcgggacctgctg gtgcagacgctggagaacagcggggtcctgaaccgcatcaagaacaaaactcctttagtt aatgagagcctgaaaaagtttttaaataccaaagacgatgctatgttgcatgtggtggtt gaggttgtacacagcaagactccagatggagccatccgcatgaaggccaatgatgaggcc aatcagagtgatacaagtgtctccttgtcagaacccaagagcaaaagcagccttcactta ctgtcccatgaaacaaaaattggatcttttctaagcaacagaactttagatggcaaagac aaagctggcctttgtccagatgaagatgatatggaaggagattctttctttgatgatccc attcctaagccagagaaaacttacggtttgaggaaggaacctaggaagcaagcaggaagt ctggcctcgctctcggatgcaccccccttaaaaagtggactcagctccctggcgggagcc ccttctttaaaagactctgagagtaaaaggggaaatacagttttgaaagatctgaaattg atcagtgataaaattggatcacttggattaggaactggagaagatgatgactatgttgat gattttaatagtaccagccatcgctcagagaaaagtgagataagtattggtgaagagata gaagaagacctttctgtggaaatagatgacatcaataccagtgataagcacccctactgc cccagtgccaggccttttctagcagctctcatgggttcatccctgctgcgggcattgctc caccaaggggcagtgctctccctgccgaaccataagcagacaatcagcaaggataaagaa gaggcaaacagcatcaccattcagctggaccatattgacaattgtggaccacttcaccca acaacagcagaataa >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_5|459_aa MIVYLENPIVSAQNLLKLISNFSRVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRHVKDLFKENYKPLLNKIKEDTNKWKNIPCSWVGRINIVKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKSILSQKNAAGGIMLPDFKLYYKATV TKTAWYWYQNRDIDQWNRTELSKIMPHIYNHLIFDKPDKNKKWVKDSLFNKCCWENWLAI CRKLKLDPILTPYTKINSRWIKDLNARPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMA TKDKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFASYSSDKGLISRIYNELKQIYKK NTNNPINKWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTTMRYHLTSVRMAIIK KSGNNRVEPRAAPPSDSPVGQAWEDTEPGYHFQYNIEQE >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_5|1380_bp atgattgtatatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc aacttcagcagagtctcaggttacaaaatcaatgtgcaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaaggcacgtgaaggacctcttcaaggag aactacaaaccgctgctcaacaaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatattgcccaaggtaatttat agattcaatgccattcccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcctgcatcgccaagtcaatcctaagccaa aagaacgcagctggaggcatcatgctacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaacagag ctctcaaaaataatgccgcatatctacaaccatctgatctttgacaaacctgacaaaaac aagaaatgggtaaaggattccctatttaataaatgctgctgggaaaactggctagccata tgtagaaagctgaaactggatcccatccttacaccttacacaaaaattaattcaagatgg attaaagacttaaatgctagacctaaaaccataaaaaccctagaagaaaacctaggcaat accattcaggacataggcatgggcaaagacttcatgtctaaaacaccaaaagcaatggca acaaaagacaaaattgataaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgcaagctac tcctctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaa aatacaaacaaccccatcaacaagtgggcgaaggatatgaacagacacttctcaaaagaa gacatttatgcagccaaaagacacatgaaaaaatgctcatcatcactggccatcagagaa atgcaaatcaaaaccacaatgagataccatctcacatcagttagaatggcgatcattaaa aagtcaggaaacaacagggtggaaccacgtgcagcacctccaagtgacagccctgtgggg caggcctgggaggacacagagccagggtatcacttccagtacaatattgagcaggagtga >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_6|97_aa MGKKQSRKTENSKNQNASPPPKECSSSPATEQSWTENDFDELREGDFRQSNYSKLKEEVR THGKEVKNLEKRLDEWLTRITSAEKSLKDPMELKTTA >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_6|294_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagaatgcctctcctcct ccaaaggaatgcagctcctcaccagcaacggaacaaagctggacggagaatgactttgat gagttgagagaaggagacttcagacaatcaaactactccaagctaaaggaggaagttcga acccatggcaaagaagttaaaaaccttgaaaaaagattagacgaatggctaactagaata accagtgcagagaaatccttaaaggacccgatggagctgaaaaccacagcatga >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_7|117_aa MGLVSLVAALIFMSVLAEDVNVVLRYAVMWMTTECCLRSPIRLGVALWNPLDEDISLCRS AGSSTRKDPRRRKGTLGAQDDSRPELQKESCEETQAAMTRHQETLQATLSNEHPVGI >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_7|354_bp atggggctagtctccctggtggcagcgctgatcttcatgtcagtgttggcagaagatgtg aatgtggtgctgagatatgcagtgatgtggatgacgacagagtgctgccttcgctctccc atcaggctgggagtggctctgtggaatccccttgatgaagacatctccctgtgtcggagc gccggctccagcacccggaaggacccgagaaggcggaaaggcacccttggagcccaagac gacagccgccctgagctccaaaaggaaagctgtgaagaaacccaggcggccatgactcga catcaggaaacacttcaggccacgctgtctaatgaacaccctgtgggcatttag >gi568815592f:166899413_167139975|GENSCAN_predicted_peptide_8|405_aa MILPTSAVPCVRCLRALSIIGLESLSDHGGEYPEESMNFSDVFDSSEDYFVSVNTSYYSV DSEMLLCSLQEVRQFSRLFVPIAYSLICVFGLLGNILVVITFAFYKKARSMTDVYLLNMA IADILFVLTLPFWAVSHATGAWVFSNATCKLLKGIYAINFNCGMLLLTCISMDRYIAIVQ ATKSFRLRSRTLPRSKIICLVVWGLSVIISSSTFVFNQKYNTQGSDVCEPKYQTVSEPIR WKLLMLGLELLFGFFIPLMFMIFCYTFIVKTLVQAQNSKRHKAIRVIIAVVLVFLACQIP HNMVLLVTAANLGKMNRSCQSEKLIGYTKTVTEVLAFLHCCLNPVLYAFIGQKFRNYFLK ILKDLWCVRRKYKSSGFSCAGRYSENISRQTSETADNDNASSFTM >gi568815592f:166899413_167139975|GENSCAN_predicted_CDS_8|1218_bp atgattctgccgacaagtgctgtcccctgtgtccggtgcctccgcgctctatctattatt gggttggaaagccttagtgaccacggtggagagtacccagaggaatcaatgaatttcagc gatgttttcgactccagtgaagattattttgtgtcagtcaatacttcatattactcagtt gattctgagatgttactgtgctccttgcaggaggtcaggcagttctccaggctatttgta ccgattgcctactccttgatctgtgtctttggcctcctggggaatattctggtggtgatc acctttgctttttataagaaggccaggtctatgacagacgtctatctcttgaacatggcc attgcagacatcctctttgttcttactctcccattctgggcagtgagtcatgccaccggt gcgtgggttttcagcaatgccacgtgcaagttgctaaaaggcatctatgccatcaacttt aactgcgggatgctgctcctgacttgcattagcatggaccggtacatcgccattgtacag gcgactaagtcattccggctccgatccagaacactaccgcgcagcaaaatcatctgcctt gttgtgtgggggctgtcagtcatcatctccagctcaacttttgtcttcaaccaaaaatac aacacccaaggcagcgatgtctgtgaacccaagtaccagactgtctcggagcccatcagg tggaagctgctgatgttggggcttgagctactctttggtttctttatccctttgatgttc atgatattttgttacacgttcattgtcaaaaccttggtgcaagctcagaattctaaaagg cacaaagccatccgtgtaatcatagctgtggtgcttgtgtttctggcttgtcagattcct cataacatggtcctgcttgtgacggctgcaaatttgggtaaaatgaaccgatcctgccag agcgaaaagctaattggctatacgaaaactgtcacagaagtcctggctttcctgcactgc tgcctgaaccctgtgctctacgcttttattgggcagaagttcagaaactactttctgaag atcttgaaggacctgtggtgtgtgagaaggaagtacaagtcctcaggcttctcctgtgcc gggaggtactcagaaaacatttctcggcagaccagtgagaccgcagataacgacaatgcg tcgtccttcactatgtga