GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:06:56 Sequence gi568815592f:34140781_34344881 : 204101 bp : 47.38% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1375 1436 62 0 2 60 117 17 0.649 2.24 1.02 Term + 4154 4856 703 2 1 77 54 345 0.458 23.00 1.03 PlyA + 6298 6303 6 1.05 2.04 PlyA - 6428 6423 6 1.05 2.03 Term - 8156 8075 82 1 1 135 43 56 0.465 2.97 2.02 Intr - 12766 12680 87 0 0 86 75 43 0.398 1.89 2.01 Init - 14170 14115 56 1 2 67 44 88 0.319 2.33 2.00 Prom - 25770 25731 40 -5.96 3.00 Prom + 29909 29948 40 -5.46 3.01 Init + 34873 34880 8 0 2 89 95 0 0.087 1.32 3.02 Intr + 38811 38987 177 0 0 36 54 108 0.034 1.23 3.03 Intr + 48997 49068 72 1 0 82 92 118 0.108 10.12 3.04 Intr + 49308 49544 237 0 0 24 8 270 0.718 9.33 3.05 Intr + 49686 49932 247 0 1 -38 33 395 0.841 18.86 3.06 Intr + 50050 50263 214 0 1 -38 -12 258 0.002 1.49 3.07 Term + 55609 56027 419 2 2 12 48 251 0.800 8.94 3.08 PlyA + 56599 56604 6 1.05 4.00 Prom + 62886 62925 40 -2.86 4.01 Init + 78661 78940 280 0 1 48 -9 211 0.458 4.57 4.02 Intr + 82647 82684 38 1 2 142 95 2 0.606 4.28 4.03 Intr + 83520 83616 97 2 1 100 26 61 0.134 0.68 4.04 Intr + 91381 91514 134 2 2 96 76 48 0.934 4.76 4.05 Intr + 92058 92113 56 2 2 82 74 39 0.721 -0.32 4.06 Intr + 94045 94240 196 1 1 73 48 113 0.928 5.22 4.07 Intr + 95324 95393 70 1 1 93 86 52 0.978 4.25 4.08 Intr + 96102 96183 82 1 1 73 69 38 0.500 -0.90 4.09 Intr + 96424 96537 114 1 0 105 101 46 0.499 7.16 4.10 Intr + 99991 100102 112 1 1 81 117 104 0.500 12.98 4.11 Intr + 101932 102015 84 0 0 127 80 46 0.995 7.72 4.12 Intr + 103739 103835 97 1 1 138 72 31 0.965 6.08 4.13 Term + 104199 104302 104 1 2 97 48 52 0.959 0.54 4.14 PlyA + 105427 105432 6 1.05 5.07 PlyA - 105634 105629 6 1.05 5.06 Term - 106088 106023 66 2 0 103 43 93 0.987 4.24 5.05 Intr - 106369 106264 106 0 1 117 105 119 0.999 16.72 5.04 Intr - 106712 106687 26 2 2 114 101 -9 0.950 -0.18 5.03 Intr - 106978 106901 78 1 0 90 79 98 0.721 8.85 5.02 Intr - 108452 108199 254 0 2 60 94 127 0.903 7.65 5.01 Init - 109858 109810 49 1 1 86 58 43 0.853 0.21 5.00 Prom - 113365 113326 40 -4.76 6.03 PlyA - 113982 113977 6 1.05 6.02 Term - 122707 122525 183 1 0 54 55 381 0.991 28.94 6.01 Init - 122893 122792 102 1 0 98 -5 254 0.463 17.34 6.00 Prom - 123505 123466 40 -6.16 7.00 Prom + 127040 127079 40 -4.56 7.01 Init + 128514 128675 162 2 0 97 66 117 0.674 9.40 7.02 Intr + 131969 132071 103 1 1 108 67 43 0.623 3.95 7.03 Term + 143776 143852 77 2 2 88 44 122 0.857 5.80 7.04 PlyA + 144953 144958 6 1.05 8.04 PlyA - 145303 145298 6 1.05 8.03 Term - 148151 147973 179 0 2 125 42 94 0.983 6.35 8.02 Intr - 152755 152671 85 1 1 113 99 72 0.937 10.19 8.01 Init - 201135 201082 54 0 0 93 69 87 0.563 8.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 172681 172786 106 1 1 30 50 163 0.824 4.58 S.002 Init + 176702 176759 58 1 1 75 61 90 0.836 6.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_1|254_aa MPTSASPLGPPRQPAALLQPRAPGAPALLGALQARELAGGHLPEPRCPRRSPRPRRPEGT VPSVGPPRATTHRAPARFKPRRGVGCVPACPGGAGSRRPEARFSRCPEPGPRRAALTLRT QRAAEPPGVRDAAGMSWRPPGRSRRQARGVRLLLRQASKREGDRRQPLLLLRRSPASEPE GAGEAARGAGSGRELRAGKGAERERPASAAGRGTGAWRRRNSGNSSRKKNESELESALES ERERERETRERGRR >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_1|765_bp atgcccaccagtgcaagccccttaggcccgccaagacagccagcagctcttctgcagccc cgggctccgggggcgccagctctcctcggagcactccaagcgcgggagctcgccggaggc cacctccccgagccccgatgtccccgccgcagccctcgccccaggaggccggagggcacg gtcccctctgtcggtccccctcgggccacaactcaccgcgcgccagcaaggtttaagccg cggcgaggggtcggctgcgtgccggcctgccccgggggcgcaggcagtcgacggccggag gcacggttctcgcggtgcccagagcccgggccgcggcgggcagcgctgacgctgcggacc cagcgcgccgcggagccgcccggagtccgggacgcagcagggatgagctggcggccgccc ggccggagccgccgccaagcgcgcggcgttcggctgctgctccggcaagcgagcaagcga gagggagatcggcggcaacccctgctcctgctgcggcggagcccggcgtccgagccggaa ggagcgggagaggcggcaagaggagctgggtctggccgagagctccgggctgggaaggga gcggagcgggagcgaccagcgagcgcagccggccgaggtacgggcgcctggcggcgccgc aacagcggcaacagcagcaggaaaaaaaatgagagcgagctagagagcgcgctggagagc gagcgagagagggagcgagagacgagagagcgaggaaggcgctga >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_2|74_aa MRAFGHRAAVLARDGEPGCCSQESVSHDLCSVRQKALPLGRQFSFVNRDGVAISLDEDWC RNGRVSITPCLLHE >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_2|225_bp atgagggcgttcggccaccgggcggcggtgctggcccgcgatggggagcctggctgctgc tcacaggagagcgtcagccatgacctctgcagtgttcggcaaaaggctttgcccctgggc aggcaattcagttttgtcaacagagatggtgttgccatttccctggatgaggactggtgc aggaacgggcgggtcagcatcactccctgcttactccacgagtag >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_3|457_aa MGRRGRSSRLFLVPAGSVELAAPVMPPPLQPESFQQPLQTDHYHHQTHLILTATLKGGYY AGMSFTTRSTTFSTNYQSLDSMQPPRVRSLETENRRLESKIQEYLEKKGPQVRDWGHYFK TMEDLRAQIFANSVNNASIILQIDNPHLAADETELAMCQSGERHLSGLTVEVDAPKSQDL AKIMADIQAQYDKLSQKNREKLNQYCSHQTEESTTVVTTQSAKIRAAEMTLMKLRRTVQS LEINLDSTRAEEQRQAQEYKALLNIQVKLEAEMATYHHLLEDGKDFNLGMPWTAAIPCKR PPSNRPPSPRRQDSGWQSGAAKRATTARGRQAATRDSQGQAHAGHQPHYSSASQVGTHGA RASTLLTAASRIPLNSCPHGNPSLQGSRLPGIPVRPQRRPCGRSLGTAAAHQVCCTPSVR GPSECTLRLLHCERERGERNMDPKTLALAELRLGPMP >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_3|1374_bp atgggaagaagggggcgcagctcccgcctgtttctggttcctgctggctccgtggagcta gcagccccggtcatgcctcccccactgcagccagagtcttttcagcagccactccagacg gaccactaccaccatcagactcacttaatcctcacagcaaccctaaaaggtgggtactac gcaggcatgagcttcaccactcgctctaccaccttctccaccaactaccagtccctggac tcaatgcagccgcccagagtgaggagcctggagactgagaatcggaggctggagagcaaa atccaggagtatctggagaagaagggaccccaggtcagagattgggggcattacttcaag accatggaggacctgagggctcagatcttcgcaaattctgtgaacaatgccagcatcatt ctgcagattgacaatccccatcttgccgctgatgagacagagctggccatgtgccagtct ggagagcgacatctctctgggttaaccgtggaggtagatgcccccaaatcacaggatctt gccaagatcatggcagacatccaggcccaatatgacaagctgtctcagaagaatcgagag aagctgaaccagtactgctcccaccagactgaggagagcaccacagtggtcaccacgcag tctgccaagatcagagctgctgagatgacgctcatgaagctgagacgtacagtccagtcc ttggagatcaacctggactcaacccgggcagaggagcagcgccaggcccaggagtacaag gccctgctgaatatccaggtcaagctggaggccgagatggccacctaccaccacctgctg gaagacggcaaggacttcaatcttgggatgccctggacagcagcaattccatgcaaacga ccaccatccaacagaccaccctccccccgccgccaggatagtggatggcaaagtggcgcg gcgaagcgggccacgaccgcgagggggcgccaggcagccacgcgagactcccagggtcag gcccacgcgggccaccagccgcactactcttccgcttcccaagtcggcacacacggcgcc cgtgcctccacgctgctgactgctgcaagtcgaatccctctcaactcctgtccgcacggg aaccccagtttacaggggagccggctgcctggcatcccggtgcggccacagaggcgtcct tgtggaaggagcctggggaccgctgcggcccatcaagtgtgctgcacgccctccgtgcgc ggcccgtcggagtgcaccttgcgcctcctgcattgtgaaagggagcggggagaacggaac atggacccaaagacactggcactggcagagctccgcctggggccgatgccttag >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_4|487_aa MGDVEKGKKIFIMKRSQCHIVEKGGKHKTGPNLHGLFGRKTGQAPGHSYIATIKNKDIIW GEDTLMEYLENPKKYIPGTKMIFVGIKKEEAGHGRLHGCCSTESQLTAARTPPAGSRYAA AARTPGSGLRVPVGLGERGWHYSMYHAGGIACVLVYPLVHPTQCKPLEDRNVATHFKATP LVVISSGVGEELGFLGDLALKKGGHTSHTDGERCTRERGRTGPSPHLPSRQAASPVPGPS SLRGLGEREVEGRAALGGSPSPDLGPKSQRKGPGPQVGLRISRTPSKSPQAPLSRCCAPL IGTPSRGYFWRWRGSKKASAFATSGGRGGARPVLSAQHRRSRQPGARTAGRRPSSREGKM SESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKT RGVSHILKLIAALSSALLPSLGLGAKGAWLLALGPPSPPPLAATPIFHLCPHHHTTQHTS RCRAPMG >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_4|1464_bp atgggtgatgttgagaaaggcaagaagatttttattatgaagcgttcccagtgccacatt gttgaaaagggaggcaagcataagactgggccaaatctccatggtctcttcgggcggaag acaggtcaggcccctggacactcttacatagccaccattaagaacaaagacatcatctgg ggagaggatacactgatggagtatttggagaatcccaagaagtacatccctggaacaaaa atgatctttgtcggcattaagaaggaagaggctgggcacggtcggctgcatgggtgctgc tcaacagaaagccagctgacagcagcacgcacacccccggccggctcacggtacgcggcc gctgcccgcactcccgggtcgggcctgcgcgtccctgtggggttgggcgagcggggctgg cattacagcatgtaccacgctggtggaattgcctgtgtacttgtctatcccctcgtgcac cccactcaatgcaagccccttgaggacaggaatgtagctacccatttcaaggccacaccc ttggtggtgatttcttcaggggttggagaggagctgggcttccttggggatttagccctg aagaagggcggccacacgtcccacacggacggggagagatgtacccgagagagagggcgg acaggaccctctccgcacctccccagccgccaggccgccagcccggtgcccgggcctagc agcctccgcggccttggagagcgcgaagtggaggggcgcgcggctctggggggcagcccg agcccagacctgggtcccaagtcccaacggaagggcccaggtccccaagtgggcctgcgt atctccagaacaccatctaagtcacctcaagctcccctgagccggtgctgcgctcctcta attgggactccgagccggggctatttctggcgctggcgcggctccaagaaggcatccgca tttgctaccagcggcggccgcggcggagccaggccggtcctcagcgcccagcaccgccgc tcccggcaacccggagcgcgcaccgcaggccggcggccgagctcgcgagaagggaagatg agtgagtcgagctcgaagtccagccagcccttggcctccaagcaggaaaaggacggcact gagaagcggggccggggcaggccgcgcaagcagcctccgaaggagcccagcgaagtgcca acacctaagagacctcggggccgaccaaagggaagcaaaaacaagggtgctgccaagacc cggggagtcagtcacatcctgaagctcattgctgccctgagctctgccctcctgccctcc ctgggcctgggggccaagggggcttggctcctggctctgggcccaccatcaccaccgcct ctggccgccacccccatcttccacctgtgccctcaccaccacactacacagcacaccagc cgctgcagggctcccatgggctga >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_5|192_aa MGFHLFGQAGLELLTSALGCPLPPPGGPCKPRCPRLRTAHPRLRIGGSRNLQATGFPLPD WLRPGAEARPISPGPLITRLTAFPGPGSQPRAFLSGLQRREANSDSMVGYVLGPFFLITL VGVVVAVVMYVQKKKRVDRLRHHLLPMYSYDPAEELHEAEQELLSDMGDPKVVHGWQSGY QHKRMPLLDVKT >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_5|579_bp atggggtttcacctctttggccaggctggtctcgaactcctgacatcagccctcggctgc cccctgccgccgccgggcggaccctgcaagccccgctgtccccgccttcgcaccgcccac ccccgcctccggattggcggctccagaaatctccaggccaccggctttccgctaccggat tggctgcgtccgggtgctgaggcccggcccatttccccgggtcctttgatcacgcgcctg acggcttttccggggcccgggagccaaccgagggcgttcctgtcggggctgcagcggcgg gaggccaacagcgactccatggtgggctatgtgttggggcccttcttcctcatcaccctg gtcggggtggtggtggctgtggtaatgtatgtacagaagaaaaagcgggtggaccggctg cgccatcacctgctccccatgtacagctatgacccagctgaggaactgcatgaggctgag caggagctgctctctgacatgggagaccccaaggtggtacatggctggcagagtggctac cagcacaagcggatgccactgctggatgtcaagacgtga >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_6|94_aa MAKIKARDLRGKKEELLKQLDDLKVELSQLRVAKTQKENLRKFYKGRNYKPLDLRPKKTR AMRRRLNKHEENPKTKKQQRKERLYPLGKYAVKA >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_6|285_bp atggccaagatcaaggctcgagatcttcgcgggaagaaggaggagctgctgaaacagctg gacgacctgaaggtggagctgtcccagctgcgcgtcgccaaaactcagaaagaaaacctc aggaaattctacaagggcaggaactacaagcccctggacctgcggcctaagaagacacgc gccatgcgccgccggctcaacaagcatgaggagaacccgaagaccaagaagcagcagcgg aaggagcggctgtacccgctggggaagtacgcggtcaaggcctga >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_7|113_aa MPEPPTHSMGSCAARASPTSTTPCSTAPSPIDHPRAEECERTAPDWQAAPPAAPPPAGFN CLCSEVWMEVPASQGRCEGLGPGPWAYADEDADSNKGIKKVLDENEKYVKDNM >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_7|342_bp atgcctgagcctcccacccactccatgggctcctgtgccgcccgagcctccccaacgagc accaccccctgctccacggcgcccagtcccatcgaccacccaagggctgaggagtgcgag cgcacggcgccggactggcaggcagctccacctgcagccccgccccctgcaggcttcaac tgcctctgcagtgaggtgtggatggaagtacctgcctctcagggtagatgtgaaggtctg ggtcctgggccctgggcctatgcagatgaggatgcggactccaacaaaggcattaagaaa gtactagatgaaaatgagaaatatgtgaaggataacatgtga >gi568815592f:34140781_34344881|GENSCAN_predicted_peptide_8|105_aa MEPEEEPSVAAVREVCEENQERKHRTYVYVLIVTEVLEDWEDSVNIGRKREWFKIEDAIK VLQYHKPVQASYFETLRQGYSANNGTPVVATTYSVSAQSSMSGIR >gi568815592f:34140781_34344881|GENSCAN_predicted_CDS_8|318_bp atggagcccgaggaggagccaagtgtggcagcagttcgtgaagtctgtgaggagaaccag gagaggaagcacaggacgtatgtctatgtgctcattgtcactgaagtgctggaagactgg gaagattcagttaacattggaaggaagagggaatggtttaaaatagaagacgccataaaa gtgctgcagtatcacaaacccgtgcaggcatcatattttgaaacattgaggcaaggctac tcagccaacaatggcaccccagtcgtggccaccacatactcggtttctgctcagagctcg atgtcaggcatcagatga