GENSCAN 1.0	Date run: 6-Nov-116	Time: 04:11:02

Sequence gi568815592f:37157918_37431169 : 273252 bp : 43.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  12272  12740  469  1  1   21   97   516 0.784  38.48
 1.02 Intr +  12856  12962  107  2  2  113  117   215 0.991  26.83
 1.03 Intr +  13025  13114   90  1  0  -24  105   174 0.898   8.09
 1.04 Intr +  13208  13574  367  1  1  119   89   840 0.978  81.92
 1.05 Intr +  15079  15255  177  2  0   81   93   152 0.965  14.89
 1.06 Term +  16017  16174  158  1  2  132   43    28 0.910   0.80
 1.07 PlyA +  17447  17452    6                               1.05

 2.03 PlyA -  17561  17556    6                               1.05
 2.02 Term -  55041  54540  502  2  1  -38   42   593 0.003  36.05
 2.01 Init -  61113  60509  605  0  2   65   92   414 0.058  34.28
 2.00 Prom -  62999  62960   40                             -11.82

 3.00 Prom +  63028  63067   40                              -4.76
 3.01 Init +  64399  64876  478  0  1   72  117   140 0.590  11.12
 3.02 Term +  87731  87849  119  0  2  -21   38   324 0.846  15.20
 3.03 PlyA +  89451  89456    6                               1.05

 4.00 Prom +  92283  92322   40                              -2.06
 4.01 Init + 100001 100056   56  1  2   82   83    35 0.005   3.16
 4.02 Intr + 121387 121694  308  1  2   33   94   300 0.919  21.19
 4.03 Intr + 124268 124447  180  0  0   57   95    64 0.915   3.84
 4.04 Intr + 126432 126547  116  1  2    9   92   147 0.839   7.37
 4.05 Intr + 133326 133440  115  2  1  108   86    88 0.972  10.62
 4.06 Intr + 155001 155107  107  1  2   97   99   148 0.997  16.73
 4.07 Intr + 155899 155974   76  0  1   99   78   161 0.963  15.29
 4.08 Intr + 158786 158913  128  0  2   92   91   150 0.849  16.10
 4.09 Intr + 159194 159289   96  1  0   90  109    79 0.989  10.41
 4.10 Term + 165154 165297  144  0  0    7   38   146 0.742  -0.59
 4.11 PlyA + 165732 165737    6                               1.05

 5.06 PlyA - 166185 166180    6                               1.05
 5.05 Term - 166402 166375   28  0  1   87   48    44 0.050  -2.15
 5.04 Intr - 173476 173155  322  2  1  110   82   194 0.304  15.92
 5.03 Intr - 177249 177131  119  2  2   49   65    46 0.078  -1.49
 5.02 Intr - 185336 185282   55  0  1  129   81    28 0.163   4.24
 5.01 Init - 187692 187560  133  0  1   82   63    68 0.669   4.00
 5.00 Prom - 187816 187777   40                              -2.46

 6.00 Prom + 191548 191587   40                              -5.46
 6.01 Init + 196248 196358  111  2  0   69   80    93 0.789   6.83
 6.02 Intr + 202529 202657  129  1  0  104   87    77 0.940  10.09
 6.03 Intr + 210567 211301  735  2  0   81   81   284 0.766  18.64
 6.04 Intr + 213595 213657   63  0  0   78   78   100 0.982   6.91
 6.05 Intr + 216703 216792   90  0  0   80   79   118 0.999  10.29
 6.06 Intr + 219009 219116  108  2  0   79   76    65 0.889   4.88
 6.07 Intr + 223233 223437  205  2  1   69  113   142 0.798  13.57
 6.08 Term + 236591 236766  176  0  2   96   34   104 0.143   3.72
 6.09 PlyA + 236797 236802    6                               1.05

 7.03 PlyA - 238215 238210    6                              -0.45
 7.02 Term - 241530 241357  174  0  0   81   32    55 0.056  -3.04
 7.01 Init - 250364 250311   54  2  0   83   70    82 0.900   7.18
 7.00 Prom - 257962 257923   40                              -3.56

 8.02 PlyA - 258101 258096    6                               1.05
 8.01 Term - 260250 260144  107  1  2  133   54    24 0.243   2.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 109764 109782   19  2  1   73   83    34 0.856   1.58

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_1|455_aa
RPRWLRRPERSRWQRRRRDRQQQQQQQQQQPLASCPAALPHEPHEPLTPPFSALPDPAGA
PSRRQSRQRPQLSSDSPSAFRASRSHSRNATRSHSHSHSPRHSLRHSPGSGSCGSSSGHR
PCADILEVGMLLSKINSLAHLRAAPCNDLHATKLAPGKEKEPLESQYQVGPLLGSGGFGS
VYSGIRVSDNLPPGRGNLTETLGFQVAIKHVEKDRISDWGELPNGTRVPMEVVLLKKVSS
GFSGVIRLLDWFERPDSFVLILERPEPVQDLFDFITERGALQEELARSFFWQVLEAVRHC
HNCGVLHRDIKDENILIDLNRGELKLIDFGSGALLKDTVYTDFDGTRVYSPPEWIRYHRY
HGRSAAVWSLGILLYDMVCGDIPFEHDEEIIRGQVFFRQRVSSECQHLIRWCLALRPSDR
PTFEEIQNHPWMQDVLLPQETAEIHLHSLSPGPSK

>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_1|1368_bp
cggccgcggtggctgaggaggcccgagaggagtcggtggcagcggcggcggcgggaccgg
cagcagcagcagcagcagcagcagcagcaaccactagcctcctgccccgcggcgctgccg
cacgagccccacgagccgctcaccccgccgttctcagcgctgcccgaccccgctggcgcg
ccctcccgccgccagtcccggcagcgccctcagttgtcctccgactcgccctcggccttc
cgcgccagccgcagccacagccgcaacgccacccgcagccacagccacagccacagcccc
aggcatagccttcggcacagccccggctccggctcctgcggcagctcctctgggcaccgt
ccctgcgccgacatcctggaggttgggatgctcttgtccaaaatcaactcgcttgcccac
ctgcgcgccgcgccctgcaacgacctgcacgccaccaagctggcgcccggcaaggagaag
gagcccctggagtcgcagtaccaggtgggcccgctactgggcagcggcggcttcggctcg
gtctactcaggcatccgcgtctccgacaacttgccgcccggacgagggaacctgacggag
accctgggcttccaggtggccatcaaacacgtggagaaggaccggatttccgactgggga
gagctgcctaatggcactcgagtgcccatggaagtggtcctgctgaagaaggtgagctcg
ggtttctccggcgtcattaggctcctggactggttcgagaggcccgacagtttcgtcctg
atcctggagaggcccgagccggtgcaagatctcttcgacttcatcacggaaaggggagcc
ctgcaagaggagctggcccgcagcttcttctggcaggtgctggaggccgtgcggcactgc
cacaactgcggggtgctccaccgcgacatcaaggacgaaaacatccttatcgacctcaat
cgcggcgagctcaagctcatcgacttcgggtcgggggcgctgctcaaggacaccgtctac
acggacttcgatgggacccgagtgtatagccctccagagtggatccgctaccatcgctac
catggcaggtcggcggcagtctggtccctggggatcctgctgtatgatatggtgtgtgga
gatattcctttcgagcatgacgaagagatcatcaggggccaggttttcttcaggcagagg
gtctcttcagaatgtcagcatctcattagatggtgcttggccctgagaccatcagatagg
ccaaccttcgaagaaatccagaaccatccatggatgcaagatgttctcctgccccaggaa
actgctgagatccacctccacagcctgtcgccggggcccagcaaatag

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_2|368_aa
MKQQQWCGMTAKMGTVLSGVFTIMAVDMYLIFEQKHLGNGSCTEITPKYRGASNIINNFI
ICWSFKIVLFLSFITILISCFLLYSVYAQIFRGLVIYIVWIFFYETANVVIQILTNNDFD
IKEVRIMRWFGLVSRTVMHCFWMFFVINYAHITYKNRSQGNIISYKRRISTAEILHSRNK
RLSISSGFSGSHLESQYFERQRMFSLMVGIFSVLNTTQFFIFDLNQKTHICYEAKFSIYV
DSKSELVTWTLFHRANISTGLSLTTIIIGCFLFYCIHKNIYMGLLIYAMWIITYELINFS
IVLLLNGIIKDHFKTLSYLHWIFQISHMLLHFFCLPFIVKHAYNLYKESQTVGRKRRHRL
CSTIAVNS

>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_2|1107_bp
atgaaacagcagcagtggtgtgggatgactgccaaaatgggcaccgtgttgtcaggggtc
ttcaccatcatggccgtagacatgtatctcatctttgaacagaagcacctagggaatggc
agttgcactgagatcacaccaaagtacaggggtgcaagtaacatcataaataacttcatc
atctgctggagttttaaaatcgtcctcttcctgtctttcatcaccatcctcatcagctgc
ttcctcctgtactcagtgtatgcccagatcttcaggggcctggtcatctacattgtctgg
atttttttctatgaaactgcaaacgtcgtaatacaaatcctcaccaacaatgactttgac
attaaagaggtcagaatcatgcgctggtttggcttggtgtctcgtacagtcatgcactgt
ttctggatgttctttgtcatcaactatgcccacataacctacaaaaaccggagccagggc
aatataatttcctacaagagacgaatttctacagcggagattctccacagcagaaataaa
agattatcaatttcgagtgggttcagtggctcacacctggaatcccagtactttgagagg
cagaggatgttctccctcatggtgggcatcttctctgtccttaataccacccagttcttc
atctttgacctgaaccagaagacacacatttgctatgaggccaagttcagcatctacgtg
gactcaaagtcggagctagtcacttggaccctgttccacagggctaatatcagcactggc
ctctccctcaccaccatcatcatcggctgcttcctcttttattgtatccacaagaatatc
tacatggggctgctgatctatgccatgtggatcatcacttacgagctcatcaacttctcc
atagtcctgctcctcaacgggatcatcaaagatcacttcaagacgctgagttatttgcac
tggatcttccaaatctcacacatgctcctgcactttttctgtctgcccttcatcgtcaag
catgcatacaacctttacaaggaatcccagactgtgggcaggaaacgccgccacaggctc
tgctccaccattgcagtgaactcatga

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_3|198_aa
MAPPGLDPNPAPRFELVPGLERGQAAGADIREPAGTAGSRGRGPPGVPEGAGCRDARVLH
LGGPAPLTRKGRVSYLSLAPACSVKWEAQVCSRGSWQLQLHLGGQILPIPGPPPRAQGGF
DAQPQFGQLQPAQEGGTSACSVEPEVGSTALVWAAAVAPAKTPTQNNSDNKEEEEQEEEE
EEEQEEDEEEEEMEEEIY

>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_3|597_bp
atggcccccccagggctcgaccccaacccggctccgagatttgaactggtgccgggattg
gagagaggccaggcagcgggagcagacatccgggagcctgcagggacggcaggaagccgg
ggaagggggcctcctggggtccccgagggtgcaggctgcagagacgcccgggtcctgcac
ctaggaggtcccgccccgctaactcgaaaggggcgggtctcctacttgtccctggctcct
gcctgctcagtgaagtgggaggcccaggtctgcagccgtgggtcctggcagctgcagctg
cacctgggagggcagatcctgcctattcccggccctcctccaagagcacagggaggcttc
gatgcacagccacagtttgggcagctgcagcccgcccaagagggcgggacttctgcctgc
tccgtagagccggaggtggggtctacggctttggtttgggcggctgcagtagcacccgca
aaaacaccaacacagaacaacagtgacaataaggaagaggaggagcaggaggaggaggag
gaggaagagcaggaggaggacgaggaggaggaggagatggaggaggagatctactag

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_4|441_aa
MAAENSKQFWKRSAKLPGSFIKERSKVNTVPLKNKKASSFHEFARNTSDAWDIGDDEEED
FSSPSFQTLNSKVALATAAQVLENHSKLRVKPERSQSTTSDVPANYKVIKSSSDAQLSRN
SSDTCLRNPLHKQQSLPLRPIIPLVARISDQNASGAPPMTVREKTRLEKFRQLLSSQNTD
LANTERRKLTLQRKREEYFGFIEQYYDSRNEEHHQDTYRQIFERILFIWAIRHPASGYVQ
GINDLVTPFFVVFLSEYVEEDVENFDVTNLSQDMLRSIEADSFWCMSKLLDGIQDNYTFA
QPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQFAFRWMNNLLMRELPLRCTIRLWDT
YQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQSSRESSSSSSSSEAQNMGAQEDVSRR
QQGLRGLRMEFEKHHAGHSED

>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_4|1326_bp
atggccgctgagaacagcaagcagttttggaagaggagcgctaagctgccggggagtttc
attaaagaacgatcaaaagtcaacacagttcctctgaagaataagaaggcctccagtttt
catgagtttgcacggaataccagtgatgcttgggacattggcgatgatgaggaagaggac
ttttcctcaccttctttccaaactctgaactcaaaagttgctttggcaactgcagcccaa
gttctagaaaaccacagcaagctgagagtaaaaccagaacggtcccagtcaacgacatcg
gacgtccctgccaactacaaggtcataaagtccagcagtgatgcccagctgtccagaaac
tctagtgatacatgcctgaggaacccactccacaaacagcaatcactccctctccggccc
atcatccccctcgttgcccggatctcggatcagaacgcttctggggcccccccaatgact
gtccgggagaaaacccgcctagaaaaattccgtcaacttctctccagccagaacactgac
ttagcaaacactgagaggaggaagttgaccctgcagcggaagcgggaggaatattttggc
ttcattgaacagtattatgactctcgaaacgaggaacatcaccaggatacctacagacag
atctttgaaagaattctatttatttgggccatccgccaccctgccagtgggtatgtccag
ggaattaatgacctggtcactccattctttgtcgtcttcctctcagaatatgtggaagag
gatgtggagaactttgacgtgaccaacttgtctcaagacatgctgcgaagcattgaggct
gacagcttttggtgcatgagcaagctgctggatggaatccaggataactacacctttgca
caaccaggaatccagaagaaggtgaaggcactggaagagcttgtcagccggattgatgag
caggtacataatcacttcaggaggtacgaggtagaatacctgcagtttgccttccgctgg
atgaacaacctgcttatgcgggagcttcctcttcgctgcaccatccgcctgtgggacaca
tatcagtctgaaccagaagggttctcccactttcatctctacgtgtgtgcagccttcttg
atcaagtggaggaaagagatcttggatgaggaggattttcagtcatctcgggaaagcagc
agcagcagcagcagcagcgaagcccagaacatgggagcccaggaagatgtcagcaggcgc
cagcaaggcctcagagggctcaggatggaatttgagaagcaccatgcagggcactctgag
gactaa

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_5|218_aa
MGYYAAIKKDEFMSFVGTWMKLETIILSKLSQGQKSKRRMFSLIGLGVLCIEKAKQGKRE
WGRLILWEAHLFHHESKIITLASMAASKTVSNQEISVGHPERELGPSTRHGILTVVQVTH
PETWSGGCQKVKTWAVGVSFVQQEWFLAGVHGSHSGHSDQTAIRDEGSLGPRRRQHLSAV
VIWGIGKHVLESVCLGEKQPNFFVAPVYCSHQCDPAGD
>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_5|657_bp
atgggatactatgcagccataaaaaaggatgagttcatgtcttttgtagggacatggatg
aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaagcaaacgccgcatg
ttctcactcatagggcttggagtcctctgcatcgagaaagcaaaacaagggaagagagaa
tggggaaggctgatactgtgggaagctcatctttttcatcatgaatctaaaatcatcacc
ttggcatctatggcagcttctaaaactgtctccaaccaggaaattagtgttggacatcct
gagagagaactgggcccatccacgcggcatggaatactgacagtggtccaagtgacacat
ccagagacatggagtggtggatgccagaaagttaagacctgggcggtgggtgtgagcttt
gtacaacaggagtggttcctggccggggtccacggctcgcacagtggccacagtgatcag
actgccatcagagatgaaggcagtctgggtccccggaggagacagcacctatcggcggta
gtgatttggggcatcggcaaacatgtacttgagtctgtatgcctcggcgagaagcagccc
aatttcttcgttgccccagtgtattgtagccaccagtgtgacccagctggtgactaa

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_6|538_aa
MGEPGFFVTGDRAGGRSWCLRRVGMSAGWLLLEDGCEVTVGRGFGVTYQLVSKICPLMIS
RNHCVLKQNPEGQWTIMDNKSLNGVWLNRARLEPLRVYSIHQGDYIQLGVPLENKENAEY
EYEVTEEDWETIYPCLSPKNDQMIEKNKELRTKRKFSLDELAGPGAEGPSNLKSKINKVS
CESGQPVKSQGKGEVASTPSDNLDPKLTALEPSKTTGAPIYPGFPKVTEVHHEQKASNSS
ASQRSLQMFKVTMSRILRLKIQMQEKHEAVMNVKKQTQKGNSKKVVQMEQELQDLQSQLC
AEQAQQQARVEQLEKTFQEEEQHLQGLEIAQGEKDLKQQLAQALQEHWALMEELNRSKKD
FEAIIQAKNKELEQTKEEKEKMQAQKEEVLSHMNDVLENELQCIICSEYFIEAVTLNCAH
SFCSYCINEWMKRKIECPICRKDIKSKTYSLVLDNCINKMVNNLSSEVKERRIVLIRERK
ASPNNVPVMGYLSSSSRQAVSPACCCFTTFQMPTSSNFQSHYLCCPLSFRLCEYFNLL

>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_6|1617_bp
atgggggagcccggcttcttcgtcacaggagaccgcgccggtggccggagctggtgcctg
cggcgggtggggatgagcgccgggtggctgctgctggaagatgggtgcgaggtgactgta
ggacgaggatttggtgtcacataccaactggtatcaaaaatctgccccctgatgatttct
cgaaaccactgtgttttgaagcagaatcctgagggccaatggacaattatggacaacaag
agtctaaatggtgtttggctgaacagagcgcgtctggaacctttaagggtctattccatt
catcagggagactacatccaacttggagtgcctctggaaaataaggagaatgcggagtat
gaatatgaagttactgaagaagactgggagacaatatatccttgtctttccccaaagaat
gaccaaatgatagaaaaaaataaggaattgagaactaaaaggaaattcagtttggatgaa
ttagcaggtcctggagctgaaggcccctcaaatttgaaatccaaaataaataaagtgtct
tgtgaatctggtcagccagtgaaatcacaggggaaaggtgaagtggccagtacaccctct
gacaatttggatcctaagttgactgcccttgagccaagtaagaccacaggggctcccatt
taccctggcttccccaaagtcacagaggttcatcatgagcagaaagcctcaaactcttca
gcatctcagagaagcttacagatgtttaaggtgaccatgtccaggattctgaggctcaaa
atacagatgcaggaaaaacatgaagccgttatgaatgtgaaaaagcagacccaaaagggg
aactcaaagaaagttgtgcaaatggagcaggaacttcaggacttacagtcccagctgtgt
gcagagcaggctcagcagcaggcaagagtggagcaactagagaagactttccaggaagag
gaacagcatcttcagggtttggagatagcccaaggagaaaaggacctgaagcaacagctg
gcccaggctctgcaggagcattgggctctaatggaagagctaaatcgcagcaagaaggac
tttgaagcaatcattcaagccaagaacaaagaattagagcagaccaaggaagagaaggag
aagatgcaagcacagaaggaagaagttcttagccacatgaatgatgtgctagagaatgag
ctccaatgtattatttgttcagaatacttcattgaggctgtcaccttgaactgtgcccac
agtttctgctcctactgtatcaatgaatggatgaagcggaagatagaatgccccatttgt
cggaaggacattaagtccaaaacgtactctttggttctggacaattgcattaataagatg
gtaaataatctgagctcagaagtgaaagaacgacgaattgttctcattagggaacgaaaa
gcatccccaaacaatgtaccagtgatgggctacctgagctcatctagtcgccaagcagta
tctcctgcttgctgctgctttactacattccagatgcccacctcatccaatttccagagc
cactatctctgctgtccactttccttcaggctctgtgaatacttcaacctgctgtga

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_7|75_aa
MAEGKGEADTFFMGRQNRDPSDHLYIHESHICITKPALTSALSTLLRSHKILMIDTFSST
QGQQLLHVLIWAVIS

>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_7|228_bp
atggcagaaggcaaaggagaagcagataccttcttcatggggcggcagaacagagacccc
agtgaccatctatacatccatgagtctcacatttgtatcaccaagccagctctcacctct
gcactctcaactctgctgagaagccacaaaatccttatgattgacaccttctcctccacc
caagggcagcagctccttcatgtcctcatatgggcagtgatttcataa

>gi568815592f:37157918_37431169|GENSCAN_predicted_peptide_8|35_aa
XAHNEGRPWPTERRWPSVSQEENSHQNLTMLAAEL

>gi568815592f:37157918_37431169|GENSCAN_predicted_CDS_8|108_bp
ngtgcacacaatgaaggaaggccatggcccacagagagaagatggccatctgtaagccag
gaagaaaactcccaccagaacctgaccatgctggctgcagaattgtga