GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:38:44

Sequence gi568815593r:176292777_176516363 : 223587 bp : 45.77% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2254   2495  242  0  2  100   -5   171 0.282   5.35
 1.02 Intr +   3475   3544   70  1  1   77   77   106 0.259   7.58
 1.03 Intr +  20915  21069  155  1  2  118  115   112 0.947  15.77
 1.04 Intr +  29497  29649  153  1  0  110   59   118 0.986  10.29
 1.05 Intr +  31853  31981  129  2  0   75  105    84 0.956   8.61
 1.06 Intr +  43944  44100  157  0  1  126   88    19 0.958   5.71
 1.07 Intr +  44286  44370   85  2  1   53   94     6 0.279  -2.91
 1.08 Term +  52407  52669  263  1  2  111   55   279 0.984  22.59
 1.09 PlyA +  52893  52898    6                               1.05

 2.08 PlyA -  53312  53307    6                               1.05
 2.07 Term -  55032  54824  209  1  2   70   43   186 0.996   9.80
 2.06 Intr -  55287  55145  143  2  2   81   95    90 0.685   9.00
 2.05 Intr -  55580  55474  107  2  2   20   99    -8 0.390  -7.49
 2.04 Intr -  57961  57837  125  2  2  100   87   138 0.842  15.20
 2.03 Intr -  59972  59846  127  2  1   90   82   131 0.991  12.95
 2.02 Intr -  62989  62795  195  1  0   51   65   216 0.008  15.21
 2.01 Init -  69119  68826  294  2  0   70   66   131 0.313   4.29
 2.00 Prom -  70753  70714   40                              -5.86

 3.00 Prom +  71809  71848   40                              -5.86
 3.01 Init +  72788  72970  183  1  0   85   86   400 0.815  36.62
 3.02 Intr +  73556  73805  250  1  1   62   61   501 0.764  41.81
 3.03 Intr +  76031  76206  176  0  2   78   94   370 0.990  36.26
 3.04 Term +  78946  79119  174  0  0   83   44   207 0.994  13.56
 3.05 PlyA +  84597  84602    6                               1.05

 4.06 PlyA -  84855  84850    6                               1.05
 4.05 Term -  91598  91455  144  2  0   96   42   189 0.301  13.01
 4.04 Intr -  92551  92445  107  2  2   96  105   106 0.997  13.03
 4.03 Intr -  94133  94064   70  2  1   80   64   129 0.903   8.45
 4.02 Intr -  95567  95459  109  1  1   54   79   108 0.904   6.79
 4.01 Init -  95763  95657  107  0  2   85   77   177 0.999  15.99
 4.00 Prom -  95896  95857   40                              -9.46

 5.00 Prom +  95952  95991   40                             -14.47
 5.01 Init +  96044  96197  154  1  1  103   97   119 0.999  14.45
 5.02 Term +  96555  96721  167  1  2  111   46   236 0.999  19.78
 5.03 PlyA +  96956  96961    6                               1.05

 6.07 PlyA -  97671  97666    6                              -0.45
 6.06 Term - 100169  99998  172  1  1  106   43   262 0.998  20.80
 6.05 Intr - 103756 103703   54  0  0   94   48   114 0.916   6.09
 6.04 Intr - 104942 104831  112  0  1   38   97   205 0.996  15.84
 6.03 Intr - 105271 105154  118  1  1   75   68   212 0.999  17.94
 6.02 Intr - 117527 117481   47  0  2  120  115    63 0.911  10.33
 6.01 Init - 123587 123401  187  2  1   87   84   319 0.995  30.62
 6.00 Prom - 131309 131270   40                              -3.26

 7.00 Prom + 150185 150224   40                              -3.66
 7.01 Init + 153097 153152   56  0  2   82   55    28 0.243  -2.34
 7.02 Intr + 154358 154500  143  2  2   45   86    82 0.741   3.70
 7.03 Term + 154834 155225  392  2  2   69   51   329 0.578  22.45
 7.04 PlyA + 157160 157165    6                               1.05

 8.00 Prom + 160977 161016   40                              -2.16
 8.01 Init + 179719 179784   66  0  0   63   95    34 0.322   0.67
 8.02 Intr + 185114 185212   99  1  0   75   92    11 0.404   0.51
 8.03 Intr + 186412 186480   69  0  0  129   60    39 0.165   4.58
 8.04 Intr + 198393 198463   71  2  2   93   60    -9 0.026  -5.22
 8.05 Intr + 199418 199556  139  2  1  127   82    36 0.551   7.37
 8.06 Intr + 201223 201308   86  0  2  110  123    49 0.487   9.22
 8.07 Intr + 201408 201499   92  0  2   74   98   -70 0.410  -7.76
 8.08 Intr + 203710 203887  178  2  1   52   82   230 0.951  17.78
 8.09 Intr + 206138 206309  172  2  1   86   95   175 0.988  17.95
 8.10 Intr + 207227 207370  144  1  0  126   70    92 0.870  11.68
 8.11 Intr + 212597 212630   34  1  1   87   86    16 0.180  -0.90
 8.12 Term + 216489 216682  194  1  2   86   42   104 0.425   3.18
 8.13 PlyA + 218944 218949    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  62944  62795  150  1  0   83   65   212 0.862  18.44

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_1|417_aa
NKGQKLEPIPHRRLRMVTNTIEENFPLGTVQFLMDFVSPQHYPPREIVAHIIQKILLSGS
ETVDVLKEAYMLLMKIQQYEPLHPANAKTVEWDWKLLTYVMEEEGQTLPGRVLFLRYVVQ
TLEDDFQQTLRRQRQHLQQSIANMVLSCDKQPHNVRDVIKWLVKAVTEDGLTQPPNGNQT
SSGTGILKASSSHPSSQPNLTKNTNQLIVCQLQRMLSIAVEVDRTPTCSSNKIAEMMFGF
VLDIPERSQREMFFTTMESHLLRCKVLEIIFLHSCETPTRLPLSLAQALYFLNNSTSLLK
CQSDKSQWQTWDELVEHLQFLLSSYQHVLREHLRSSVIDRKDLIIKRIKPKPQQGDDITV
VDVEKQIEAFRSRLIQMLGEPLVPQLQDKVHLLKLLLFYAADLNPDAEPFQKGWSGS

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_1|1254_bp
aacaagggtcaaaaattagaacccatccctcatcgaagactaagaatggtaacaaatacc
attgaagagaattttcctctggggactgtgcagtttttgatggactttgtgtcaccccag
cattacccaccaagagaaatcgtggctcacatcatccagaaaatcttgctcagtggctct
gagactgtggatgtcctaaaggaggcctacatgcttctcatgaaaattcaacagtatgaa
ccgctacatccagccaatgccaagacagtggagtgggactggaaactgctcacctatgtc
atggaggaagagggacaaactctgcctgggcgagtccttttcctgcgttatgtcgttcag
accctagaagatgactttcagcagaccctgaggaggcaacggcagcacctgcagcaatcc
attgcaaacatggtgctttcctgtgacaagcagccccacaatgtcagggatgttatcaag
tggctggtcaaagcagtaactgaagatggattgactcagcccccaaatggaaatcaaacg
tcttcaggaacaggaatcttgaaagccagcagtagccacccttcttcccagcccaacctg
acaaagaacaccaatcagctgattgtgtgccagcttcagaggatgctctccatagccgta
gaggtggacaggacccccacctgcagctccaataaaattgccgagatgatgtttgggttt
gtgctggacattcctgagaggagccagagagaaatgttctttactaccatggaaagccac
cttctgcgctgcaaagtgttagaaatcatattcctccacagctgtgagacacccacccgc
ctgcctctgtctctggcccaggccctctactttctgaataattctacgtcactgctcaag
tgtcagtcagataaaagccagtggcagacttgggacgaattggttgagcatctgcagttt
ctgctgtccagttatcaacatgttttaagagaacacttaaggagttccgtgatcgaccga
aaggacttaataatcaaaaggattaagcccaaaccccagcaaggagatgacatcacagtg
gtagacgtagagaagcagattgaggccttccgcagccgcctgatccagatgctgggggag
cctcttgtcccccaactccaagacaaagtgcacttgttgaagctcctgctcttctatgct
gcggacttgaaccctgatgcagagccctttcaaaagggctggagcggctcctga

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_2|399_aa
MGRGRVSRSWTTTPSMPCAPLGQVSREIPRSPEALSAVLPARWVVLLPSGSWGPSKSPPL
SQWLGSRLRHSLSGDVGPRNLFSAGLRAAGSPAAAGSQVSLFLALEASAPLGKMSLPIGI
YRRAVSYDDTLEDPAPMTPPPSDMGSVPWKPVIPERKYQHLAKVEEGEASLPSPAMTLSS
AIDSVDKVPVVKAKATHVIMNSLITKQTQESIQHFERQAGLRDAGYTPHKGLTTEETKYL
RVAEALHKLKLQSGEVTKEERQPASAQSTPSTTPHSSPKQRPRGWFTSGSSTALPGPNPS
TMDSGSGDKDRNLSDKWSLFGPRSLQKYDSGSFATQAYRGAQKPSPLELIRAQANRMAED
PAALKPPKMDIPVMEGKKQPPRAHNLKPRDLNVLTPTGF

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_2|1200_bp
atggggagggggcgggtctcgcgatcgtggacaacaactcccagcatgccctgtgctccg
ctgggccaagtctcgcgcgagatcccgcggtctccggaggctttatctgcagtgctgcct
gcccgctgggtggtactgctacctagtgggtcttggggaccttcgaaatcgccgccgctc
tcacaatggcttgggtccagactgcgccacagcctctcgggagacgtgggccctcggaac
ctttttagtgccggactccgggccgcaggcagtcccgcggcagcaggatcacaggtgtct
ctgttcttagctcttgaggctagtgcgcctctaggcaagatgtccctgcccatcgggata
taccgccgggcagtcagctatgatgataccctcgaggaccctgcgcccatgactcctcct
ccatcggacatgggcagcgtcccttggaagccagtgattccagagcgcaagtatcagcac
ctcgccaaggtggaggaaggagaggccagtctaccctcccctgccatgaccctgtcatca
gccattgacagtgtggacaaggtcccagtggtgaaggctaaagctacccatgtcatcatg
aattctctgatcacaaaacagacccaggaaagcattcagcattttgagcgacaggcaggg
ctgagagatgctggctacacaccccacaagggcctcaccaccgaggagaccaagtacctt
cgagtggccgaagcactccacaaactaaagttacagagtggagaggtaacaaaagaagag
aggcagcctgcatcagcccagtccaccccaagcaccactccgcactcttcacctaagcag
aggcccaggggctggttcacttctggttcttccacagccttacctggcccaaatcctagc
accatggactctggaagtggggataaggacagaaacttgtcagataagtggagcctcttt
ggaccgagatcccttcagaagtacgattctggaagttttgccacccaggcctaccgagga
gcccagaagccctctccattggaactgatacgtgcccaggccaaccgaatggctgaagat
ccagcagccttgaagccccccaagatggacatcccagtgatggaaggaaagaaacagcca
ccacgggcccataacctcaaaccccgtgacctgaatgtgctcacacccactggcttctag

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_3|260_aa
MAPRPLGPLVLALGGAAAVLGSVLFILWKTYFGRGRERRWDRGEAWWGAEAARLPEWDEW
DVLREAASRNLTSASPQPEDEEDEEPALEELEQREVLVLGLDGAGKSTFLRVLSGKPPLE
GHIPTWGFNSVRLPTKDFEVDLLEIGGSQNLRFYWKEFVSEVDVLVFVVDSADRLRLPWA
RQELHKLLDKDPDLPVVVVANKQDLSEAMSMGELQRELGLQAIDNQREVFLLAASIAPAG
PTFEEPGTVHIWKLLLELLS

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_3|783_bp
atggcgccgcggccgctgggccccttggtgctggcgctgggcggcgccgcggcggtgctg
ggctcggtgctcttcatcctctggaagacctacttcggccgcggccgagagcggcgctgg
gaccggggagaggcctggtggggcgcggaggctgcccgcctccccgagtgggacgagtgg
gacgtccttcgggaggccgccagccgcaaccttacctccgcttccccgcagcccgaggac
gaggaggacgaggagccggcgctggaggagctggaacagcgcgaggtgctggtgctgggg
ctggatggcgcaggcaagagcacgttcctgcgcgtgttgtcggggaagccaccgctggaa
ggccacatccccacctggggcttcaactccgtgcgtctgcccaccaaggactttgaggtg
gacctgctagaaattgggggcagccagaacctgcgcttctactggaaggagtttgtgagc
gaggtggatgtgctggtgtttgtggtggactcggctgaccgactgcggctgccctgggcc
cgacaggagctgcacaagctgctggacaaggaccctgacctgcctgtcgtcgtggtggcc
aacaagcaggacctgagcgaggccatgagtatgggggagctgcagcgggagctgggtcta
caggctatcgataaccagcgggaggttttcctcttggcagccagcattgcccctgcagga
cccacctttgaagagcctggcaccgtgcacatctggaaactgctcttggagctcctctcc
tag

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_4|178_aa
MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAV
DPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVR
YMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_4|537_bp
atgcccaaggccaagggcaaaacccggaggcagaagtttggttacagtgtcaaccgaaag
cgtctgaaccggaatgctcgacggaaggcagcgccgcggatcgaatgctcccacatccga
catgcctgggaccacgctaaatcggtacggcagaacctggccgagatggggttggctgtg
gaccccaacagggcggtgcccctccgtaagagaaaggtgaaggccatggaggtggacata
gaggagaggcctaaagagcttgtacggaagccctatgtgctgaatgacctggaggcagaa
gccagccttccagaaaagaaaggaaatactctgtctcgggacctcattgactatgtacgc
tacatggtagagaaccacggggaggactataaggccatggcccgtgatgagaagaattac
tatcaagataccccaaaacagattcggagtaagatcaacgtctataaacgcttttaccca
gcagagtggcaagacttcctcgattctttgcagaagaggaagatggaggtggagtga

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_5|106_aa
MATPGPVIPEVPFEPSKPPVIEGLSPTVYRNPESFKEKFVRKTRENPVVPIGCLATAAAL
TYGLYSFHRGNSQRSQLMMRTRIAAQGFTVAAILLGLAVTAMKSRP

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_5|321_bp
atggcgactcccggccctgtgattccggaggtcccctttgaaccatcgaagcctccagtc
attgaggggctgagccccactgtttacaggaatccagagagtttcaaggaaaagttcgtt
cgcaagacccgcgagaacccggtggtacccataggttgcctggccacggcggccgccctc
acctacggcctctactccttccaccggggcaacagccagcgctctcagctcatgatgcgc
acccggatcgccgcccagggtttcacggtcgcagccatcttgctgggtctggctgtcact
gctatgaagtctcgaccctaa

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_6|229_aa
MADDFGFFSSSESGAPEAAEEDPAAAFLAQQESEIAGIENDEGFGAPAGSHAAPAQPGPT
SGAGSEDMGTTVNGDVFQEANGPADGYAAIAQADRLTQEPESIRKWREEQRKRLQELDAA
SKVTEQEWREKAKKDLEEWNQRQSEQVEKNKINNRIADKAFYQQPDADIIGYVASEEAFV
KESKEETPGTEWEKVAQLCDFNPKSSKQCKDVSRLRSVLMSLKQTPLSR

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_6|690_bp
atggctgatgactttggcttcttctcgtcgtcggagagcggtgccccggaggcggcggag
gaggacccggcggccgccttcctggcccagcaggagagcgagattgcaggcatagagaac
gacgagggcttcggggcacctgccggcagccatgcggcccccgcgcagccgggccccacg
agtggggctggttctgaggacatggggaccacagtcaatggagatgtgtttcaggaggcc
aacggtcctgctgatggctacgcagccattgcccaggctgacaggctgacccaggagcct
gagagcatccgcaagtggcgagaggagcagaggaaacggctgcaagagctggatgctgca
tctaaggtcacggaacaggaatggcgggagaaggccaagaaggacctggaggagtggaac
cagcgccagagtgaacaagtagagaagaacaagatcaacaaccggatcgctgacaaagca
ttctaccagcagccagatgctgatatcatcggctacgtggcatccgaggaggctttcgtg
aaggaatccaaggaggagaccccaggcacagagtgggagaaggtggcccagctatgtgac
ttcaaccccaagagcagcaagcagtgcaaagatgtgtcccgcctgcgctcggtgctcatg
tccctgaagcagacgccactgtcccgctag

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_7|196_aa
MAHVCNPSTLGGRGGRIPRTSSQSLFCLTGSTLQGDEGGAQAAESLAENHTGGQWKSVYQ
TQASLEETRGQCSASAERQGSRHGKADAGPGFQRRSHGGGRQAVRTTGGAMNGRGSGEPS
PDLPASDPEAAMERWPKSAHSFRFEGGVGNAAPVSRALRNLPRDGQELLETLGQGPATKR
LVFRNGCVSRETSKSI

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_7|591_bp
atggctcacgtctgtaatcccagcactttgggaggccgaggcgggcggatcccgagaacc
agctctcaaagccttttctgtctcaccggatcaaccctgcaaggggacgagggtggggcg
caggcggccgagagccttgctgagaatcacacaggtggtcagtggaaaagcgtgtaccag
acccaggcctctctggaagagacaaggggccagtgcagcgctagtgcggagagacaaggc
tccaggcacggaaaagcagatgccgggccggggttccagcgccgcagtcacggcggtgga
cggcaagctgtgcggacgactgggggcgccatgaatggtcgaggatccggggagccgagt
ccagatcttcctgcaagtgaccccgaagcggcaatggaacggtggccgaagtctgcccac
agcttccgcttcgaaggcggcgtcggaaatgccgcgccagtatccagagcccttcggaac
ctcccgagagatggtcaggagcttttggaaactttaggtcaggggccagcgaccaaaagg
ctggtatttcggaacggctgcgtctcccgagaaacctccaagagtatctga

>gi568815593r:176292777_176516363|GENSCAN_predicted_peptide_8|447_aa
MVLNFWAQVILLPWPPRMLGLQVEISLFALKLTGCLEEFLEDRDYILFIFCPSNIDLTGI
ESMDQCRHTLEQHNWNIEGKYARHVLLILMQLYHPTSLFNVRFALRFIRPDPRSRVTDPV
GDIVSFMHSFEEKYGRAHPVFYQGTYSQALNDAKRELRFLLVYLHGDDHQDSDEFCRNTL
CAPEVISLINTRMLFWACSTNKPEGYRVSQALRENTYPFLAMIMLKDRRMTVVGRLEGLI
QPDDLINQLTFIMDANQTYLVSERLEREERNQTQVLRQQQDEAYLASLRADQEKERKKRE
ERERKRRKEEEVQQQKLAEERRRQNLQEEKERKLECLPPEPSPDDPESVKIIFKLPNDSR
VERRFHFSQSLTDFNCALSEPEQVKYRLHILSLNASTICVTPKDFWDFDETCEGEDTEKP
VICKHLLLFPHHLWDISAVVSKWQIIN

>gi568815593r:176292777_176516363|GENSCAN_predicted_CDS_8|1344_bp
atggtcttgaacttctgggcccaagtgatcctcttaccttggcctcccagaatgctggga
ttacaggttgagatcagtctctttgccctgaagcttactggttgtttagaggagtttctt
gaggatagagactacatcttattcattttctgtccctccaacatagatctcactggcatc
gaatctatggatcagtgtcgccataccttggaacagcataactggaacatagagggaaaa
tatgcacggcatgtattactaattttaatgcagctgtatcatccaaccagcttgtttaat
gttaggtttgctcttcgttttatacggcctgaccctcgcagccgggtcactgaccccgtt
ggggacattgtttcatttatgcactcttttgaagagaaatatgggagggcacaccctgtc
ttctaccagggaacgtacagccaggcacttaacgatgccaaaagggagcttcgctttctt
ttggtttatcttcatggagatgatcaccaggactctgatgagttttgtcgcaacacactc
tgtgcacctgaagttatttcactaataaacactaggatgctcttctgggcatgctctaca
aacaaacctgagggatacagggtctcacaggctttacgagagaacacctatccattcctg
gccatgattatgctgaaggatcgaaggatgactgtggtgggacggctagaaggcctcatt
caacctgatgacctcattaaccaactgacatttatcatggatgctaaccagacttacctg
gtgtcagaacgcctagaaagggaagaaagaaaccagacccaagtgctgagacaacagcag
gatgaggcctacctggcctctctcagagctgaccaggagaaagaaagaaagaaacgggag
gagcgggagcgtaagcggcggaaggaggaggaggtgcaacagcaaaagttggcagaggag
agacggcggcagaatttacaggaggaaaaggaaaggaagttggaatgcctgccccctgaa
ccttcccctgatgaccctgaaagtgtcaagatcatcttcaaattacctaatgattctcga
gtagagagacgattccacttttcacagtctctaacagattttaactgtgccttgtcagag
ccagagcaggtcaaatacaggctgcacattttgtcacttaatgccagtacaatctgtgtt
actcctaaggacttttgggattttgatgagacctgcgagggagaagacactgagaagcca
gtgatctgcaagcatttgctcttgtttccacatcacctctgggatatttcagctgttgtt
tccaaatggcaaatcatcaactaa