GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:47:59 Sequence gi568815588r:70783953_70985930 : 201978 bp : 46.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 383 378 6 1.05 1.08 Term - 2664 2531 134 1 2 117 29 57 0.388 0.95 1.07 Intr - 2936 2815 122 1 2 86 43 91 0.412 4.54 1.06 Intr - 8196 8084 113 0 2 0 93 81 0.302 -1.02 1.05 Intr - 8775 8696 80 1 2 25 32 112 0.205 -1.43 1.04 Intr - 10615 10235 381 2 0 51 75 287 0.066 18.58 1.03 Intr - 20341 20262 80 0 2 78 93 41 0.545 2.79 1.02 Intr - 27473 27395 79 0 1 100 89 25 0.190 2.41 1.01 Init - 31997 31724 274 2 1 79 18 240 0.021 11.24 1.00 Prom - 33425 33386 40 -8.76 2.00 Prom + 34518 34557 40 -4.06 2.01 Init + 44999 45055 57 1 0 49 110 48 0.681 4.41 2.02 Intr + 60521 60686 166 1 1 83 89 107 0.796 9.83 2.03 Intr + 67191 67258 68 1 2 63 115 21 0.230 0.92 2.04 Intr + 67840 67908 69 0 0 84 96 30 0.659 2.78 2.05 Intr + 70756 70903 148 0 1 17 51 158 0.707 4.81 2.06 Intr + 73662 73738 77 1 2 110 113 36 0.879 7.53 2.07 Intr + 75419 75547 129 1 0 94 75 57 0.746 5.89 2.08 Intr + 84393 84481 89 2 2 81 48 82 0.674 2.27 2.09 Intr + 85840 85945 106 1 1 76 100 71 0.648 7.32 2.10 Intr + 87096 87194 99 2 0 58 71 64 0.556 2.01 2.11 Intr + 87840 88034 195 2 0 52 105 129 0.853 10.61 2.12 Intr + 89399 89637 239 1 2 99 36 227 0.973 15.01 2.13 Intr + 91450 91596 147 1 0 70 115 87 0.997 8.95 2.14 Intr + 92589 92709 121 0 1 98 54 54 0.990 3.50 2.15 Term + 93243 93383 141 2 0 101 43 148 0.990 9.53 2.16 PlyA + 97203 97208 6 1.05 3.05 PlyA - 97218 97213 6 1.05 3.04 Term - 100096 99998 99 1 0 55 41 150 0.952 5.13 3.03 Intr - 101280 101200 81 0 0 51 113 90 0.991 7.63 3.02 Intr - 101977 101846 132 1 0 125 51 109 0.997 11.74 3.01 Init - 104581 104519 63 1 0 99 59 53 0.605 3.23 3.00 Prom - 105442 105403 40 -3.26 4.04 PlyA - 105672 105667 6 1.05 4.03 Term - 107501 107282 220 1 1 77 42 163 0.772 7.11 4.02 Intr - 109882 109747 136 2 1 70 49 68 0.306 0.73 4.01 Init - 110472 110424 49 0 1 68 78 28 0.316 1.06 4.00 Prom - 111839 111800 40 -6.66 5.00 Prom + 120414 120453 40 -4.76 5.01 Init + 131542 131595 54 0 0 103 105 58 0.832 8.30 5.02 Intr + 132794 132893 100 1 1 85 77 29 0.642 1.18 5.03 Intr + 135262 135332 71 2 2 123 45 54 0.443 3.50 5.04 Intr + 138111 138289 179 2 2 44 97 131 0.389 8.32 5.05 Intr + 142051 142132 82 1 1 49 -1 72 0.203 -5.96 5.06 Intr + 142494 142604 111 2 0 87 59 109 0.571 8.48 5.07 Intr + 143174 143266 93 1 0 100 109 38 0.974 7.26 5.08 Intr + 143817 143854 38 2 2 63 108 27 0.706 -0.74 5.09 Intr + 144852 144929 78 0 0 113 55 43 0.526 2.17 5.10 Intr + 147892 148029 138 1 0 22 59 127 0.310 2.78 5.11 Term + 148275 148314 40 0 1 69 55 38 0.176 -4.74 5.12 PlyA + 148596 148601 6 -0.45 6.09 PlyA - 152690 152685 6 1.05 6.08 Term - 154086 154007 80 1 2 70 45 63 0.352 -1.87 6.07 Intr - 154826 154628 199 2 1 113 68 90 0.900 8.42 6.06 Intr - 160505 160406 100 1 1 32 77 96 0.859 3.01 6.05 Intr - 162633 162500 134 0 2 122 85 146 0.990 17.14 6.04 Intr - 164516 164447 70 1 1 118 14 21 0.076 -3.12 6.03 Intr - 172693 172563 131 1 2 99 68 107 0.174 9.29 6.02 Intr - 182976 182924 53 1 2 116 68 58 0.037 5.23 6.01 Intr - 193713 193643 71 2 2 83 94 83 0.013 7.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:70783953_70985930|GENSCAN_predicted_peptide_1|420_aa MAPGAPARLPDPGVRGPAATSRAHAQTRAPRPVRAAYRPGSGLHPDPPVNVSTRPSWGGC GIPTVTNFSSKLPWTKNFLVTFGHLEPPERVGHARPLQGLSESYRSTVFLQVLLLVDHWV AFLNAGLDQSIPVSSRAVSPGQMTAVKKPDVLSPVRASKMTKKRNNGLTKKGHSHVQPIC CMNCARCMPKDKAIKKFVIRNIVEPAAVRDIFEARVFNAYVLSKQYMKLHYCVSCAIHSK VVRNQSHEGFKDPTLPPLDLWVLPHDPHQSPSEQGGVSAVIKKTYYTYINVSGEVETKGK NKESKSHLVNIEQAPETKNSLSEEFRSIMKEVKTICCQDAQKYGRGGFQYDFKRVTVFLQ RQAQKKPGNILTCRRMEKRHRVWKPQGTSMKPGVLRAKALGKAHLIILRDSTRHCLQAFP >gi568815588r:70783953_70985930|GENSCAN_predicted_CDS_1|1263_bp atggctccgggggcaccggcccggctcccggaccctggcgtgcgcggccccgctgctacc agccgcgcgcacgcccagacccgggccccacggcctgtgcgcgccgcctaccggccaggc agcgggctgcacccagaccctccagtgaatgtctcaaccaggccgtcctggggcggctgt ggtatcccgacggtcaccaacttctcttccaaactgccttggacaaaaaactttcttgtc acatttggtcacttggagcctcctgagagagtagggcatgctagacctttgcaaggcttg tctgagagctacagatctactgttttcctgcaggttttattgcttgtagatcattgggtg gccttcctgaatgctggcttagaccagtctatacctgtcagctccagagctgtgagtcct ggtcagatgacagctgtaaagaaaccagatgtcctctctccagtccgtgcctccaagatg acaaagaaaaggaacaacggcctcaccaaaaagggccacagccatgtgcagcctatttgc tgcatgaactgtgcccgatgcatgcccaaggacaaggctattaagaaatttgtcattcga aacatagtggagcccgcagcagtcagggacatttttgaagcgagggtcttcaatgcctat gtgctttccaagcagtatatgaagctacattactgtgtgagttgtgcaattcacagcaaa gtagtcagaaatcaatctcatgaaggcttcaaggacccaacactcccacccttagacctg tgggtgctgccccatgacccccaccaaagcccatctgaacaaggaggagtcagtgctgtc atcaagaaaacctactacacctacattaatgtgtctggagaagtggaaactaaggggaaa aataaggaaagcaaaagtcacctggtgaacattgagcaggccccggagacaaaaaactcc ttatctgaggaatttagaagtataatgaaagaagtgaagaccatctgctgccaggatgcc cagaagtatggaagagggggttttcagtatgatttcaaaagagtcaccgtattcctgcag agacaggcccagaaaaagccagggaacatcctgacctgccgaaggatggagaaaagacac agagtctggaagccccagggcaccagtatgaagccaggggtgctgagagcaaaggccctg ggaaaagcccacctcatcattctgagggacagcacccggcactgtctgcaagccttccca tag >gi568815588r:70783953_70985930|GENSCAN_predicted_peptide_2|616_aa MPFRSVLKGAGEGKYEDARKAFEPYLEILEVYSTKAKNYVNGHCTKYEPWQLIAWSVVWT LLIVWGYEFVFQPESLWSRFKKKCFKLTRKMPIIGRKRDSSKRFRNEPGYIPVKLSLRIQ IQDKLNKTKDDISKNMSFLKVDKEYVKALPSQGLSSSAVLEKLKEYSSMDAFWQEGRASG TVYSGEEKLTELLVKAYGDFAWSNPLHPDIFPGLRKIEAEIVRIACSLFNGGPDSCGCVT SGGTESILMACKAYRDLAFEKGIKTPEIVAPQSAHAAFNKAASYFGMKIVRVPLTKMMEV DVRAMRRAISRNTAMLVCSTPQFPHGVIDPVPEVAKGYKGNSLMFFVFWNKLAVKYKIPL HVDACLGGFLIVFMEKAGYPLEHPFDFRVKGVTSISADTHKYGYAPKGSSLVLYSDKKYR NYQFFVDTDWQGGIYASPTIAGSRPGGISAACWAALMHFGENGYVEATKQIIKTARFLKS ELENIKGIFVFGNPQLSVIALGSRDFDIYRLSNLMTAKGWNLNQLQFPPSIHFCITLLHA RKRVAIQFLKDIRESVTQIMKNPKAKTTGMGAIYGMAQTTVDRNMVAELSSVFLDSLYST DTVTQGSQMNGSPKPH >gi568815588r:70783953_70985930|GENSCAN_predicted_CDS_2|1851_bp atgcctttcagaagtgtcctgaagggcgctggggagggcaaatatgaggatgccaggaag gcctttgagccctacttagagattttggaagtatactccacaaaagccaagaattatgta aatggacattgcaccaagtatgagccctggcagctaattgcatggagtgtcgtgtggacc ctgctgatagtctggggatatgagtttgtcttccagccagagagtttatggtcaaggttt aaaaagaaatgttttaagctcaccaggaagatgcccattattggtcgtaagagggacagc agcaagaggttcaggaatgagcctggctatattccagtgaaactttctttacgaattcag attcaagacaagttgaacaagaccaaggatgatattagcaagaacatgtcattcctgaaa gtggacaaagagtatgtgaaagctttaccctcccagggtctgagctcatctgctgttttg gagaaacttaaggagtacagctctatggacgccttctggcaagaggggagagcctctgga acagtgtacagtggggaggagaagctcactgagctccttgtgaaggcttatggagatttt gcatggagtaaccccctgcatccagatatcttcccaggactacgcaagatagaggcagaa atcgtgaggatagcttgttccctgttcaatgggggaccagattcgtgtggatgtgtgact tctgggggaacagaaagcatactgatggcctgcaaagcatatcgggatctggcctttgag aaggggatcaaaactccagaaattgtggctccccaaagtgcccatgctgcatttaacaaa gcagccagttactttgggatgaagattgtgcgggtcccattgacgaagatgatggaggtg gatgtgcgggcaatgagaagagctatctccaggaacactgccatgctcgtctgttctacc ccacagtttcctcatggtgtaatagatcctgtccctgaagtggccaagggctataaagga aattctcttatgttctttgttttttggaacaagctggctgtcaaatacaaaatacccctt catgtcgacgcttgtctgggaggcttcctcatcgtctttatggagaaagcaggataccca ctggagcacccatttgatttccgggtgaaaggtgtaaccagcatttcagctgacacccat aagtatggctatgccccaaaaggctcatcattggtgttgtatagtgacaagaagtacagg aactatcagttcttcgtcgatacagattggcagggtggcatctatgcttccccaaccatc gcaggctcacggcctggtggcattagcgcagcctgttgggctgccttgatgcacttcggt gagaacggctatgttgaagctaccaaacagatcatcaaaactgctcgcttcctcaagtca gaactggaaaatatcaaaggcatctttgtttttgggaatccccaattgtcagtcattgct ctgggatcccgtgattttgacatctaccgactatcaaacctgatgactgctaaggggtgg aacttgaaccagttgcagttcccacccagtattcatttctgcatcacattactacacgcc cggaaacgagtagctatacaattcctaaaggacattcgagaatctgtcactcaaatcatg aagaatcctaaagcgaagaccacaggaatgggtgccatctatggcatggcccagacaact gttgacaggaatatggttgcagaattgtcctcagtcttcttggacagcttgtacagcacc gacactgtcacccagggcagccagatgaatggttctccaaaaccccactga >gi568815588r:70783953_70985930|GENSCAN_predicted_peptide_3|124_aa MVSPGVRGRDRGSGAGAAAGEAGKAHRLSAEERDQLLPNLRAVGWNELEGRDAIFKQFHF KDFNRAFGFMTRVALQAEKLDHHPEWFNVYNKVHITLSTHECAGLSERDINLASFIEQVA VSMT >gi568815588r:70783953_70985930|GENSCAN_predicted_CDS_3|375_bp atggtgagtccaggggtgcgcggccgcgatcggggcagcggggccggggcagccgcgggc gaggctggcaaagcacacaggctgagcgctgaggagagggaccagctgctgccaaacctg agggctgtggggtggaatgagctggaaggccgtgatgccatcttcaagcagtttcatttc aaagacttcaacagggcctttgggttcatgacaagagtggccctgcaggctgagaaactg gaccaccatcctgaatggtttaacgtgtacaacaaggtccacatcacgctgagcacccat gagtgtgccggcctttcagaacgggacataaacctggccagcttcatcgaacaagtagca gtgtccatgacatag >gi568815588r:70783953_70985930|GENSCAN_predicted_peptide_4|134_aa MLPQAKELQELPAAGRVQWTLPYIRGRSFMGELPTLPQRQLTWNDINIGSWSQYGFLDLP QGTRAQDPPNVGTHKGCNAGPLPSLVEGSCPTEQKLQQGQAGPGATGQSGERGRLSCQHA AVHQAADGGDKRAI >gi568815588r:70783953_70985930|GENSCAN_predicted_CDS_4|405_bp atgctgccacaagccaaggaactccaggagctaccggcagctggaagagtgcagtggacc ctgccttatatccggggcaggtccttcatgggagagcttcccactctgccccagaggcag ctgacctggaatgatattaacataggatcttggtcccagtacggtttcctggacctcccc caaggaacaagagctcaggacccaccaaatgtgggcacccacaaaggctgtaacgctggc cctctaccctcactggtggagggcagctgccccacagaacagaagctgcagcagggccaa gctggtccaggagccacgggccagagtggagaaaggggccgcctgagctgccaacatgcc gctgtccatcaggctgcagatggtggagataaaagagctatttag >gi568815588r:70783953_70985930|GENSCAN_predicted_peptide_5|327_aa MASTTSLTPRPLQDLPSLGSPMQRVSELESHLQVADSEPLVIQTEKMSPREALGTWIKVG KISLLGLRGAEGQQAPWLIFPGFGGTGAYSPFNAVKERILASAYTLWAGTVSSLSHLPRV SGDAGARSALVDFRSSCPLGGSRGLEAMNENSLSTLPVHTARDWLGSEDAETTATVLTFS WALFPSFQFPAIRRLCKAFGPQQTICSDAGPVRKTWPHPLSSLGRRFMRQSKTDDTPFGD TDKSLHISFYCHVTWGHPSISVCHTAQMRPMAAITHQPEAAAESLTQDPENLNPTVHGGL ELLSHGFLASETVLRRAPIPVMATILY >gi568815588r:70783953_70985930|GENSCAN_predicted_CDS_5|984_bp atggccagcaccacctcactcacccccagacccctccaggacctccccagcctggggagc ccaatgcagagagtgtctgagctagaaagtcacttacaggttgcagattctgagcccctg gtgatccagacggagaaaatgagtcccagagaagccctaggcacctggatcaaggtgggc aaaatcagcttgctgggcctccggggggcagaggggcagcaggcaccctggttgatattc cctggctttggaggcactggtgcatatagcccgttcaatgccgtgaaagaacgcatcttg gcttctgcatacaccttgtgggcagggacggtgtcttctctttcacatctgccccgtgtc tcaggtgatgctggagctcgatcagctcttgtggactttcgaagctcctgtccacttgga ggctccaggggcctcgaggccatgaatgagaactcactgtccacactgcccgtccacact gcccgcgactggctgggctctgaggatgcagagaccaccgccactgtcctcacgttctct tgggctctatttccatccttccagttcccagccatccggcgtctctgcaaggcctttggt ccccagcagaccatctgcagtgatgcaggcccggtccggaagacgtggcctcatccactg tcttcactaggaagaagattcatgaggcagagcaagacagatgacacgccttttggggat acggacaagtctctgcacatctctttctactgccacgtcacctggggtcatccgtccatc tcagtgtgccacactgcccagatgaggccgatggcagccatcactcatcagcctgaagca gctgcagaatccttaacccaggacccagagaacctcaacccaacggttcatggagggctg gaacttctaagccatggatttctagcatccgaaacagttttgagacgggctccaatacct gtcatggccaccatcctttactga >gi568815588r:70783953_70985930|GENSCAN_predicted_peptide_6|279_aa XRIGTFGKAVERLQRGLRPQSRAADLQQLAEECTFQVWHPNLVFQVPALAQATLECENRG PACCHMVGSMHVFPMGLDIPLVVETGGQLRASPTDVPEQELQAFLPRSKGTLAQPPTAIF ISLPEDFDCWVLFAEEKARNVKDASWEQPSSNGWTHKILSITGPGSKPQIPQKMPVIGSA CRGEPNLVQQEVPGAQEAGGKAGGMGNFEGTERPRGTKLCGINCSWLPGRYNPPNECPDL NRSCLCSSFWFLLASLREEPWACILDGHPGHPLAFSAGE >gi568815588r:70783953_70985930|GENSCAN_predicted_CDS_6|840_bp nnaagaattggcaccttcggcaaagcagtggagcgcctgcagcgaggcctcaggcctcag agcagagcagcagacctccagcagcttgcagaagaatgcacgttccaggtttggcatcca aacctggtctttcaggtgcctgccttggcgcaggccacactggagtgtgagaaccgaggg ccagcctgctgtcatatggtaggaagcatgcatgtgttccccatgggcctggacataccc ctggtggtggaaactgggggccagctgagagcatcccccactgatgtccctgagcaagaa cttcaggccttcctcccaaggtccaagggcacactggcccaaccgccaactgccatcttt atttctttgcctgaggactttgactgctgggtcctctttgctgaagagaaggccagaaat gtcaaggatgcctcctgggagcagccttcaagcaatggctggactcacaaaatcctgagc atcacggggcctggcagcaaaccccagatcccccagaaaatgcctgtgatcggatcagct tgcagaggggagcccaacttggtccagcaggaggtgccgggagcccaggaagcaggtgga aaagctgggggcatgggcaactttgagggtacagagaggccacgagggaccaagctctgc gggatcaactgttcctggctcccagggagatataaccccccaaatgagtgcccagacctc aacagaagctgtctctgctcctccttctggttcttgctggcttccttgagggaggagccc tgggcctgcatcctggacgggcacccaggtcaccccctcgccttcagtgctggtgaatga