GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:38:02 Sequence gi568815589r:104426256_104627209 : 200954 bp : 36.73% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 18581 18782 202 2 1 70 56 104 0.225 3.77 1.02 Term + 27829 27888 60 0 0 105 51 87 0.524 3.53 1.03 PlyA + 28031 28036 6 1.05 2.06 PlyA - 28142 28137 6 1.05 2.05 Term - 33831 33678 154 2 1 89 48 137 0.662 6.21 2.04 Intr - 44733 44636 98 0 2 87 84 30 0.286 0.39 2.03 Intr - 47398 47294 105 1 0 73 75 90 0.560 5.69 2.02 Intr - 52246 52114 133 0 1 46 119 14 0.409 0.03 2.01 Init - 69695 69502 194 2 2 67 60 168 0.361 10.39 2.00 Prom - 75272 75233 40 -3.75 3.00 Prom + 76221 76260 40 -8.95 3.01 Init + 78326 78895 570 1 0 73 52 333 0.010 23.33 3.02 Intr + 94752 94920 169 2 1 29 71 147 0.031 5.70 3.03 Term + 96599 96765 167 0 2 21 38 125 0.082 -2.10 3.04 PlyA + 97467 97472 6 1.05 4.02 PlyA - 97622 97617 6 1.05 4.01 Sngl - 100603 99998 606 1 0 87 44 205 0.427 11.94 4.00 Prom - 101479 101440 40 -4.95 5.02 PlyA - 103420 103415 6 1.05 5.01 Sngl - 110117 109515 603 2 0 87 39 314 0.384 22.34 5.00 Prom - 115359 115320 40 -3.45 6.00 Prom + 129442 129481 40 -3.75 6.01 Sngl + 143264 143875 612 1 0 99 42 420 0.802 34.44 6.02 PlyA + 144347 144352 6 1.05 7.00 Prom + 159095 159134 40 -3.75 7.01 Init + 164492 164644 153 1 0 72 42 95 0.619 3.33 7.02 Term + 166114 166275 162 0 0 14 48 214 0.941 6.95 7.03 PlyA + 168349 168354 6 1.05 8.03 PlyA - 168465 168460 6 1.05 8.02 Term - 173012 172202 811 1 1 17 36 355 0.004 14.86 8.01 Init - 179273 178420 854 2 2 38 89 510 0.027 40.26 8.00 Prom - 181450 181411 40 -6.95 9.02 PlyA - 181598 181593 6 1.05 9.01 Sngl - 191631 190993 639 0 0 83 42 318 0.553 22.63 9.00 Prom - 196674 196635 40 -7.65 10.00 Prom + 196970 197009 40 -3.65 10.01 Init + 198235 198445 211 0 1 68 57 144 0.551 8.29 10.02 Term + 198905 199446 542 0 2 32 38 251 0.519 7.93 10.03 PlyA + 200180 200185 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_1|87_aa XCFLPSNNRLQVLQIWILGLTSAICQGLQPQTEGCTVSFPTFEVLGLGLASLLLSLQTAY HGTSPCDCSLRVTTIEKKNKELGKVQR >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_1|264_bp nnatgctttctgccctcgaacaacagactccaagttcttcaaatttggattcttggactt acatcagcgatttgccagggccttcagccacagactgaaggctgcactgtcagcttccct acttttgaggttttgggacttggactggcttccttgctcctcagcttgcagacggcctat catgggacttcaccttgtgattgtagtttaagagtgaccacaattgagaaaaagaacaag gaactgggcaaagttcagagatga >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_2|227_aa MSKKNVQGTKSKRWGHKLHQKAVCSVKQEHGAKPRRLEVMGHKIQPTKDSNRQTKAYIGE KSNHWIWRENKSGLDSFGTQKLFPVPWGMEVARTVGWGSLDFDRHQNDLAALNVAKGYRY ELVFAKILDFEIHRVPFEDLWTVEDGGSPFAFYHDWKLPEASPEAEATILPVQPEEPCYF TDLVHSLSQVSIRHPPYGDVINKRPSTVNSPDFPPGTRFISMNDFYE >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_2|684_bp atgtctaagaaaaatgttcaaggaacaaagtcaaagaggtggggccacaaacttcatcaa aaggcggtttgcagtgtgaagcaggaacacggagcaaagccaaggcgtctggaagtcatg ggacacaagatccaaccaacaaaagattcaaatcgtcagacgaaggcctatattggtgaa aaaagtaaccattggatttggagagaaaataaaagtgggctggactcttttggaacccag aaacttttccctgtcccctggggcatggaggtggcaagaactgtggggtggggttcattg gattttgataggcatcaaaatgacctggctgcacttaacgttgcaaagggctacaggtat gagctggtatttgctaagatcttggattttgaaattcatagagttccatttgaggatcta tggactgtggaggatggcggctccccctttgctttctaccatgattggaagcttcctgag gcctccccagaagcagaagccactatacttcctgtacagcctgaagaaccgtgttacttc acagacctggttcattccttgtctcaggtctccatccggcacccaccctatggagacgtc attaataaacgtccatccacagtaaacagcccagactttccacctggcactaggtttatt tcaatgaatgacttttatgagtga >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_3|301_aa MGSTECVLLPMMAYDRYVAICNPLRYPVIMNRRTCVQIAAGSWMTGCLTAMVEMMSVLPL SLCGNSIINHFTCEILAILKLVCVDTSLVQLIMLVISVLLLPMPMLLICISYAFILASIL RISSVEGRSKAFSTCTAHLMVVVLFYGTALSMHLKPSAVDSQEIDKFMALVYAGQTPMLN PIIYSLRNKEPPLVIPRQTGSGVDLQQTPADLQQRGLTVGRKTNKQKGIAHPLRDPIRRS PTSKTKEKGNLHRKLRLKAAVKTTGDVQQGYAVLGHVRRIHRKRNKKDCNRFTEEFKLTG M >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_3|906_bp atgggctccactgagtgtgtgctcctgcccatgatggcatatgaccggtatgtggccatc tgcaaccccctgagataccctgtcatcatgaataggagaacctgtgtgcagattgcagct ggctcctggatgacaggctgtctcactgccatggtggaaatgatgtctgtgctgccactg tctctctgtggtaatagcatcatcaatcatttcacttgtgaaattctggccatcttgaaa ttggtttgtgtggacacctccctggtgcagttaatcatgctggtgatcagtgtacttctt ctccccatgccaatgctactcatttgtatctcttatgcatttatcctcgccagtatcctg agaatcagctcagtggaaggtcgaagtaaagccttttcaacgtgcacagcccacctgatg gtggtagttttgttctatgggacggctctctccatgcacctgaagccctccgctgtagat tcacaggaaatagacaaatttatggctttggtgtatgccggacaaacccccatgttgaat cctatcatctatagtctacggaacaaagagcctccgctggtgatacccaggcaaacaggg tctggggtggaccttcagcaaactccagcagacctgcagcagaggggcttgactgttgga aggaaaactaacaaacagaaaggaatagcacatccactcagagaccccatccgaaggtca ccaacatcaaagaccaaagaaaaaggaaatttacataggaaattaaggctaaaagcagct gtaaaaacgactggtgatgtgcagcaaggatatgctgtgcttggccatgtcagaagaatc cacagaaaaagaaacaaaaaggattgtaataggtttactgaagagtttaaactcactgga atgtaa >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_4|201_aa MAFDRYVAICNPLRYPIIMNKVVYVLLTSVSWLSGGINSTVQTSLAMRWPFCGNNIINHF LCEILAVLKLACSDISVNIVTLAVSNIAFLVLPLLVIFFSYMFILYTILRTNSATGRHKA FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLLGKDNLQATEGLVSMFYGVVTPMLNPIIYS LRNKDVKAAIKYLLSRKAINQ >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_4|606_bp atggcatttgatcgttatgtggccatctgtaaccctctgagataccccatcatcatgaac aaggtggtgtatgtactgctgacttctgtatcatggctttctggtggaatcaattcaact gtgcaaacatcacttgccatgcgatggcctttctgtgggaacaatattattaatcatttc ttatgcgagatcttagctgtcctaaaattagcttgttctgatatatctgtcaatattgtt accctagcagtgtcaaatattgctttcctagttcttcctctgctcgtgatttttttctcc tatatgttcatcctctacaccatcttgcgaacgaactcggccacaggaagacacaaggca ttttctacatgctcagctcacctgactgtggtgatcatattttatggtaccatcttcttt atgtatgcaaaacctaagtcccaggacctccttgggaaagacaacttgcaagctacagag gggcttgtttccatgttttatggggttgtgacccccatgttaaaccccataatctatagc ttgagaaataaagatgtaaaagctgctataaaatatttgctgagcaggaaagctattaac cagtaa >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_5|200_aa MAFDRYVAICNPLRYPIILSKVAYVLMASVSWLSGGINSAVQTLLAMRLPFCGNNIINHF ACEILAVLKLACADISLNIITMVISNMAFLVLPLMVIFFSYMFILYTILQMNSATGRRKA FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLIGEEKLQALDKLISLFYGVVTPMLNPILYS LRNKDVKAAVKYLLNKKPIH >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_5|603_bp atggcatttgatcgttatgtggccatctgcaacccactgagataccccatcatcctgagc aaggtggcgtatgtattgatggcttctgtgtcctggctgtccggtggaataaattcagct gtgcaaacattacttgccatgagactgcctttctgtgggaataatattatcaatcatttc gcatgtgaaatattagctgtcctcaagctggcctgtgctgatatatccctcaatattatc accatggtgatatcaaatatggccttcctggttcttccactgatggtcatttttttctcc tatatgttcatcctctacaccatcttgcaaatgaattcagccacaggaagacgcaaggca ttttccacgtgctcagctcacctgactgtggtgatcatattttacggtaccatcttcttt atgtatgcgaaaccgaagtctcaagacctgattggggaagaaaaattgcaagcattagac aagctcatttctctgttttatggggtagtgacacccatgctgaatcctatactctatagc ttgagaaataaggatgtaaaagctgctgtaaaatatttgctgaacaaaaaaccaattcac taa >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_6|203_aa MALDRYVAICYPLRYPVIMSKGAYVAMAAGSWVTGLVDSVVQTAFAMQLPFCANNVIKHF VCEILAILKLACADISINVISMTGSNLIVLVIPLLVISISYIFIVATILRIPSTEGKHKA FSTCSAHLTVVIIFYGTIFFMYAKPESKASVDSGNEDIIEALISLFYGVMTPMLNPLIYS LRNKDVKAAVKNILCRKNFSDGK >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_6|612_bp atggcactggaccgctatgtggccatctgctacccactgagataccctgtcatcatgagc aagggtgcctatgtggccatggcagctgggtcctgggtcactgggcttgtggactcagta gtgcagacagcttttgcaatgcagttaccattctgtgctaataatgtcattaaacatttt gtctgtgaaattctggctatcttgaaactggcctgtgctgatatttcaatcaatgtgatt agtatgacagggtcgaatctgattgttctggttattccattgttagtaatttccatctct tacatatttattgttgccactattctgaggattccttccactgaaggaaaacataaggcc ttctccacctgctcagcccacctgacagtggtgattatattctatggaaccatcttcttc atgtacgcaaagcctgagtctaaagcctctgttgattcaggtaatgaagacatcattgag gccctcatctcccttttctatggagtgatgactcccatgcttaatcctctcatctatagt ctgcgaaacaaggatgtaaaggctgctgtcaaaaacatactgtgtaggaaaaacttttct gatggaaaatga >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_7|104_aa MTVVIVFYGTILFMYMKAKSKDSAFDKLIALFYGIVTPMLNPIIYSLRNTEYDVLEPVQN KEDVNVEERLSSGCQSPSSMKGAYTQEEGVDPSTWLRTTHAKKS >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_7|315_bp atgacagtggtgattgtgttttatgggacaatcctcttcatgtacatgaaggcaaagtcc aaagactctgcttttgacaaactgattgccctgttctatggcatagtcacccccatgctc aatcctatcatctatagcctgaggaatacagagtatgatgttttggagcctgtccagaat aaggaagatgtcaacgtggaagagaggctcagttcagggtgtcaaagtccaagcagcatg aaaggagcatacacacaagaggaaggtgtggatcctagcacctggttgaggaccactcat gcaaagaaaagctga >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_8|554_aa MYVVILLGNGTLILISILDPHLHTPMYFFLGNLSFLDICYTTTSIPSTLVSFLSERKTIS LSGCAVQMFLGLAMGTTECVLLGMMAFDRYVAICNPLRYPIIMSKDAYVPMAAGSWIIGA VNSAVQSVFVVQLPFCRNNIINHFTCEILAVMKLACADISDNEFIMLVATTLFILTPLLL IIVSYTLIIVSIFKISSSEGRSKASSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDL DATDKIISMFYGVMTPMMNPLIYSLRNKDVKEAVKHLLNRRFFSNILDPHLHTPMYFFLG NLSFLDICYTTTSIPSTLVSFLSERKTISLSGCAVQMFLSLAMGTTECVLLGVMAFDRYV AICNPLRYPIIMSKDAYVPMAAGSWIIGAVNSAVQTVFVVQLPFCRNNIINHFTCEILAV MKLACADISGNEFILLVTTTLFLLTPLLLIIVSYTLIILSIFKISSSEGRSKPSSTCSAR LTVVITFCGTIFLMYMKPKSQETLNSDDLDATDKLIFIFYRVMTPMMNPLIYSLRNKDVK EAVKHLLRRKNFNK >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_8|1665_bp atgtatgtggtcatccttctggggaatggtactctcattttaatcagcatcttggaccct caccttcacacccctatgtacttctttctggggaacctctccttcttggacatctgctac accaccacctctattccctccacgctagtgagcttcctttcagaaagaaagaccatttcc ctttctggctgtgcagtgcagatgttcctcggcttggccatggggacaacagagtgtgtg cttctgggcatgatggcctttgaccgctatgtggctatctgcaaccctctgagatatccc atcatcatgagtaaggatgcctatgtacccatggcagctgggtcctggatcataggagct gtcaattctgcagtacaatcagtgtttgtggtacaattgcctttctgcaggaataacatc atcaatcatttcacctgtgaaattctggctgtcatgaaactggcctgtgctgacatctca gacaatgagttcatcatgcttgtggccacaacattgttcatattgacacctttgttatta atcattgtctcttacacgttaatcattgtgagcatcttcaaaattagctcttccgagggg agaagcaaagcttcctctacctgttcagcccatctgactgtggtcataatattctatggg accatcctcttcatgtacatgaagcccaagtctaaagagacacttaattcggatgacttg gatgctaccgacaaaattatatccatgttctatggggtgatgactcccatgatgaatcct ttaatctacagtcttagaaacaaggatgtgaaagaggcagtaaaacacctactgaacaga aggttctttagcaacatcttggaccctcaccttcacacccctatgtacttctttctgggg aacctctccttcttggacatctgctacaccaccacctctattccctccacgctagtgagc ttcctttcagaaagaaagaccatttccctttctggctgtgcagtgcagatgttcctcagc ttggccatggggacaacagagtgtgtgcttctgggcgtgatggcctttgaccgctatgtg gctatctgcaaccctctgagatatcccatcatcatgagtaaggatgcctatgtacccatg gcagctgggtcctggatcataggagctgtcaattctgcagtacaaacagtgtttgtggta caattgcctttctgcaggaataacatcatcaatcatttcacctgtgaaattctagctgtc atgaaactggcctgtgctgacatctcaggcaatgagttcatcctgcttgtgaccacaaca ttgttcctattgacacctttgttattaattattgtctcttacacgttaatcattttgagc atcttcaaaattagctcttcggaggggagaagcaaaccttcctctacctgctcagctcgt ctgactgtggtgataacattctgtgggaccatcttcctcatgtacatgaagcccaagtct caagagacacttaattcagatgacttggatgccactgacaaacttatattcatattctac agggtgatgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaag gaggcagtaaaacacctactgagaagaaaaaattttaacaagtaa >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_9|212_aa MGTTECVLLGMMAFDRYVAICNPLRYPIIMSKNAYVPMAVGSWFAGIVNSAVQTTFVVQL PFCRKNVINHFSCEILAVMKLACADISGNEFLMLVATILFTLMPLLLIVISYSLIISSIL KIHSSEGRSKAFSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDLDATDKIISMFYGV MTPMMNPLIYSLRNKDVKEAVKHLPNRRFFSK >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_9|639_bp atggggacaacagagtgtgtgcttctgggcatgatggcctttgaccgctatgtggctatc tgcaaccctctgagatatcccatcatcatgagcaagaatgcctatgtacccatggctgtt gggtcctggtttgcagggattgtcaactctgcagtacaaactacatttgtagtacaattg cctttctgcaggaagaatgtcatcaatcatttctcatgtgaaattctagctgtcatgaag ttggcctgtgctgacatctcaggcaatgagttcctcatgcttgtggccacaatattgttc acattgatgccactgctcttgatagttatctcttactcattaatcatttccagcatcctc aagattcactcctctgaggggagaagcaaagctttctctacctgctcagcccatctgact gtggtcataatattctatgggaccatcctcttcatgtatatgaagcccaagtctaaagag acacttaattcagatgacttggatgctaccgacaaaattatatccatgttctatggggtg atgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaaagaggca gtaaaacacctaccgaacagaaggttctttagcaagtga >gi568815589r:104426256_104627209|GENSCAN_predicted_peptide_10|250_aa MWKRLWNWVTGRRWKSLEGSEEDRKMWESLELPTDLLNGFTQNVDSNMDNKVQAEVVSDG DEKLVGSWSKAAPDVAKRGQCRARAMASEGVSLKPQKLPHGVEPASAPKSRIGVWKPLPR LQKMYGNTWMSRQKFAAGMGCSWRTSDRAVQEGKVGLEPPHRVPTGALPSGAVRRGPPPS RLQNGRSTNSLHHLPGKATDTQYQPVKVAEKEAVLCKATGSELPKTMGMHFLHQCDLDMR PESKEIILEL >gi568815589r:104426256_104627209|GENSCAN_predicted_CDS_10|753_bp atgtggaagcgactttggaactgggtaacaggcagaagatggaagagtttggagggctca gaagaagacaggaaaatgtgggaaagtttagaacttcctacagacttgctgaatggcttt acccaaaatgttgatagcaatatggataataaggtccaggctgaggtggtctcagatgga gatgagaaacttgttgggagctggagcaaagctgctccagatgtggctaaaaggggccaa tgcagagctcgggccatggcttcagagggggtaagcctcaagcctcagaagcttccacat ggtgttgagcctgcaagtgcaccgaagtcaagaattggggtttggaaacctctacctaga cttcagaagatgtatggaaacacctggatgtccaggcagaagtttgctgcagggatgggg tgctcatggagaacttctgatagggcagtgcaggagggaaaagtggggttggaaccccca cacagagtccctactggggcactgcctagtggagctgtgagaagagggccaccaccctcc agactccagaatggtagatctaccaacagcttgcaccatttgcctggaaaagccacagac actcaataccagcctgtgaaagtagctgagaaggaggctgtactctgcaaagccacaggg tcagagctgcccaagaccatgggaatgcacttcttgcatcagtgtgacctggatatgaga ccagagtcaaaggagatcattttggagctttaa