GENSCAN 1.0	Date run: 5-Nov-116	Time: 03:50:15

Sequence gi568815589f:104404263_104605219 : 200957 bp : 36.63% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6004   6096   93  0  0   67   93    37 0.686   2.53
 1.02 Intr +  11115  11203   89  2  2  131  110    30 0.933   7.35
 1.03 Intr +  40574  40775  202  2  1   70   56   104 0.223   3.77
 1.04 Term +  49822  49881   60  0  0  105   51    87 0.523   3.53
 1.05 PlyA +  50024  50029    6                               1.05

 2.06 PlyA -  50135  50130    6                               1.05
 2.05 Term -  55824  55671  154  2  1   89   48   137 0.662   6.21
 2.04 Intr -  66726  66629   98  0  2   87   84    30 0.286   0.39
 2.03 Intr -  69391  69287  105  1  0   73   75    90 0.560   5.69
 2.02 Intr -  74239  74107  133  0  1   46  119    14 0.409   0.03
 2.01 Init -  91688  91495  194  2  2   67   60   168 0.361  10.39
 2.00 Prom -  97265  97226   40                              -3.75

 3.00 Prom +  98214  98253   40                              -8.95
 3.01 Init + 100319 100888  570  1  0   73   52   333 0.010  23.33
 3.02 Intr + 116745 116913  169  2  1   29   71   147 0.031   5.70
 3.03 Term + 118592 118758  167  0  2   21   38   125 0.082  -2.10
 3.04 PlyA + 119460 119465    6                               1.05

 4.02 PlyA - 119615 119610    6                               1.05
 4.01 Sngl - 122596 121991  606  1  0   87   44   205 0.427  11.94
 4.00 Prom - 123472 123433   40                              -4.95

 5.02 PlyA - 125413 125408    6                               1.05
 5.01 Sngl - 132110 131508  603  2  0   87   39   314 0.384  22.34
 5.00 Prom - 137352 137313   40                              -3.45

 6.00 Prom + 151435 151474   40                              -3.75
 6.01 Sngl + 165257 165868  612  1  0   99   42   420 0.802  34.44
 6.02 PlyA + 166340 166345    6                               1.05

 7.00 Prom + 181088 181127   40                              -3.75
 7.01 Init + 186485 186637  153  1  0   72   42    95 0.619   3.33
 7.02 Term + 188107 188268  162  0  0   14   48   214 0.941   6.95
 7.03 PlyA + 190342 190347    6                               1.05

 8.03 PlyA - 190458 190453    6                               1.05
 8.02 Term - 195005 194195  811  1  1   17   36   355 0.004  14.86
 8.01 Init - 200936 200413  524  2  2   67   89   321 0.048  24.47

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  12962  13151  190  2  1   75   47    95 0.813   0.04
S.002 Sngl - 200936 200409  528  2  0   67   48   332 0.929  22.91

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_1|147_aa
MGRFFLGLKLCFPVLHDLTSSALAYQRLPCTDVRLHNLLKMNFPNNSGPTHLGINHPIHD
RCFLPSNNRLQVLQIWILGLTSAICQGLQPQTEGCTVSFPTFEVLGLGLASLLLSLQTAY
HGTSPCDCSLRVTTIEKKNKELGKVQR

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_1|444_bp
atgggacgattcttcttaggtctaaagctctgttttccagtattgcatgacctgacctct
tcggctttggcgtaccagagattaccttgtacggatgtgagacttcacaacctgctaaaa
atgaactttcctaataactcgggacctacccatctaggaataaaccatcctattcatgac
agatgctttctgccctcgaacaacagactccaagttcttcaaatttggattcttggactt
acatcagcgatttgccagggccttcagccacagactgaaggctgcactgtcagcttccct
acttttgaggttttgggacttggactggcttccttgctcctcagcttgcagacggcctat
catgggacttcaccttgtgattgtagtttaagagtgaccacaattgagaaaaagaacaag
gaactgggcaaagttcagagatga

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_2|227_aa
MSKKNVQGTKSKRWGHKLHQKAVCSVKQEHGAKPRRLEVMGHKIQPTKDSNRQTKAYIGE
KSNHWIWRENKSGLDSFGTQKLFPVPWGMEVARTVGWGSLDFDRHQNDLAALNVAKGYRY
ELVFAKILDFEIHRVPFEDLWTVEDGGSPFAFYHDWKLPEASPEAEATILPVQPEEPCYF
TDLVHSLSQVSIRHPPYGDVINKRPSTVNSPDFPPGTRFISMNDFYE

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_2|684_bp
atgtctaagaaaaatgttcaaggaacaaagtcaaagaggtggggccacaaacttcatcaa
aaggcggtttgcagtgtgaagcaggaacacggagcaaagccaaggcgtctggaagtcatg
ggacacaagatccaaccaacaaaagattcaaatcgtcagacgaaggcctatattggtgaa
aaaagtaaccattggatttggagagaaaataaaagtgggctggactcttttggaacccag
aaacttttccctgtcccctggggcatggaggtggcaagaactgtggggtggggttcattg
gattttgataggcatcaaaatgacctggctgcacttaacgttgcaaagggctacaggtat
gagctggtatttgctaagatcttggattttgaaattcatagagttccatttgaggatcta
tggactgtggaggatggcggctccccctttgctttctaccatgattggaagcttcctgag
gcctccccagaagcagaagccactatacttcctgtacagcctgaagaaccgtgttacttc
acagacctggttcattccttgtctcaggtctccatccggcacccaccctatggagacgtc
attaataaacgtccatccacagtaaacagcccagactttccacctggcactaggtttatt
tcaatgaatgacttttatgagtga

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_3|301_aa
MGSTECVLLPMMAYDRYVAICNPLRYPVIMNRRTCVQIAAGSWMTGCLTAMVEMMSVLPL
SLCGNSIINHFTCEILAILKLVCVDTSLVQLIMLVISVLLLPMPMLLICISYAFILASIL
RISSVEGRSKAFSTCTAHLMVVVLFYGTALSMHLKPSAVDSQEIDKFMALVYAGQTPMLN
PIIYSLRNKEPPLVIPRQTGSGVDLQQTPADLQQRGLTVGRKTNKQKGIAHPLRDPIRRS
PTSKTKEKGNLHRKLRLKAAVKTTGDVQQGYAVLGHVRRIHRKRNKKDCNRFTEEFKLTG
M

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_3|906_bp
atgggctccactgagtgtgtgctcctgcccatgatggcatatgaccggtatgtggccatc
tgcaaccccctgagataccctgtcatcatgaataggagaacctgtgtgcagattgcagct
ggctcctggatgacaggctgtctcactgccatggtggaaatgatgtctgtgctgccactg
tctctctgtggtaatagcatcatcaatcatttcacttgtgaaattctggccatcttgaaa
ttggtttgtgtggacacctccctggtgcagttaatcatgctggtgatcagtgtacttctt
ctccccatgccaatgctactcatttgtatctcttatgcatttatcctcgccagtatcctg
agaatcagctcagtggaaggtcgaagtaaagccttttcaacgtgcacagcccacctgatg
gtggtagttttgttctatgggacggctctctccatgcacctgaagccctccgctgtagat
tcacaggaaatagacaaatttatggctttggtgtatgccggacaaacccccatgttgaat
cctatcatctatagtctacggaacaaagagcctccgctggtgatacccaggcaaacaggg
tctggggtggaccttcagcaaactccagcagacctgcagcagaggggcttgactgttgga
aggaaaactaacaaacagaaaggaatagcacatccactcagagaccccatccgaaggtca
ccaacatcaaagaccaaagaaaaaggaaatttacataggaaattaaggctaaaagcagct
gtaaaaacgactggtgatgtgcagcaaggatatgctgtgcttggccatgtcagaagaatc
cacagaaaaagaaacaaaaaggattgtaataggtttactgaagagtttaaactcactgga
atgtaa

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_4|201_aa
MAFDRYVAICNPLRYPIIMNKVVYVLLTSVSWLSGGINSTVQTSLAMRWPFCGNNIINHF
LCEILAVLKLACSDISVNIVTLAVSNIAFLVLPLLVIFFSYMFILYTILRTNSATGRHKA
FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLLGKDNLQATEGLVSMFYGVVTPMLNPIIYS
LRNKDVKAAIKYLLSRKAINQ

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_4|606_bp
atggcatttgatcgttatgtggccatctgtaaccctctgagataccccatcatcatgaac
aaggtggtgtatgtactgctgacttctgtatcatggctttctggtggaatcaattcaact
gtgcaaacatcacttgccatgcgatggcctttctgtgggaacaatattattaatcatttc
ttatgcgagatcttagctgtcctaaaattagcttgttctgatatatctgtcaatattgtt
accctagcagtgtcaaatattgctttcctagttcttcctctgctcgtgatttttttctcc
tatatgttcatcctctacaccatcttgcgaacgaactcggccacaggaagacacaaggca
ttttctacatgctcagctcacctgactgtggtgatcatattttatggtaccatcttcttt
atgtatgcaaaacctaagtcccaggacctccttgggaaagacaacttgcaagctacagag
gggcttgtttccatgttttatggggttgtgacccccatgttaaaccccataatctatagc
ttgagaaataaagatgtaaaagctgctataaaatatttgctgagcaggaaagctattaac
cagtaa

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_5|200_aa
MAFDRYVAICNPLRYPIILSKVAYVLMASVSWLSGGINSAVQTLLAMRLPFCGNNIINHF
ACEILAVLKLACADISLNIITMVISNMAFLVLPLMVIFFSYMFILYTILQMNSATGRRKA
FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLIGEEKLQALDKLISLFYGVVTPMLNPILYS
LRNKDVKAAVKYLLNKKPIH

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_5|603_bp
atggcatttgatcgttatgtggccatctgcaacccactgagataccccatcatcctgagc
aaggtggcgtatgtattgatggcttctgtgtcctggctgtccggtggaataaattcagct
gtgcaaacattacttgccatgagactgcctttctgtgggaataatattatcaatcatttc
gcatgtgaaatattagctgtcctcaagctggcctgtgctgatatatccctcaatattatc
accatggtgatatcaaatatggccttcctggttcttccactgatggtcatttttttctcc
tatatgttcatcctctacaccatcttgcaaatgaattcagccacaggaagacgcaaggca
ttttccacgtgctcagctcacctgactgtggtgatcatattttacggtaccatcttcttt
atgtatgcgaaaccgaagtctcaagacctgattggggaagaaaaattgcaagcattagac
aagctcatttctctgttttatggggtagtgacacccatgctgaatcctatactctatagc
ttgagaaataaggatgtaaaagctgctgtaaaatatttgctgaacaaaaaaccaattcac
taa

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_6|203_aa
MALDRYVAICYPLRYPVIMSKGAYVAMAAGSWVTGLVDSVVQTAFAMQLPFCANNVIKHF
VCEILAILKLACADISINVISMTGSNLIVLVIPLLVISISYIFIVATILRIPSTEGKHKA
FSTCSAHLTVVIIFYGTIFFMYAKPESKASVDSGNEDIIEALISLFYGVMTPMLNPLIYS
LRNKDVKAAVKNILCRKNFSDGK

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_6|612_bp
atggcactggaccgctatgtggccatctgctacccactgagataccctgtcatcatgagc
aagggtgcctatgtggccatggcagctgggtcctgggtcactgggcttgtggactcagta
gtgcagacagcttttgcaatgcagttaccattctgtgctaataatgtcattaaacatttt
gtctgtgaaattctggctatcttgaaactggcctgtgctgatatttcaatcaatgtgatt
agtatgacagggtcgaatctgattgttctggttattccattgttagtaatttccatctct
tacatatttattgttgccactattctgaggattccttccactgaaggaaaacataaggcc
ttctccacctgctcagcccacctgacagtggtgattatattctatggaaccatcttcttc
atgtacgcaaagcctgagtctaaagcctctgttgattcaggtaatgaagacatcattgag
gccctcatctcccttttctatggagtgatgactcccatgcttaatcctctcatctatagt
ctgcgaaacaaggatgtaaaggctgctgtcaaaaacatactgtgtaggaaaaacttttct
gatggaaaatga

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_7|104_aa
MTVVIVFYGTILFMYMKAKSKDSAFDKLIALFYGIVTPMLNPIIYSLRNTEYDVLEPVQN
KEDVNVEERLSSGCQSPSSMKGAYTQEEGVDPSTWLRTTHAKKS

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_7|315_bp
atgacagtggtgattgtgttttatgggacaatcctcttcatgtacatgaaggcaaagtcc
aaagactctgcttttgacaaactgattgccctgttctatggcatagtcacccccatgctc
aatcctatcatctatagcctgaggaatacagagtatgatgttttggagcctgtccagaat
aaggaagatgtcaacgtggaagagaggctcagttcagggtgtcaaagtccaagcagcatg
aaaggagcatacacacaagaggaaggtgtggatcctagcacctggttgaggaccactcat
gcaaagaaaagctga

>gi568815589f:104404263_104605219|GENSCAN_predicted_peptide_8|444_aa
MAAGSWIIGAVNSAVQSVFVVQLPFCRNNIINHFTCEILAVMKLACADISDNEFIMLVAT
TLFILTPLLLIIVSYTLIIVSIFKISSSEGRSKASSTCSAHLTVVIIFYGTILFMYMKPK
SKETLNSDDLDATDKIISMFYGVMTPMMNPLIYSLRNKDVKEAVKHLLNRRFFSNILDPH
LHTPMYFFLGNLSFLDICYTTTSIPSTLVSFLSERKTISLSGCAVQMFLSLAMGTTECVL
LGVMAFDRYVAICNPLRYPIIMSKDAYVPMAAGSWIIGAVNSAVQTVFVVQLPFCRNNII
NHFTCEILAVMKLACADISGNEFILLVTTTLFLLTPLLLIIVSYTLIILSIFKISSSEGR
SKPSSTCSARLTVVITFCGTIFLMYMKPKSQETLNSDDLDATDKLIFIFYRVMTPMMNPL
IYSLRNKDVKEAVKHLLRRKNFNK

>gi568815589f:104404263_104605219|GENSCAN_predicted_CDS_8|1335_bp
atggcagctgggtcctggatcataggagctgtcaattctgcagtacaatcagtgtttgtg
gtacaattgcctttctgcaggaataacatcatcaatcatttcacctgtgaaattctggct
gtcatgaaactggcctgtgctgacatctcagacaatgagttcatcatgcttgtggccaca
acattgttcatattgacacctttgttattaatcattgtctcttacacgttaatcattgtg
agcatcttcaaaattagctcttccgaggggagaagcaaagcttcctctacctgttcagcc
catctgactgtggtcataatattctatgggaccatcctcttcatgtacatgaagcccaag
tctaaagagacacttaattcggatgacttggatgctaccgacaaaattatatccatgttc
tatggggtgatgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtg
aaagaggcagtaaaacacctactgaacagaaggttctttagcaacatcttggaccctcac
cttcacacccctatgtacttctttctggggaacctctccttcttggacatctgctacacc
accacctctattccctccacgctagtgagcttcctttcagaaagaaagaccatttccctt
tctggctgtgcagtgcagatgttcctcagcttggccatggggacaacagagtgtgtgctt
ctgggcgtgatggcctttgaccgctatgtggctatctgcaaccctctgagatatcccatc
atcatgagtaaggatgcctatgtacccatggcagctgggtcctggatcataggagctgtc
aattctgcagtacaaacagtgtttgtggtacaattgcctttctgcaggaataacatcatc
aatcatttcacctgtgaaattctagctgtcatgaaactggcctgtgctgacatctcaggc
aatgagttcatcctgcttgtgaccacaacattgttcctattgacacctttgttattaatt
attgtctcttacacgttaatcattttgagcatcttcaaaattagctcttcggaggggaga
agcaaaccttcctctacctgctcagctcgtctgactgtggtgataacattctgtgggacc
atcttcctcatgtacatgaagcccaagtctcaagagacacttaattcagatgacttggat
gccactgacaaacttatattcatattctacagggtgatgactcccatgatgaatccttta
atctacagtcttagaaacaaggatgtgaaggaggcagtaaaacacctactgagaagaaaa
aattttaacaagtaa