GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:04:45 Sequence gi568815589f:104469168_104670127 : 200960 bp : 36.89% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 732 727 6 1.05 1.04 Term - 2642 2568 75 2 0 79 44 76 0.372 -0.84 1.03 Intr - 4486 4382 105 1 0 73 75 90 0.359 5.69 1.02 Intr - 9334 9202 133 0 1 46 119 14 0.330 0.03 1.01 Init - 26783 26590 194 2 2 67 60 168 0.360 10.39 1.00 Prom - 32360 32321 40 -3.75 2.00 Prom + 33309 33348 40 -8.95 2.01 Init + 35414 35983 570 1 0 73 52 333 0.010 23.33 2.02 Intr + 51840 52008 169 2 1 29 71 147 0.031 5.70 2.03 Term + 53687 53853 167 0 2 21 38 125 0.082 -2.10 2.04 PlyA + 54555 54560 6 1.05 3.02 PlyA - 54710 54705 6 1.05 3.01 Sngl - 57691 57086 606 1 0 87 44 205 0.427 11.94 3.00 Prom - 58567 58528 40 -4.95 4.02 PlyA - 60508 60503 6 1.05 4.01 Sngl - 67205 66603 603 2 0 87 39 314 0.384 22.34 4.00 Prom - 72447 72408 40 -3.45 5.00 Prom + 86530 86569 40 -3.75 5.01 Sngl + 100352 100963 612 1 0 99 42 420 0.802 34.44 5.02 PlyA + 101435 101440 6 1.05 6.00 Prom + 116183 116222 40 -3.75 6.01 Init + 121580 121732 153 1 0 72 42 95 0.619 3.33 6.02 Term + 123202 123363 162 0 0 14 48 214 0.941 6.95 6.03 PlyA + 125437 125442 6 1.05 7.03 PlyA - 125553 125548 6 1.05 7.02 Term - 130100 129290 811 1 1 17 36 355 0.004 14.86 7.01 Init - 136361 135508 854 2 2 38 89 510 0.027 40.26 7.00 Prom - 138538 138499 40 -6.95 8.02 PlyA - 138686 138681 6 1.05 8.01 Sngl - 148719 148081 639 0 0 83 42 318 0.553 22.63 8.00 Prom - 153762 153723 40 -7.65 9.00 Prom + 154058 154097 40 -3.65 9.01 Init + 155323 155533 211 0 1 68 57 144 0.551 8.29 9.02 Term + 155993 156534 542 0 2 32 38 251 0.520 7.93 9.03 PlyA + 157268 157273 6 1.05 10.02 PlyA - 157423 157418 6 1.05 10.01 Sngl - 165276 165061 216 0 0 68 53 253 0.942 14.72 10.00 Prom - 180458 180419 40 -4.05 11.03 PlyA - 181443 181438 6 1.05 11.02 Term - 187791 187614 178 2 1 65 48 167 0.270 6.58 11.01 Init - 192034 191925 110 1 2 23 84 85 0.302 1.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 165977 165724 254 0 2 7 48 242 0.851 6.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_1|168_aa MSKKNVQGTKSKRWGHKLHQKAVCSVKQEHGAKPRRLEVMGHKIQPTKDSNRQTKAYIGE KSNHWIWRENKSGLDSFGTQKLFPVPWGMEVARTVGWGSLDFDRHQNDLAALNVAKGYRY ELVFAKILDFEIHRVPFEDLWTVEGLQSSSETSVGHPVKLIAIGSQAT >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_1|507_bp atgtctaagaaaaatgttcaaggaacaaagtcaaagaggtggggccacaaacttcatcaa aaggcggtttgcagtgtgaagcaggaacacggagcaaagccaaggcgtctggaagtcatg ggacacaagatccaaccaacaaaagattcaaatcgtcagacgaaggcctatattggtgaa aaaagtaaccattggatttggagagaaaataaaagtgggctggactcttttggaacccag aaacttttccctgtcccctggggcatggaggtggcaagaactgtggggtggggttcattg gattttgataggcatcaaaatgacctggctgcacttaacgttgcaaagggctacaggtat gagctggtatttgctaagatcttggattttgaaattcatagagttccatttgaggatcta tggactgtggagggcctacaaagcagctcagaaaccagtgttggtcacccagtcaaattg attgctattggcagccaagctacatga >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_2|301_aa MGSTECVLLPMMAYDRYVAICNPLRYPVIMNRRTCVQIAAGSWMTGCLTAMVEMMSVLPL SLCGNSIINHFTCEILAILKLVCVDTSLVQLIMLVISVLLLPMPMLLICISYAFILASIL RISSVEGRSKAFSTCTAHLMVVVLFYGTALSMHLKPSAVDSQEIDKFMALVYAGQTPMLN PIIYSLRNKEPPLVIPRQTGSGVDLQQTPADLQQRGLTVGRKTNKQKGIAHPLRDPIRRS PTSKTKEKGNLHRKLRLKAAVKTTGDVQQGYAVLGHVRRIHRKRNKKDCNRFTEEFKLTG M >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_2|906_bp atgggctccactgagtgtgtgctcctgcccatgatggcatatgaccggtatgtggccatc tgcaaccccctgagataccctgtcatcatgaataggagaacctgtgtgcagattgcagct ggctcctggatgacaggctgtctcactgccatggtggaaatgatgtctgtgctgccactg tctctctgtggtaatagcatcatcaatcatttcacttgtgaaattctggccatcttgaaa ttggtttgtgtggacacctccctggtgcagttaatcatgctggtgatcagtgtacttctt ctccccatgccaatgctactcatttgtatctcttatgcatttatcctcgccagtatcctg agaatcagctcagtggaaggtcgaagtaaagccttttcaacgtgcacagcccacctgatg gtggtagttttgttctatgggacggctctctccatgcacctgaagccctccgctgtagat tcacaggaaatagacaaatttatggctttggtgtatgccggacaaacccccatgttgaat cctatcatctatagtctacggaacaaagagcctccgctggtgatacccaggcaaacaggg tctggggtggaccttcagcaaactccagcagacctgcagcagaggggcttgactgttgga aggaaaactaacaaacagaaaggaatagcacatccactcagagaccccatccgaaggtca ccaacatcaaagaccaaagaaaaaggaaatttacataggaaattaaggctaaaagcagct gtaaaaacgactggtgatgtgcagcaaggatatgctgtgcttggccatgtcagaagaatc cacagaaaaagaaacaaaaaggattgtaataggtttactgaagagtttaaactcactgga atgtaa >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_3|201_aa MAFDRYVAICNPLRYPIIMNKVVYVLLTSVSWLSGGINSTVQTSLAMRWPFCGNNIINHF LCEILAVLKLACSDISVNIVTLAVSNIAFLVLPLLVIFFSYMFILYTILRTNSATGRHKA FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLLGKDNLQATEGLVSMFYGVVTPMLNPIIYS LRNKDVKAAIKYLLSRKAINQ >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_3|606_bp atggcatttgatcgttatgtggccatctgtaaccctctgagataccccatcatcatgaac aaggtggtgtatgtactgctgacttctgtatcatggctttctggtggaatcaattcaact gtgcaaacatcacttgccatgcgatggcctttctgtgggaacaatattattaatcatttc ttatgcgagatcttagctgtcctaaaattagcttgttctgatatatctgtcaatattgtt accctagcagtgtcaaatattgctttcctagttcttcctctgctcgtgatttttttctcc tatatgttcatcctctacaccatcttgcgaacgaactcggccacaggaagacacaaggca ttttctacatgctcagctcacctgactgtggtgatcatattttatggtaccatcttcttt atgtatgcaaaacctaagtcccaggacctccttgggaaagacaacttgcaagctacagag gggcttgtttccatgttttatggggttgtgacccccatgttaaaccccataatctatagc ttgagaaataaagatgtaaaagctgctataaaatatttgctgagcaggaaagctattaac cagtaa >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_4|200_aa MAFDRYVAICNPLRYPIILSKVAYVLMASVSWLSGGINSAVQTLLAMRLPFCGNNIINHF ACEILAVLKLACADISLNIITMVISNMAFLVLPLMVIFFSYMFILYTILQMNSATGRRKA FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLIGEEKLQALDKLISLFYGVVTPMLNPILYS LRNKDVKAAVKYLLNKKPIH >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_4|603_bp atggcatttgatcgttatgtggccatctgcaacccactgagataccccatcatcctgagc aaggtggcgtatgtattgatggcttctgtgtcctggctgtccggtggaataaattcagct gtgcaaacattacttgccatgagactgcctttctgtgggaataatattatcaatcatttc gcatgtgaaatattagctgtcctcaagctggcctgtgctgatatatccctcaatattatc accatggtgatatcaaatatggccttcctggttcttccactgatggtcatttttttctcc tatatgttcatcctctacaccatcttgcaaatgaattcagccacaggaagacgcaaggca ttttccacgtgctcagctcacctgactgtggtgatcatattttacggtaccatcttcttt atgtatgcgaaaccgaagtctcaagacctgattggggaagaaaaattgcaagcattagac aagctcatttctctgttttatggggtagtgacacccatgctgaatcctatactctatagc ttgagaaataaggatgtaaaagctgctgtaaaatatttgctgaacaaaaaaccaattcac taa >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_5|203_aa MALDRYVAICYPLRYPVIMSKGAYVAMAAGSWVTGLVDSVVQTAFAMQLPFCANNVIKHF VCEILAILKLACADISINVISMTGSNLIVLVIPLLVISISYIFIVATILRIPSTEGKHKA FSTCSAHLTVVIIFYGTIFFMYAKPESKASVDSGNEDIIEALISLFYGVMTPMLNPLIYS LRNKDVKAAVKNILCRKNFSDGK >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_5|612_bp atggcactggaccgctatgtggccatctgctacccactgagataccctgtcatcatgagc aagggtgcctatgtggccatggcagctgggtcctgggtcactgggcttgtggactcagta gtgcagacagcttttgcaatgcagttaccattctgtgctaataatgtcattaaacatttt gtctgtgaaattctggctatcttgaaactggcctgtgctgatatttcaatcaatgtgatt agtatgacagggtcgaatctgattgttctggttattccattgttagtaatttccatctct tacatatttattgttgccactattctgaggattccttccactgaaggaaaacataaggcc ttctccacctgctcagcccacctgacagtggtgattatattctatggaaccatcttcttc atgtacgcaaagcctgagtctaaagcctctgttgattcaggtaatgaagacatcattgag gccctcatctcccttttctatggagtgatgactcccatgcttaatcctctcatctatagt ctgcgaaacaaggatgtaaaggctgctgtcaaaaacatactgtgtaggaaaaacttttct gatggaaaatga >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_6|104_aa MTVVIVFYGTILFMYMKAKSKDSAFDKLIALFYGIVTPMLNPIIYSLRNTEYDVLEPVQN KEDVNVEERLSSGCQSPSSMKGAYTQEEGVDPSTWLRTTHAKKS >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_6|315_bp atgacagtggtgattgtgttttatgggacaatcctcttcatgtacatgaaggcaaagtcc aaagactctgcttttgacaaactgattgccctgttctatggcatagtcacccccatgctc aatcctatcatctatagcctgaggaatacagagtatgatgttttggagcctgtccagaat aaggaagatgtcaacgtggaagagaggctcagttcagggtgtcaaagtccaagcagcatg aaaggagcatacacacaagaggaaggtgtggatcctagcacctggttgaggaccactcat gcaaagaaaagctga >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_7|554_aa MYVVILLGNGTLILISILDPHLHTPMYFFLGNLSFLDICYTTTSIPSTLVSFLSERKTIS LSGCAVQMFLGLAMGTTECVLLGMMAFDRYVAICNPLRYPIIMSKDAYVPMAAGSWIIGA VNSAVQSVFVVQLPFCRNNIINHFTCEILAVMKLACADISDNEFIMLVATTLFILTPLLL IIVSYTLIIVSIFKISSSEGRSKASSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDL DATDKIISMFYGVMTPMMNPLIYSLRNKDVKEAVKHLLNRRFFSNILDPHLHTPMYFFLG NLSFLDICYTTTSIPSTLVSFLSERKTISLSGCAVQMFLSLAMGTTECVLLGVMAFDRYV AICNPLRYPIIMSKDAYVPMAAGSWIIGAVNSAVQTVFVVQLPFCRNNIINHFTCEILAV MKLACADISGNEFILLVTTTLFLLTPLLLIIVSYTLIILSIFKISSSEGRSKPSSTCSAR LTVVITFCGTIFLMYMKPKSQETLNSDDLDATDKLIFIFYRVMTPMMNPLIYSLRNKDVK EAVKHLLRRKNFNK >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_7|1665_bp atgtatgtggtcatccttctggggaatggtactctcattttaatcagcatcttggaccct caccttcacacccctatgtacttctttctggggaacctctccttcttggacatctgctac accaccacctctattccctccacgctagtgagcttcctttcagaaagaaagaccatttcc ctttctggctgtgcagtgcagatgttcctcggcttggccatggggacaacagagtgtgtg cttctgggcatgatggcctttgaccgctatgtggctatctgcaaccctctgagatatccc atcatcatgagtaaggatgcctatgtacccatggcagctgggtcctggatcataggagct gtcaattctgcagtacaatcagtgtttgtggtacaattgcctttctgcaggaataacatc atcaatcatttcacctgtgaaattctggctgtcatgaaactggcctgtgctgacatctca gacaatgagttcatcatgcttgtggccacaacattgttcatattgacacctttgttatta atcattgtctcttacacgttaatcattgtgagcatcttcaaaattagctcttccgagggg agaagcaaagcttcctctacctgttcagcccatctgactgtggtcataatattctatggg accatcctcttcatgtacatgaagcccaagtctaaagagacacttaattcggatgacttg gatgctaccgacaaaattatatccatgttctatggggtgatgactcccatgatgaatcct ttaatctacagtcttagaaacaaggatgtgaaagaggcagtaaaacacctactgaacaga aggttctttagcaacatcttggaccctcaccttcacacccctatgtacttctttctgggg aacctctccttcttggacatctgctacaccaccacctctattccctccacgctagtgagc ttcctttcagaaagaaagaccatttccctttctggctgtgcagtgcagatgttcctcagc ttggccatggggacaacagagtgtgtgcttctgggcgtgatggcctttgaccgctatgtg gctatctgcaaccctctgagatatcccatcatcatgagtaaggatgcctatgtacccatg gcagctgggtcctggatcataggagctgtcaattctgcagtacaaacagtgtttgtggta caattgcctttctgcaggaataacatcatcaatcatttcacctgtgaaattctagctgtc atgaaactggcctgtgctgacatctcaggcaatgagttcatcctgcttgtgaccacaaca ttgttcctattgacacctttgttattaattattgtctcttacacgttaatcattttgagc atcttcaaaattagctcttcggaggggagaagcaaaccttcctctacctgctcagctcgt ctgactgtggtgataacattctgtgggaccatcttcctcatgtacatgaagcccaagtct caagagacacttaattcagatgacttggatgccactgacaaacttatattcatattctac agggtgatgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaag gaggcagtaaaacacctactgagaagaaaaaattttaacaagtaa >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_8|212_aa MGTTECVLLGMMAFDRYVAICNPLRYPIIMSKNAYVPMAVGSWFAGIVNSAVQTTFVVQL PFCRKNVINHFSCEILAVMKLACADISGNEFLMLVATILFTLMPLLLIVISYSLIISSIL KIHSSEGRSKAFSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDLDATDKIISMFYGV MTPMMNPLIYSLRNKDVKEAVKHLPNRRFFSK >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_8|639_bp atggggacaacagagtgtgtgcttctgggcatgatggcctttgaccgctatgtggctatc tgcaaccctctgagatatcccatcatcatgagcaagaatgcctatgtacccatggctgtt gggtcctggtttgcagggattgtcaactctgcagtacaaactacatttgtagtacaattg cctttctgcaggaagaatgtcatcaatcatttctcatgtgaaattctagctgtcatgaag ttggcctgtgctgacatctcaggcaatgagttcctcatgcttgtggccacaatattgttc acattgatgccactgctcttgatagttatctcttactcattaatcatttccagcatcctc aagattcactcctctgaggggagaagcaaagctttctctacctgctcagcccatctgact gtggtcataatattctatgggaccatcctcttcatgtatatgaagcccaagtctaaagag acacttaattcagatgacttggatgctaccgacaaaattatatccatgttctatggggtg atgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaaagaggca gtaaaacacctaccgaacagaaggttctttagcaagtga >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_9|250_aa MWKRLWNWVTGRRWKSLEGSEEDRKMWESLELPTDLLNGFTQNVDSNMDNKVQAEVVSDG DEKLVGSWSKAAPDVAKRGQCRARAMASEGVSLKPQKLPHGVEPASAPKSRIGVWKPLPR LQKMYGNTWMSRQKFAAGMGCSWRTSDRAVQEGKVGLEPPHRVPTGALPSGAVRRGPPPS RLQNGRSTNSLHHLPGKATDTQYQPVKVAEKEAVLCKATGSELPKTMGMHFLHQCDLDMR PESKEIILEL >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_9|753_bp atgtggaagcgactttggaactgggtaacaggcagaagatggaagagtttggagggctca gaagaagacaggaaaatgtgggaaagtttagaacttcctacagacttgctgaatggcttt acccaaaatgttgatagcaatatggataataaggtccaggctgaggtggtctcagatgga gatgagaaacttgttgggagctggagcaaagctgctccagatgtggctaaaaggggccaa tgcagagctcgggccatggcttcagagggggtaagcctcaagcctcagaagcttccacat ggtgttgagcctgcaagtgcaccgaagtcaagaattggggtttggaaacctctacctaga cttcagaagatgtatggaaacacctggatgtccaggcagaagtttgctgcagggatgggg tgctcatggagaacttctgatagggcagtgcaggagggaaaagtggggttggaaccccca cacagagtccctactggggcactgcctagtggagctgtgagaagagggccaccaccctcc agactccagaatggtagatctaccaacagcttgcaccatttgcctggaaaagccacagac actcaataccagcctgtgaaagtagctgagaaggaggctgtactctgcaaagccacaggg tcagagctgcccaagaccatgggaatgcacttcttgcatcagtgtgacctggatatgaga ccagagtcaaaggagatcattttggagctttaa >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_10|71_aa MPVFTELQAAEPERAVTFPGGSDLGTPRAKAVTSLGGLRLLASPSFRVPPPRLDASAQHG SCCGTLSPAAG >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_10|216_bp atgcccgtgttcactgagctgcaggcggcagaaccagagagagctgtaacatttcctggg ggctcagacctcgggactcccagagcaaaagctgtaacatcccttgggggtctgcggttg ctggcatctccaagttttcgggtgccaccacctcgtctagacgccagtgcccaacacgga agctgctgtggcacgctgagtccagccgcaggctga >gi568815589f:104469168_104670127|GENSCAN_predicted_peptide_11|95_aa MSSNAQRSGSHTLSNACKWLELTVMLVGAIRLGLNAITILFMYAKPKAKDSSGADKEQVT DKIISLFYGVVTPMLNPLIYSLRNKDVKAAVKSIL >gi568815589f:104469168_104670127|GENSCAN_predicted_CDS_11|288_bp atgagtagtaatgcccagagatctgggtctcatactctctccaatgcctgcaagtggttg gaactgacagtgatgctggttggggcaatcaggcttggactgaatgccataaccatcctt ttcatgtatgcaaagcccaaggctaaagactcttctggtgcagacaaagaacaagtcaca gacaaaatcatctccctgttctatggagtggtgacacctatgcttaatcctcttatctat agtttgaggaacaaagacgtgaaggcagctgtgaagagtatactgtga