GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:48:47 Sequence gi568815597f:229186485_229402974 : 216490 bp : 43.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 73 68 6 1.05 1.03 Term - 2128 2064 65 2 2 125 43 121 0.920 9.35 1.02 Intr - 11320 11214 107 0 2 101 67 46 0.501 3.66 1.01 Init - 13010 12961 50 2 2 65 94 43 0.547 1.67 1.00 Prom - 13389 13350 40 -5.56 2.02 PlyA - 17488 17483 6 1.05 2.01 Sngl - 18274 17927 348 1 0 51 53 184 0.609 7.25 2.00 Prom - 23352 23313 40 -0.36 3.06 PlyA - 30710 30705 6 1.05 3.05 Term - 33245 33181 65 0 2 94 42 47 0.313 -1.25 3.04 Intr - 35020 34967 54 2 0 118 84 8 0.493 2.35 3.03 Intr - 39226 39131 96 2 0 63 93 27 0.409 0.68 3.02 Intr - 43797 43666 132 1 0 73 65 41 0.521 0.92 3.01 Init - 48002 47942 61 2 1 76 63 79 0.944 5.61 3.00 Prom - 48725 48686 40 -5.26 4.04 PlyA - 48949 48944 6 1.05 4.03 Term - 58050 57898 153 0 0 52 47 94 0.007 -0.38 4.02 Intr - 81949 81759 191 2 2 53 91 64 0.406 2.50 4.01 Init - 84872 84476 397 2 1 93 75 188 0.444 12.87 4.00 Prom - 87384 87345 40 -2.36 5.00 Prom + 95342 95381 40 -6.06 5.01 Init + 108924 109033 110 2 2 69 81 100 0.741 7.09 5.02 Intr + 110998 111107 110 1 2 50 -21 151 0.010 0.33 5.03 Intr + 112493 112588 96 0 0 95 98 42 0.015 5.88 5.04 Intr + 124932 125047 116 1 2 25 84 113 0.003 4.77 5.05 Intr + 139962 140024 63 2 0 104 28 77 0.028 2.21 5.06 Intr + 140412 140449 38 2 2 106 68 11 0.052 -2.04 5.07 Term + 143720 144011 292 2 1 75 43 210 0.113 10.02 5.08 PlyA + 144411 144416 6 1.05 6.06 PlyA - 145308 145303 6 1.05 6.05 Term - 155007 154856 152 1 2 102 42 95 0.898 4.37 6.04 Intr - 156029 155615 415 2 1 70 80 691 0.869 60.18 6.03 Intr - 156314 156179 136 1 1 86 76 -17 0.511 -2.53 6.02 Intr - 156561 156428 134 0 2 75 94 65 0.912 5.24 6.01 Init - 158893 158795 99 1 0 63 64 61 0.559 1.46 6.00 Prom - 167962 167923 40 -5.56 7.00 Prom + 179881 179920 40 -2.86 7.01 Init + 181737 181794 58 2 1 74 88 27 0.598 2.85 7.02 Intr + 190201 190240 40 2 1 104 101 4 0.292 0.58 7.03 Intr + 191013 191224 212 0 2 101 42 59 0.150 1.16 7.04 Term + 192440 192558 119 0 2 73 44 124 0.213 5.20 7.05 PlyA + 192606 192611 6 -3.94 8.03 PlyA - 192662 192657 6 1.05 8.02 Term - 194552 194437 116 0 2 73 51 152 0.942 8.73 8.01 Intr - 211565 211407 159 0 0 132 57 -3 0.065 0.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 101902 101999 98 2 2 75 39 116 0.864 3.53 S.002 Term + 110998 111160 163 1 1 50 43 193 0.984 8.41 S.003 Term - 138927 138751 177 0 0 71 43 126 0.809 4.09 S.004 Init - 141807 141735 73 0 1 67 49 37 0.868 -1.07 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_1|73_aa MVLAGPVLAWHYCYVLRKPVKHNACCGSHLLQLTPPDPSFQLTPRSVKEAAPATQCEDDE DEDFDDDLLPLNQ >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_1|222_bp atggtcctggctggaccagtccttgcttggcactactgctatgtcctcaggaaacctgtc aagcacaatgcctgctgtgggagccaccttctccaactgacaccgcctgaccccagcttc cagctcactccccgctctgtgaaggaagcagctccagctactcaatgtgaagacgatgag gatgaagactttgatgatgatctgcttccacttaatcaatag >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_2|115_aa MSPVPQAADSVKFWLAHCLDGKQCVPHTTFVRSVTPTLVGPHCPGETYDVHMGNSESVFL DKWSSALPEASTRKPPRHSLLQALSAVLLWTGGTLSTPEEHLSGTDTTLKPVAVY >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_2|348_bp atgtctccagttccccaggccgctgactcagtgaagttctggctggcacattgcttggat ggaaaacagtgtgttccacatacaacctttgtaagaagtgtgacccccacgctggtgggg ccccactgtcctggagaaacttatgacgttcacatgggcaattctgagtctgtgttcttg gataagtggtcctccgctctgcccgaggccagcaccaggaagccacctcgccacagtctc ctgcaggcactctctgctgtcctgttatggactggaggaaccctcagcacccctgaagag catttgtctgggacagacaccaccctgaagccagttgctgtttattga >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_3|135_aa MGDENFRVLENALNNNCQRKACRSGQLGFRWSFGLHPLTAELQNPCSTLGLGFWLLEGFG FDSGGAPPRTQIPQQEGTRALAHTPSIPDRLQHALPGECGVEEGGHEQEHGCRGVPDNHH STLNLYEFDYSRYLM >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_3|408_bp atgggggatgaaaacttcagggtgctggaaaatgctcttaacaacaactgccagagaaaa gcttgcagatctgggcagttgggttttcgctggagctttgggcttcacccactgactgca gagctgcagaatccttgcagcacattggggcttgggttctggcttttggagggatttggc tttgacagtggtggagctcctccccgcacccagatcccacagcaggagggcactagagcc ttggcccacaccccatccatccccgacaggctgcagcatgccctgcctggtgagtgtggg gtggaggagggtgggcatgagcaggaacatggctgtcgtggggtccctgacaaccaccat tctactttgaatctctatgaatttgactactctaggtacctcatgtaa >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_4|246_aa MAVCDILGAAPPLAGSPAALARGPPARLGGEGPGAGDRRREGPDRSPRQPPVSQRLRPSR TPAPRRRRALHPPSGRDREEEEEMGYARPGPPRVRACARGGRGGAREDFGARRKHVRGLG ALAVCAEVGRAAGNIQSEWARLTAVMQISQPIYDLRSKVPRSDYLSTPTSRDLLMGPPRP RLTECVLSKCISTITQCCEPLKGTEIVHSRSSDFKAVACQCSQLNKALPSTTWCLTGFVC GSSCYN >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_4|741_bp atggccgtctgcgacatcttgggagcggcgccgcctctcgccgggtcaccggctgcactc gcccgcggtccgccggcgcgtctcggtggggaaggacctggggcgggggaccggaggagg gaggggcccgaccggtccccacggcaaccgccggtctcccagcggctgcggcccagccgg actccagctccgcgcaggcgcagggccctccaccctccgtccggccgcgaccgcgaggag gaagaggagatgggctacgcgaggcccggcccaccccgcgttcgcgcgtgcgcacgggga ggccggggcggggcgcgtgaggacttcggcgcgcgccggaagcacgtgcgcgggctgggc gctctggcggtgtgcgctgaggtgggcagagcggcagggaatattcagtcagaatgggct agattaactgcagtaatgcagatctctcaacccatttatgacttacgcagcaaagttcct cgctcagactacctgtccactcccacgagcagagacttgctcatgggccctcctagaccc aggctgaccgagtgtgttttatctaaatgcatctccacaatcactcagtgttgtgagccc ttaaaagggacagaaattgtgcactcgagaagctcggattttaaggcagtagcttgccaa tgctcccagctgaataaagcccttccttctacaacttggtgtctgacaggttttgtctgc ggctcgtcctgctacaattga >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_5|274_aa MDSCGDYGFHRAERWRELEEMEKIVPGSGVDEELAENRETYNALTNWLTDARMLASQNIV IILCGNKKDLDADQLMFLETSALTGENVEEAFVQCARKILNKIESGLFHGWVTLHGKGIL ELGFKETFSYVVIVESTDDKRGLSELVNPNSIYIFAWLGVTKLFWQIRHLVAVEAQYWCS EWMVMLSSDIGPLKDEERVLVTLDARHSWSRQPDLQLELRREDGMEMKATAPGVDQVTQK KHRAGRQEVQTGPGTPSFNAQAKKGEPEMGAEEG >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_5|825_bp atggattcctgtggtgactatggattccaccgcgccgagaggtggcgagaactggaagag atggagaaaattgtgccagggagtggggtagatgaggagctcgcagagaaccgagaaacc tacaatgcgcttactaattggttaacagatgcccgaatgctagcgagccagaacattgtg atcatcctttgtggaaacaagaaggacctggatgcagatcagctgatgtttttggaaaca agtgcgctcacaggggagaatgtagaagaggcttttgtacagtgtgcaagaaaaatactt aacaaaatcgaatcaggtttgttccatggctgggtgacactacacggcaaaggaatactt gagctgggcttcaaggaaacatttagctatgtggtcattgtagaaagcacagatgacaaa cgggggctgagtgaacttgtgaaccccaactccatctacatctttgcctggcttggtgtc actaaactcttctggcaaataaggcacttggttgctgtcgaggctcagtactggtgttct gagtggatggtcatgctgtcctctgacataggccccctaaaggatgaggaaagggtgctt gtcacgttggatgcaagacactcctggtcccggcagccagaccttcaactggagctcagg agagaagatggcatggagatgaaagccacggctcctggtgtggaccaagtcacccagaag aaacacagagccggaagacaggaggtgcaaacaggcccggggacaccgtcatttaatgca caggcaaagaaaggagaacccgagatgggagcagaggaagggtag >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_6|311_aa MQQEMYMTLRCQSRLLEDVTKTAQPTAHPTASYLERGPPRRHLAQEESARESELRFSCCA TRASGLGAEDRPGAAGRSPSLLRSPRPSPQPPCLPPTEQPSSAARPSTPAAPVPASALGP GRRAAGSEERLALEAADGTMSPGSGVKSEYMKRYQEPRWEEYGPCYRELLHYRLGRRLLE QAHAPWLWDDWGPAGSSEDSASSESSGAGGPAPRCAPPSPPPPVEPATQEEAERRARGAP EEQDAEAGDAEAEDAEDAALPDPKAPVALRVFSRHGPFTFFPAQRLAGLGTQLNASVQPG VTNGSFQITGF >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_6|936_bp atgcagcaggagatgtatatgaccttgagatgccagtccaggcttctggaggatgtgacc aagacggcccagcccacagcccaccccactgccagttatctggagcgcgggccgccgcgg cgacatcttgctcaggaggaaagcgcgcgggagagcgagctgcgctttagctgctgcgcc acgcgcgcctcgggcctgggcgcagaggatcggccgggcgcggcgggaaggagccccagt ctcctgcggtcccctcgccccagcccgcagcctccctgccttccgcccactgagcagccc tcctcggcggcgcgcccctcgaccccagcagccccggtccccgcctctgctcttggtccc ggccgccgggctgcgggcagcgaggagcggctggcgctcgaggcggcggacggcaccatg tccccggggagcggggtgaagagcgagtacatgaagcgctaccaggagccgcgctgggag gagtacgggccgtgctaccgcgagctgctgcactaccgcctaggccgccggctgctggag caggcgcacgcgccctggctctgggacgactggggcccggccggctcctcggaggactcg gcgtcgtcagagtcgtcgggcgccgggggccccgcaccccggtgcgccccgccctcgccc ccgccgcccgtagagccggcgacccaggaggaggcggaacggcgggcgcgcggggccccg gaggagcaggacgcggaggccggggacgcggaggccgaggacgcggaggacgcggctctg ccagatccgaaagcgccggtggcgctgcgcgtgttttcaaggcatggccccttcaccttc tttcctgcccagaggcttgcagggctggggacccagttgaacgcctcagtacagcctgga gtaaccaatggttccttccagatcactggattttaa >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_7|142_aa MVPGSASGEGFRLLLFVVRGQKDKEEPPSRPLREKGSLSLTHGGALSGHLCTGPLQGQMR QPEQNPLPAWPTKEPTTLHLAQPQPKAAPTRCAFRDLPGPLECESGITVIFPCHFKTYEY MPPVATQQYIIEHKQYLTDPSC >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_7|429_bp atggtgccaggatctgcttctggggagggcttcaggctgctgctattcgtggtgagaggg cagaaagataaagaggagcctccctccaggcctctgcgggagaagggaagtctctccctc acccatgggggagcactgagtggccacctctgcacaggccctctgcaggggcagatgagg cagcccgaacagaaccctctgcccgcctggccaaccaaagagcccacgacactccatctg gcccaaccgcagccgaaggcggcaccaactcgctgcgccttcagggacttgccagggcca cttgagtgtgaatctggcatcacagtcattttcccatgtcacttcaaaacctatgagtat atgcctccagtggccacacagcagtacatcatagagcacaagcagtacctcacagaccca tcatgctag >gi568815597f:229186485_229402974|GENSCAN_predicted_peptide_8|91_aa XVQFGEERSRISVLSPIADVISRELHFPQWRLHTSQLNPPRRLGCGILMKNAIAFGKEKS LMQDAGGKAAAPAELAGRTLDANPGCSIDVL >gi568815597f:229186485_229402974|GENSCAN_predicted_CDS_8|276_bp naagtgcagtttggagaggaaagaagccgtatctcagtcctttcaccaatagctgatgta atctcaagagagttgcatttcccacagtggagacttcacacatcacagctaaatcctcca agacgactcggctgtggaatcctaatgaagaatgcaatagcattcggcaaagagaagtca ctcatgcaggatgcgggcggcaaggcggcggcccctgcagagctagcaggcaggaccctg gatgctaacccaggatgcagcattgatgtcctctga