GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:41:21 Sequence gi568815597r:20014117_20219498 : 205382 bp : 46.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 386 381 6 1.05 1.02 Term - 10778 10602 177 2 0 37 49 133 0.656 1.99 1.01 Init - 13144 13085 60 1 0 74 82 30 0.507 2.25 1.00 Prom - 15404 15365 40 0.34 2.00 Prom + 39279 39318 40 -2.56 2.01 Init + 59907 59949 43 2 1 72 69 58 0.357 2.88 2.02 Term + 62645 62760 116 0 2 86 33 108 0.424 3.83 2.03 PlyA + 62803 62808 6 1.05 3.00 Prom + 68854 68893 40 -3.96 3.01 Init + 71951 72111 161 1 2 96 85 248 0.816 24.60 3.02 Intr + 75673 75779 107 1 2 54 91 101 0.999 6.86 3.03 Term + 76452 76576 125 1 2 117 47 124 0.999 9.75 3.04 PlyA + 77023 77028 6 1.05 4.08 PlyA - 77545 77540 6 1.05 4.07 Term - 79174 78997 178 0 1 23 36 280 0.981 13.46 4.06 Intr - 93587 93421 167 2 2 81 103 84 0.708 7.96 4.05 Intr - 99494 99359 136 1 1 78 22 85 0.373 1.47 4.04 Intr - 100143 100005 139 1 1 94 -17 202 0.554 9.82 4.03 Intr - 101497 101391 107 0 2 94 68 127 0.995 11.16 4.02 Intr - 102361 102217 145 2 1 77 53 150 0.988 9.74 4.01 Init - 105382 105343 40 1 1 82 105 84 0.766 7.98 4.00 Prom - 108521 108482 40 -3.66 5.00 Prom + 109104 109143 40 -4.76 5.01 Init + 112001 112014 14 1 2 72 115 8 0.145 1.52 5.02 Intr + 126050 126102 53 2 2 114 101 89 0.816 11.35 5.03 Intr + 129330 129474 145 1 1 82 82 231 0.495 21.24 5.04 Intr + 130464 130573 110 0 2 120 73 193 0.999 20.93 5.05 Intr + 134074 134240 167 2 2 94 26 274 0.979 21.48 5.06 Intr + 135677 135861 185 1 2 98 11 82 0.032 0.09 5.07 Intr + 141456 141738 283 0 1 88 49 162 0.067 9.72 5.08 Intr + 142839 143520 682 2 1 44 77 346 0.546 20.26 5.09 Intr + 143991 144260 270 1 0 39 116 122 0.276 7.61 5.10 Term + 145514 145758 245 0 2 107 42 76 0.245 0.96 5.11 PlyA + 146430 146435 6 1.05 6.04 PlyA - 148962 148957 6 1.05 6.03 Term - 150041 149875 167 0 2 110 50 148 0.857 11.28 6.02 Intr - 158781 158678 104 2 2 5 59 177 0.694 6.32 6.01 Init - 161078 160891 188 2 2 74 94 159 0.483 11.79 6.00 Prom - 161747 161708 40 -7.46 7.00 Prom + 161965 162004 40 -10.55 7.01 Init + 162638 162716 79 1 1 70 50 76 0.422 3.22 7.02 Intr + 164836 164894 59 2 2 54 86 103 0.351 5.30 7.03 Intr + 167411 167602 192 1 0 15 121 65 0.408 2.19 7.04 Intr + 167870 167962 93 1 0 69 113 64 0.983 7.16 7.05 Intr + 169068 169131 64 2 1 52 94 94 0.920 4.69 7.06 Term + 171916 172037 122 2 2 76 43 63 0.804 -0.76 7.07 PlyA + 176270 176275 6 1.05 8.00 Prom + 176289 176328 40 -8.56 8.01 Sngl + 176446 177288 843 0 0 84 47 616 0.946 52.86 8.02 PlyA + 179111 179116 6 1.05 9.03 PlyA - 180979 180974 6 1.05 9.02 Term - 189748 189617 132 1 0 85 47 63 0.582 -0.01 9.01 Init - 189915 189862 54 0 0 125 94 23 0.656 6.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 136958 137062 105 1 0 111 52 62 0.917 3.31 S.002 Init + 184867 184965 99 0 0 80 95 52 0.809 5.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_1|78_aa MNNIDRPLARFTKKRREKIQMQKTRLTKSRIIKIKTFRKISIEGTYLKIIKAIYDKPTVN IILNKEKLKAFPLRTGTR >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_1|237_bp atgaacaatattgatagaccattagcaagattcaccaagaaaagaagagagaagatccaa atgcagaaaactcgtttgacaaaatcaagaatcataaagattaaaaccttcagaaaaatc agcatagaagggacatacctcaagataataaaagccatctatgacaagcccacagtcaac attatactgaacaaggaaaagttgaaagcatttcccctgagaactggaacaagataa >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_2|52_aa MTVSAMKGKNRVTLGNARVDSISDEFYNIPETGKLPVERRIMEDFPEEEDLM >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_2|159_bp atgacggtcagtgccatgaaggggaaaaacagggtgaccttgggaaatgccagagtggac agcatatcagatgagttctataacattccagagacaggtaagctccctgtggaaagaagg atcatggaagacttcccggaggaggaggacttgatgtag >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_3|130_aa MGYLPGVPAVQGGLLDLKSMIEKVTGKNALTNYGFYGCYCGWGGRGTPKDGTDWCCWAHD HCYGRLEEKGCNIRTQSYKYRFAWGVVTCEPGPFCHVNLCACDRKLVYCLKRNLRSYNPQ YQYFPNILCS >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_3|393_bp atgggctatcttccaggtgtgcctgctgtgcaaggaggcttgctggacctaaaatcaatg atcgagaaggtgacagggaagaacgccctgacaaactacggcttctacggctgttactgc ggctggggcggccgaggaacccccaaggatggcaccgattggtgctgttgggcgcatgac cactgctatgggcggctggaggagaagggctgcaacattcgcacacagtcctacaaatac agattcgcgtggggcgtggtcacctgcgagcccgggcccttctgccatgtgaacctctgt gcctgtgaccggaagctcgtctactgcctcaagagaaacctacggagctacaacccacag taccaatactttcccaacatcctctgctcctag >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_4|303_aa MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK RNLDTYQKRLRFYWRPHCRGQTPGQALKSLQVEAMPVLNSVASLVPKATPEPRRSSINPS WVKPRQSRQEAIEVGSKCILLVGCLCWSPFQDPGLGWGERKPKLSLFPPQGFSPEMMQQL SLALRLGDKSKIPSEEEEGEGEGRGRRGGGREEEEEEEEEEEEEEEEEEEEEYPYTSYLI SPP >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_4|912_bp atggaacttgcactgctgtgtgggctggtggtgatggctggtgtgattccaatccagggc gggatcctgaacctgaacaagatggtcaagcaagtgactgggaaaatgcccatcctctcc tactggccctacggctgtcactgcggactaggtggcagaggccaacccaaagatgccacg gactggtgctgccagacccatgactgctgctatgaccacctgaagacccaggggtgcagc atctacaaggactattacagatacaacttttcccaggggaacatccactgctctgacaag ggaagctggtgtgagcagcagctgtgtgcctgtgacaaggaggtggccttctgcctgaag cgcaacctggacacctaccagaagcgactgcgtttctactggcggccccactgccggggg cagacccctgggcaagctctgaagtccctgcaggtggaggccatgcctgtcttaaactca gttgcatccctggtgcccaaagcaacaccagaaccaagaaggagctccataaatccttct tgggtgaagcctagacaaagccgccaggaggccattgaagtggggtcaaagtgcattctg ctcgtgggctgcctctgctggagccccttccaggatccagggctaggctggggtgagagg aagccaaagctgtccctgttccctcctcaaggattcagccctgagatgatgcagcagctg tcactggctctgcgcctgggtgacaagagcaaaattccatctgaagaagaagaaggagaa ggagaaggaagaggaagaagaggaggaggaagggaggaagaagaggaggaggaagaggag gaagaagaggaggaggaagaagaagaagaagaagaatatccatacacctcctacctcatc agcccaccttag >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_5|717_aa MTGTWSSLGMKKFFTVAILAGSVLSTAHGSLLNLKAMVEAVTGRSAILSFVGYGCYCGLG GRGQPKDEVDWCCHAHDCCYQELFDQGCHPYVDHYDHTIENNTEIVCSDLNKTECDKQTC MCDKNMVLCLMNQTYREEYRGFLNVYCQGPTPNCSIYEPPPEEVHLTFLLTCPSLNSGSA ASTEPAPPARKPLKPPSARTQQPPASSPGSAPGAPLSLGRESREGKNNTPSYTGKPQAII DLLQTIIQTHNPTWADCHQVLMYLFNTDERRRVLQAATKWLEEHVPADYQNPQEYVRIQL PGTDPQWDPDERQYAKAKPPGTKDYRLIQDLHLVNQATVTLHPTVPNPYTLLWLLPAEDS WFTYLGLKDAFFSIRLAPESQKLFAFQWEDPGSGVTTQYTWTQLPQGFKNSTTIFGEALA RDLQKFPTRDLGCVLLQYVDDLLLGHLMAVGCAKGMDTLLWHLEDCGYKVSKKKAQICQQ EVRYLGFTIQQGERSLGSERMQVICNLPEPKTKRQVREFLGAVGFCRLWIPNFAVLAKPL YEVIKGVSESPVERNYVEVLDTVYSSRPDLQDHPWTSVDWELYMDRSSFVNPQGERCAGY AVVTLDAVTEAKSLPQDTSAQKVKLIALIWALELSEGDRVWITDWNIASLQPRWRGPQTI VLTTPTAIKVEGIPAWIHHSHVKPKASETWEVRPSLDNPCKMTLKKMTSPAPVTPGS >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_5|2154_bp atgactggcacatggtctagcctgggtatgaagaagttcttcaccgtggccatccttgct ggcagcgttctgtccacagctcacggcagcctgctcaacctgaaggccatggtggaggcc gtcacagggaggagcgccatcctgtccttcgtgggctacggttgctactgtgggctgggg ggccgtggccagcccaaggatgaggtggactggtgctgccacgcccacgactgctgctac caggaactctttgaccaaggctgtcacccctatgtggaccactatgatcacaccatcgag aacaacactgagatagtctgcagtgacctcaacaagacagagtgtgacaagcagacatgc atgtgtgacaagaacatggttctgtgcctcatgaaccagacgtaccgagaggagtaccgt ggcttcctcaatgtctactgccagggccccacgcccaactgcagcatctatgaaccgccc cctgaggaggtgcaccttaccttcctccttacctgcccgagcctcaactcagggtcagca gcctccacggagccagccccacctgcccgaaaaccactgaagccaccctccgcccggact caacagccgccagcaagctcgcctggcagtgcccctggagcaccattgtcgctggggagg gagagccgggaagggaagaacaataccccatcctataccggaaagcctcaagctataatt gatttgctccaaactataatccagacccacaaccccacctgggctgattgccaccaggtg ctcatgtacctctttaacacagatgaaaggaggagagtgctccaagcagcaactaagtgg ctggaagaacatgttcctgctgattaccaaaacccccaagagtatgtgaggatccaatta ccaggaacagacccccagtgggacccagatgaaagacagtatgcaaaggctaaaccgcca gggaccaaggactacaggctgatacaggacttgcacttggtcaatcaagctacggtgact ttacatccaacagtacctaacccatacacattgttgtggttgctgccagctgaggacagc tggttcacctacctgggcctgaaggacgctttctttagcatcagactagcccctgagagc cagaaactgtttgcctttcagtgggaggatccagggtcaggtgtcaccactcagtacact tggacccagcttccccaagggttcaagaactccaccaccatcttcggggaggccctggct cgagacctgcaaaagtttcccaccagagacctaggctgcgtgttgctccagtatgtcgat gacctcctgctgggacacctcatggcagttgggtgtgccaagggaatggataccctgctc tggcacctggaggactgtgggtataaggtatccaagaagaaagctcagatatgtcaacag gaggtacgttacttgggatttactatccaacagggggagcgcagcctgggatcagaaaga atgcaggtcatttgcaacctaccggagcctaagaccaaaaggcaggtgagagaattctta ggagctgtggggttctgcaggttatggatcccaaactttgcagtactggccaaacctctg tacgaagtcataaagggagtgtcagagagcccagtcgaacgtaactatgtagaggtgttg gacacagtttattctagcaggcctgacctccaagaccatccttggacatcagtagactgg gagctgtacatggacaggagcagcttcgtcaacccacaaggagagaggtgtgcaggatat gcagtggtaaccctggatgctgtcactgaagccaaatcattgccccaggatacttcagcc caaaaggtcaaactcattgctttaatttgggccttagagctaagtgaaggtgatcgggta tggatcacggattggaacatagcctccttgcagccacggtggagaggaccccagaccatc gtcttgaccactcccacagccataaaggtagaaggaatcccagcctggattcaccacagc catgtaaaacctaaagcatctgagacctgggaggtgagaccaagcttggacaacccatgc aagatgactttgaagaagatgacaagccctgctccagtcacacctggaagctga >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_6|152_aa MLIATSFFLFFSSVVAAPTHSSFWQFQRRVKHITGRSAFFSYYGYGCYCGLGDKGIPVDD TDRHSPSSPSPYEKLKEFSCQPVLNSYQFHIVNGAVVCGCTLGPGASCHCRLKACECDKQ SVHCFKESLPTYEKNFKQFSSQPRCGRHKPWC >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_6|459_bp atgctcattgcaacttccttcttcctttttttctcatcggtggtggcagcccccacccac agcagtttctggcagtttcagaggagggtcaaacacatcacggggcgaagtgccttcttc tcatattacggatatggctgctactgtgggcttggggataaagggatccccgtggatgac actgacaggcacagcccctcatctccctctccctacgagaagctgaaggagttcagctgc cagcctgtgttgaacagctaccagttccacatcgtcaatggcgcagtggtttgtggatgc acccttggtcctggtgccagctgccactgcaggctgaaggcctgtgagtgtgacaagcaa tccgtgcactgcttcaaagagagcctgcccacctatgagaaaaacttcaagcagttctcc agccagcccaggtgtggcagacataagccctggtgctag >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_7|202_aa MDLDKPPLNESSKRVYQSPVDPKLFPRLGYQLLLTLSLTENHSLLQLSCFRAYPKHHREG KTQNSPSSEAGITTLLDSCFRCSSNKLLSFISQAQQPDRNGVRAASLNRKRGGDNDSGPP QNASKIPQAPEGENTWSTVAKPHNNPVLIITILEMRKLTSSEGRFRSRRKHVAPHPAALP GRPGLFTSRTEPGRRRRMERNG >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_7|609_bp atggatctggacaaacctccactcaacgaatccagcaaaagagtctatcagtctccagta gaccccaaactgtttcccaggctggggtaccagctgctgctgacgctgtccctcactgag aaccactctctgctgcagctctcctgcttccgtgcctatccaaaacaccaccgagaaggc aaaacccaaaacagcccaagcagcgaagcaggtatcacaacactcttagactcttgtttt agatgcagttctaacaagctgctcagcttcatttctcaggcccagcagccggacaggaat ggagtcagagcagcttctcttaataggaagagaggtggagacaatgattctggcccccca cagaatgccagcaagattccacaagctcctgaaggtgaaaatacttggtcaacagtagcc aagccgcacaacaatcctgtgcttatcatcaccattctagagatgcggaaactgacatcc agcgaaggccggttccggtcacgccggaagcacgtggccccccaccctgcggcgttgccc gggagaccagggttgtttaccagcaggacagagcccgggcgcaggcggcggatggagcgg aacggctag >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_8|280_aa MATEAPVNIAPPECSTVVSTAVDSLIWQPNSLNMHMIRPKSAKGRTRPSLQKSQGVEVCA HHIPSPPPAIPYELPSSQKPGACAPKSPNQGASDEIPELQQQVPTGASSSLNKYPVLPSI NRKNLEEEAVETVAKKASSLQLSSIRALYQDETGTMKTSEEDSRARACAVERKFIVRTKK QGSSRAGNLEEPSDQEPRLLLAVRSPTGQRFVRHFRPTDDLQTIVAVAEQKNKTSYRHCS IETMEVPRRRFSDLTKSLQECRIPHKSVLGISLEDGEGWP >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_8|843_bp atggccacagaagcccctgtgaatatagcaccacctgagtgtagcactgttgtcagcaca gcagttgacagcctcatttggcagccaaactcactaaatatgcacatgataaggcccaag tccgccaagggacggacaagaccgagtctgcagaaatcccagggcgtggaggtgtgcgct catcatataccatctccgcctccagccattccctatgagttgccaagcagccaaaaacca ggagcctgtgcacccaaatctccaaaccagggagcttctgatgagatccctgagctgcag cagcaagtacccactggggcttcctcttctctcaataagtatccagtccttccttccatc aacagaaagaacctggaggaggaggctgtggaaaccgttgccaaaaaggccagctcactg caactgagcagtatccgggctctttaccaagacgagacgggcaccatgaagacaagtgaa gaagattccagagctcgagcttgtgccgtggagaggaaattcatcgtccgaaccaagaaa cagggctcttccagggctggaaatctggaggaaccatcggaccaagaaccaaggttgctg cttgctgttagatcaccaacaggccaaaggtttgtacgccatttccggccaacagatgat ttgcaaaccattgttgctgtggccgaacagaaaaacaaaacctcctaccgacactgcagc attgaaacaatggaggtgcccaggaggcgattttctgacctcaccaaatctctgcaagag tgcagaatcccccacaagtctgtgctgggcatctcactggaagatggggaagggtggccc tga >gi568815597r:20014117_20219498|GENSCAN_predicted_peptide_9|61_aa MAKLLFLSESHFLLCRMELEIPPRFHRRNGTPGGHIGSSLGTFTVWTVQLRHRKNEALGK T >gi568815597r:20014117_20219498|GENSCAN_predicted_CDS_9|186_bp atggccaagttgctcttcctctctgagtcccatttcctcctctgtagaatggagctagaa attccccccaggttccaccgaagaaacggcaccccaggtggccacataggaagttctctg ggcactttcactgtctggacagtgcagcttcggcacagaaagaatgaggccttgggcaaa acatag