GENSCAN 1.0    Date run: 7-Nov-116    Time: 22:52:54

Sequence gi568815582r:57156916_57384540 : 227625 bp : 45.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -   7150   7086   65  0  2   79   98    42 0.525   2.66
 1.05 Intr -  10299  10178  122  0  2  -10   88   138 0.524   3.29
 1.04 Intr -  15457  15309  149  2  2   82   67   147 0.988  11.95
 1.03 Intr -  15959  15861   99  0  0  147   63   141 0.999  17.58
 1.02 Intr -  16954  16813  142  1  1  104   69   147 0.812  14.33
 1.01 Init -  17549  17526   24  2  0   45   90     2 0.220  -4.32
 1.00 Prom -  18418  18379   40                              -3.66

 2.00 Prom +  24251  24290   40                              -6.66
 2.01 Init +  28708  28869  162  0  0   56   43   137 0.717   3.79
 2.02 Term +  29099  29254  156  1  0   84   55   126 0.961   6.83
 2.03 PlyA +  30718  30723    6                               1.05

 3.00 Prom +  39158  39197   40                              -5.36
 3.01 Init +  47861  48093  233  1  2   81   97   127 0.580  10.63
 3.02 Intr +  51143  51195   53  2  2   98   68    -9 0.481  -3.35
 3.03 Intr +  52160  52272  113  0  2   79  106    -4 0.610   0.60
 3.04 Intr +  56057  56183  127  1  1   95   93    11 0.590   2.55
 3.05 Intr +  59989  60120  132  2  0   97   84    45 0.965   5.62
 3.06 Intr +  63817  63932  116  2  2   74   92   122 0.837  11.37
 3.07 Intr +  64357  64500  144  0  0   76   84   168 0.999  15.68
 3.08 Intr +  70427  70505   79  1  1  123   74   197 0.940  20.92
 3.09 Intr +  73796  73898  103  0  1  106   75     5 0.733   0.23
 3.10 Intr +  74252  74404  153  2  0   78  116    40 0.707   4.99
 3.11 Intr +  78209  78313  105  2  0   65  110    23 0.222   1.53
 3.12 Intr +  89165  89226   62  2  2   31   94    77 0.036   0.98
 3.13 Intr +  91622  91728  107  0  2   36  102   144 0.986  10.53
 3.14 Intr +  92852  92937   86  1  2   80   79   111 0.998   8.02
 3.15 Intr +  93496  93592   97  1  1   78   92   113 0.739  10.71
 3.16 Term +  95251  95352  102  0  0   74   42    26 0.324  -5.12
 3.17 PlyA +  97371  97376    6                               1.05

 4.09 PlyA -  97705  97700    6                               1.05
 4.08 Term - 100114  99998  117  1  0  103   40   173 0.992  12.54
 4.07 Intr - 101669 101547  123  2  0   96  109   164 0.999  20.08
 4.06 Intr - 105155 104982  174  2  0   90   89   217 0.986  22.14
 4.05 Intr - 107610 107458  153  0  0   66   73    45 0.300   1.07
 4.04 Intr - 112220 112092  129  2  0   57   94    30 0.097   1.39
 4.03 Intr - 122658 122634   25  2  1   65   88    20 0.011  -2.27
 4.02 Intr - 127340 127261   80  2  2   86   49    85 0.235   2.75
 4.01 Init - 127625 127491  135  2  0  122   85   274 0.996  30.64
 4.00 Prom - 128285 128246   40                              -3.96

 5.00 Prom + 158013 158052   40                              -3.16
 5.01 Sngl + 164382 164648  267  2  0   30   47   199 0.581   5.23
 5.02 PlyA + 165347 165352    6                               1.05

 6.03 PlyA - 166403 166398    6                               1.05
 6.02 Term - 186460 186291  170  2  2   79   41    57 0.322  -1.86
 6.01 Init - 191535 191457   79  0  1   83   90    54 0.806   6.32
 6.00 Prom - 195250 195211   40                              -4.06

 7.00 Prom + 195260 195299   40                              -4.06
 7.01 Init + 201902 201974   73  1  1   68   94   141 0.999  11.94
 7.02 Term + 203522 203649  128  0  2   73   48   193 0.432  12.24
 7.03 PlyA + 204942 204947    6                               1.05

 8.00 Prom + 208947 208986   40                              -7.46
 8.01 Init + 215654 215723   70  1  1   94  119   130 0.993  15.92
 8.02 Intr + 222719 222839  121  0  1   75   53    74 0.728   2.15
 8.03 Term + 225115 226117 1003  1  1   99   43   634 0.855  51.57
 8.04 PlyA + 226421 226426    6                              -1.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 89154 89226 73 2 1 51 94 83 0.904 6.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_1|201_aa MRFTMEHQIGCFIMDGGDDGNLIIKKRFVSEAELDERRKRRQEEWEKVRKPEDPEECPEE VYDPRSLYERLQEQKDRKQQEYEEQFKFKNMVRGLDEDETNFLDEVSRQQELIEKQRREE ELKELKEYRISFVLGGKPKVGISQENKKEVEKKLTVKPIETKNKFSQAKLLAGAVKHKSS ESGNSVKRLKPDPEPDDKNQX >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_1|603_bp atgagatttactatggaacaccagattggttgtttcattatggatggaggggatgatggt aaccttattatcaaaaagaggtttgtgtctgaggcagaactagatgaacggcgcaaaagg aggcaagaagaatgggagaaagttcgaaaacctgaagatccagaagaatgtccagaggag gtttatgaccctcgatctctatatgaaaggctacaggaacagaaggacaggaagcagcag gagtacgaggaacagttcaaattcaaaaacatggtaagaggcttagatgaagatgagacc aacttccttgatgaggtttctcgacagcaggaactaatagaaaagcaacgaagagaagaa gaactgaaagaactgaaggaatacagaatatcctttgtacttggaggaaaacctaaggtt ggaatttctcaagagaacaagaaggaagtggaaaagaaactgactgtgaagcctatagaa accaagaacaagttctcccaggcgaagctgttggcaggagctgtgaagcataagagctca gagagtggcaacagtgtgaaaagactgaaaccggaccctgagccagatgacaagaatcaa gnn >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_2|105_aa MPSRGHPQLPSGWARKPQASATPDKERVRGLRTGLALTLTAAAAHLFRKEAGTQGFLFLT ATIESRPTSGDVTQSAELFLPGSPMEIRYPENEPERLDWAIHASG >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_2|318_bp atgccttcccgcgggcaccctcagctcccctcaggctgggcccgcaagccccaggcttcg gctactccggacaaggagcgggtgcgcggactgagaacaggcctggccctaaccctaaca gcagccgcagctcatctcttccgtaaggaagctggaacccagggcttcctgttcctcacc gccacaatagagtcccgccccacttccggcgacgtaacccaatccgcggagctcttcctc cccgggagcccgatggaaatccggtaccctgaaaacgagccggagagacttgattgggcc attcacgcctcaggatga >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_3|603_aa MGNSCICRDDSGTDDSVDTQQQQAENSAVPTADTRSQPRDPVRPPRRGRGPHEPRRKKQN VDGLVLDTLAVIRTLVDNDQEPPYSMITLHEMAETDEGWLDVVQSLIRVIPLEDPLGPAV ITLLLDECPLPTKDALQKLTEILNLNGEVACQDSSHPAKHRNTSAVLGCLAEKLAGENKL 
TISESSISDRLVTLESWANDPDYLKRQVGFCAQWSLDNLFLKEGRQLTYEKVNLSSIRAM LNSNDVSEYLKISPHGLEARCDASSFESVRCTFCVDAGVWYYEVTVVTSGVMQIGWATRD SKFLNHEGYGIGDDEYSCAYDGCRQLIWYNARRDTVGFLLDLNEKQMIFFLNGNQLPPEK QVFSSTVSGFFAAASFMSYQQCEFNFGAKPFKYPPSMKFSTFNDYAFLTAEEKIILPRHR RLALLKQVSIRENCCSLCCDEVADTQLKPCGHSSSASDAEFDAVVGYLEDIIMDDEFQLL QRNFMDKYYLEFEDTEENKLIYTPIFNEYISLVEKYIEEQLLQRIPEFNMAAFTTTLQHH KDEVAGDIFDMLLTFTDFLAFKEMFLDYRAEKEGRGLDLSSGLVVTSLCKSSSLPASQNN LRH >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_3|1812_bp atgggtaattcctgtatctgccgagatgacagtggaacagatgacagtgttgacacccaa cagcaacaggccgagaacagtgcagtacccactgctgacacaaggagccaaccacgggac cctgttcggccaccaaggaggggccgaggacctcatgagccaaggagaaagaaacaaaat gtggatgggctagtgttggacacactggcagtaatacggactcttgtagataatgatcag gaacctccctattcaatgataacattacacgaaatggcagaaacagatgaaggatggttg gatgttgtccagtctttaattagagttattccactggaagatccactgggaccagctgtt ataacattgttactagatgaatgtccattgcccactaaagatgcactccagaaattgact gaaattctcaatttaaatggagaagtagcttgccaggactcaagccatcctgccaaacac aggaacacatctgcagtcctaggctgcttggccgagaaactagcaggtgaaaataaattg actatttctgaatccagtattagtgaccggcttgtcacattggagtcctgggctaatgat cctgattatctgaaacgtcaagttggtttctgtgcccagtggagcttagacaatctcttt ttaaaagaaggtagacagctgacctatgagaaagtgaacttgagtagcattagggccatg ctgaatagcaatgatgtcagcgagtacctgaagatctcacctcatggcttagaggctcgc tgtgatgcctcctcttttgaaagtgtgcgttgcaccttttgtgtggatgccggggtatgg tactatgaagtaacagtggtcacttctggcgtcatgcagattggctgggccactcgagac agcaaattcctcaatcatgaaggctacggcattggggatgatgaatactcctgtgcgtat gatggctgccggcagctgatttggtacaatgccagaagagatacagtaggatttctgtta gacttgaatgaaaagcaaatgatcttctttttaaatggcaaccagctgcctcctgaaaag caagtcttttcatctactgtatctggattttttgctgcagctagtttcatgtcatatcaa caatgtgagttcaattttggagcaaaaccattcaaatacccaccatctatgaaatttagc acttttaatgactacgccttcctaacagctgaagaaaaaatcattttgccaaggcacagg cgtcttgctctgttgaagcaagtcagtatccgagaaaactgctgttccctttgttgtgat gaggtagcagacacacaattgaagccatgtggacacagctcctccgcctctgatgcagaa tttgatgctgtggttggatatttagaggacattatcatggatgacgagttccagttatta 
cagagaaatttcatggacaagtactacctggagtttgaagacacagaagagaataaactc atctacacacctatttttaatgaatacatttctttggtagaaaaatacattgaagaacag ctgctgcagcggattcctgagttcaacatggcagccttcaccacaacattacagcaccat aaggatgaagtggctggtgacatattcgacatgctgctcaccttcacagattttctggct tttaaagaaatgtttttggactacagagcagaaaaagaaggccgaggactggacttaagc agtggcttagtggtgacttcattgtgcaaatcatcttctctgccagcttcccagaacaat ctgcggcactag >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_4|311_aa MAEFPSKVSTRTSSPAQGAEASVSALRPDLGFVRSRLGALMLLQLGASRTDAPKIALHPS HHDTTPWPSLRAPPEYLGRQGVKGSAQGPSAYEELRECSSKGDLRAGLDFWGLLWEHHRL ICQGRVLVSVMVEEIGLLVPKHIVYLAYIMCRALLGTEDTAVNKTQNLPRRAGKVLGLLV WALIADTPYHLYPAYGWVMFVAVFLWLVTIVLFNLYLFQLHMKLYMVPWPLVLMIFNISA TVLYITAFIACSAAVDLTSLRGTRPYNQRAAASFFACLVMIAYGVSAFFSYQAWRGVGSN AATSQMAGGYA >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_4|936_bp atggccgagttcccgtcgaaagttagcacgcggaccagcagtcctgcgcagggcgccgaa gcctcggtgtcggcgctgcgcccggacctgggcttcgtgcgctcccgcctcggggcgctc atgctgctgcagctgggcgcttcccgcacggacgcccccaagatcgccctgcatccctct caccacgacactaccccgtggccgtcgctgcgagcgcctcctgagtatctgggacgacag ggggtgaaggggtctgcccaaggcccctcagcctatgaggagctgagggagtgcagctcc aagggagatctgagggctgggctggatttctggggcctcctgtgggagcaccaccgcctt atctgtcagggcagggttctggtctctgtcatggttgaagaaataggattgttagttcct aagcacattgtgtatttagcgtatattatgtgtcgggcactgttgggcactgaagataca gcagtgaacaaaacccagaatcttcctcggagagcagggaaggtgctggggctgctggtg tgggcgctgattgcggacaccccgtaccacctgtatccggcctatggctgggtgatgttc gtcgctgtcttcctctggctggtgacaatcgtcctcttcaacctctacctgtttcagctg cacatgaagttgtacatggttccctggccactggtgttaatgatctttaacatcagcgcc accgttctctacatcaccgccttcatcgcctgctctgcggcagttgacctgacatccctg aggggcacccggccttataaccagcgcgcggctgcctcgttctttgcgtgtttggtgatg atcgcctatggagtgagtgccttcttcagctaccaggcctggcgaggagtaggcagcaat gcggccaccagtcagatggctggcggctatgcctaa >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_5|88_aa MLYDRINDEKSRNRPFSQDGTKGKEGSSCPPKAEAKSKALKDKKAVLKGVHSHIKKKIHT SPIFPWPRRCDSRGSPDILRRAPQEKQA >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_5|267_bp 
atgctatatgatagaataaatgatgaaaaatctcgaaatagacccttttcacaagatggc accaaaggcaaagaaggaagctcctgcccccctaaagctgaagccaaatcgaaggctttg aaggacaagaaggcagtgctgaaaggtgtccacagccacataaaaaagaagatccacaca tcacccatcttcccgtggccaagacgctgcgactccagaggcagcccagatatcctcaga agagcaccccaggagaaacaagcttga >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_6|82_aa MQFWVLGVPNKELEETHMESKAKQQQETRSYQKTTPTCNCRICPLHFPCSSLKDKLVQPK ACGSLEAQDSFECGPTQMRKLS >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_6|249_bp atgcagttctgggttcttggtgtcccgaacaaagaactggaggagacacacatggaaagc aaagcaaagcagcaacaggaaacgagaagttaccagaagaccacacccacctgcaactgt cgcatctgcccactgcattttccttgttcctcactcaaggataagcttgtccaacccaag gcctgtgggtcgcttgaggcccaggacagctttgaatgtggcccaacacaaatgcgtaaa ctttcttaa >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_7|66_aa MDRLQTALLVVLVLLAVALQATEAGPYGANMEDSVCCRDYVRYRLPLRVVKHFYWTSDSC PRPGVV >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_7|201_bp atggatcgcctacagactgcactcctggttgtcctcgtcctccttgctgtggcgcttcaa gcaactgaggcaggcccctacggcgccaacatggaagacagcgtctgctgccgtgattac gtccgttaccgtctgcccctgcgcgtggtgaaacacttctactggacctcagactcctgc ccgaggcctggcgtggtgtga >gi568815582r:57156916_57384540|GENSCAN_predicted_peptide_8|397_aa MAPISLSWLLRLATFCHLTVLLAGQHHGVTKCNITCSKMTSKIPVALLIHYQQNQASCGK RAIILETRQHRLFCADPKEQWVKDAMQHLDRQAAALTRNGGTFEKQIGEVKPRTTPAAGG MDESVVLEPEATGESSSLEPTPSSQEAQRALGTSPELPTGVTGSSGTRLPPTPKAQDGGP VGTELFRVPPVSTAATWQSSAPHQPGPSLWAEAKTSEAPSTQDPSTQASTASSPAPEENA PSEGQRVWGQGQSPRPENSLEREEMGPVPAHTDAFQDWGPGSMAHVSVVPVSSEGTPSRE PVASGSWTPKAEEPIHATMDPQRLGVLITPVPDAQAATRRQAVGLLAFLGLLFCLGVAMF TYQSLQGCPRKMAGEMAEGLRYIPRSCGSNSYVLVPV >gi568815582r:57156916_57384540|GENSCAN_predicted_CDS_8|1194_bp atggctccgatatctctgtcgtggctgctccgcttggccaccttctgccatctgactgtc ctgctggctggacagcaccacggtgtgacgaaatgcaacatcacgtgcagcaagatgaca tcaaagatacctgtagctttgctcatccactatcaacagaaccaggcatcatgcggcaaa cgcgcaatcatcttggagacgagacagcacaggctgttctgtgccgacccgaaggagcaa tgggtcaaggacgcgatgcagcatctggaccgccaggctgctgccctaactcgaaatggc 
ggcaccttcgagaagcagatcggcgaggtgaagcccaggaccacccctgccgccggggga atggacgagtctgtggtcctggagcccgaagccacaggcgaaagcagtagcctggagccg actccttcttcccaggaagcacagagggccctggggacctccccagagctgccgacgggc gtgactggttcctcagggaccaggctccccccgacgccaaaggctcaggatggagggcct gtgggcacggagcttttccgagtgcctcccgtctccactgccgccacgtggcagagttct gctccccaccaacctgggcccagcctctgggctgaggcaaagacctctgaggccccgtcc acccaggacccctccacccaggcctccactgcgtcctccccagccccagaggagaatgct ccgtctgaaggccagcgtgtgtggggtcagggacagagccccaggccagagaactctctg gagcgggaggagatgggtcccgtgccagcgcacacggatgccttccaggactgggggcct ggcagcatggcccacgtctctgtggtccctgtctcctcagaagggacccccagcagggag ccagtggcttcaggcagctggacccctaaggctgaggaacccatccatgccaccatggac ccccagaggctgggcgtccttatcactcctgtccctgacgcccaggctgccacccggagg caggcggtggggctgctggccttccttggcctcctcttctgcctgggggtggccatgttc acctaccagagcctccagggctgccctcgaaagatggcaggagagatggcggagggcctt cgctacatcccccggagctgtggtagtaattcatatgtcctggtgcccgtgtga