GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:25:23 Sequence gi568815582f:57145368_57352264 : 206897 bp : 45.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 646 954 309 0 0 40 92 571 0.895 48.01 1.02 Term + 2184 2291 108 2 0 112 49 117 0.998 8.71 1.03 PlyA + 2578 2583 6 1.05 2.08 PlyA - 3277 3272 6 1.05 2.07 Term - 9068 8923 146 0 2 75 29 241 0.679 14.97 2.06 Intr - 18698 18634 65 1 2 79 98 42 0.935 2.66 2.05 Intr - 21847 21726 122 1 2 -10 88 138 0.603 3.29 2.04 Intr - 27005 26857 149 0 2 82 67 147 0.991 11.95 2.03 Intr - 27507 27409 99 1 0 147 63 141 0.999 17.58 2.02 Intr - 28502 28361 142 2 1 104 69 147 0.812 14.33 2.01 Init - 29097 29074 24 0 0 45 90 2 0.220 -4.32 2.00 Prom - 29966 29927 40 -3.66 3.00 Prom + 35799 35838 40 -6.66 3.01 Init + 40256 40417 162 1 0 56 43 137 0.717 3.79 3.02 Term + 40647 40802 156 2 0 84 55 126 0.961 6.83 3.03 PlyA + 42266 42271 6 1.05 4.00 Prom + 50706 50745 40 -5.36 4.01 Init + 59409 59641 233 2 2 81 97 127 0.580 10.63 4.02 Intr + 62691 62743 53 0 2 98 68 -9 0.481 -3.35 4.03 Intr + 63708 63820 113 1 2 79 106 -4 0.610 0.60 4.04 Intr + 67605 67731 127 2 1 95 93 11 0.590 2.55 4.05 Intr + 71537 71668 132 0 0 97 84 45 0.965 5.62 4.06 Intr + 75365 75480 116 0 2 74 92 122 0.837 11.37 4.07 Intr + 75905 76048 144 1 0 76 84 168 0.999 15.68 4.08 Intr + 81975 82053 79 2 1 123 74 197 0.940 20.92 4.09 Intr + 85344 85446 103 1 1 106 75 5 0.733 0.23 4.10 Intr + 85800 85952 153 0 0 78 116 40 0.707 4.99 4.11 Intr + 89757 89861 105 0 0 65 110 23 0.222 1.53 4.12 Intr + 100713 100774 62 0 2 31 94 77 0.036 0.98 4.13 Intr + 103170 103276 107 1 2 36 102 144 0.986 10.53 4.14 Intr + 104400 104485 86 2 2 80 79 111 0.998 8.02 4.15 Intr + 105044 105140 97 2 1 78 92 113 0.739 10.71 4.16 Term + 106799 106900 102 1 0 74 42 26 0.324 -5.12 4.17 PlyA + 108919 108924 6 1.05 5.09 PlyA - 109253 109248 6 1.05 5.08 Term - 111662 111546 117 2 0 103 40 173 0.992 12.54 5.07 Intr - 113217 113095 123 0 0 96 109 164 0.999 20.08 5.06 Intr - 116703 116530 174 0 0 90 89 217 0.986 22.14 5.05 Intr - 119158 119006 153 1 0 66 73 45 0.300 1.07 5.04 Intr - 123768 123640 129 0 0 57 94 30 0.097 1.39 5.03 Intr - 134206 134182 25 0 1 65 88 20 0.011 -2.27 5.02 Intr - 138888 138809 80 0 2 86 49 85 0.235 2.75 5.01 Init - 139173 139039 135 0 0 122 85 274 0.996 30.64 5.00 Prom - 139833 139794 40 -3.96 6.00 Prom + 169561 169600 40 -3.16 6.01 Sngl + 175930 176196 267 0 0 30 47 199 0.581 5.23 6.02 PlyA + 176895 176900 6 1.05 7.03 PlyA - 177951 177946 6 1.05 7.02 Term - 198008 197839 170 0 2 79 41 57 0.324 -1.86 7.01 Init - 203083 203005 79 1 1 83 90 54 0.794 6.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100702 100774 73 0 1 51 94 83 0.904 6.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:57145368_57352264|GENSCAN_predicted_peptide_1|138_aa MKEGGRRGARASTLLSAPLAPSLQQYFILLIITDGVISDMEETRHAVVQASKLPMSIIIV GVGNADFAAMEFLDGDSRMLRSHTGEEAARDIVQFVPFREFRNAAKETLAKAVLAELPQQ VVQYFKHKNLPPTNSEPA >gi568815582f:57145368_57352264|GENSCAN_predicted_CDS_1|417_bp atgaaggagggagggagaaggggtgccagagcctcgaccttgctgagtgccccgttggcc ccctccctgcagcagtacttcatcctcctcatcatcacggacggggtcatcagtgacatg gaggagacacggcatgccgtggtgcaggcttccaagctgcccatgtccatcatcatcgtg ggcgtgggcaatgcggacttcgctgccatggagttcctggatggggacagccgcatgctg cgctcccacacgggggaggaggcagcccgcgatattgtgcagttcgttccctttcgagag ttccgcaacgcagcaaaagagaccttggccaaagctgtgctggcggagctgccccaacaa gttgtgcagtatttcaagcataaaaacctgccccccaccaactcggagcccgcctga >gi568815582f:57145368_57352264|GENSCAN_predicted_peptide_2|248_aa MRFTMEHQIGCFIMDGGDDGNLIIKKRFVSEAELDERRKRRQEEWEKVRKPEDPEECPEE VYDPRSLYERLQEQKDRKQQEYEEQFKFKNMVRGLDEDETNFLDEVSRQQELIEKQRREE ELKELKEYRISFVLGGKPKVGISQENKKEVEKKLTVKPIETKNKFSQAKLLAGAVKHKSS ESGNSVKRLKPDPEPDDKNQVCIGILPGLGAYSGSSDSESSSDSEGTINATGKIVSSIFR TNTFLEAP >gi568815582f:57145368_57352264|GENSCAN_predicted_CDS_2|747_bp atgagatttactatggaacaccagattggttgtttcattatggatggaggggatgatggt aaccttattatcaaaaagaggtttgtgtctgaggcagaactagatgaacggcgcaaaagg aggcaagaagaatgggagaaagttcgaaaacctgaagatccagaagaatgtccagaggag gtttatgaccctcgatctctatatgaaaggctacaggaacagaaggacaggaagcagcag gagtacgaggaacagttcaaattcaaaaacatggtaagaggcttagatgaagatgagacc aacttccttgatgaggtttctcgacagcaggaactaatagaaaagcaacgaagagaagaa gaactgaaagaactgaaggaatacagaatatcctttgtacttggaggaaaacctaaggtt ggaatttctcaagagaacaagaaggaagtggaaaagaaactgactgtgaagcctatagaa accaagaacaagttctcccaggcgaagctgttggcaggagctgtgaagcataagagctca gagagtggcaacagtgtgaaaagactgaaaccggaccctgagccagatgacaagaatcaa gtatgtatcggcatcctcccaggcctgggtgcctactctgggagcagcgactccgagtcc agctcagacagcgaaggcaccatcaatgccaccggaaagattgtctcctccatcttccga accaacaccttcctcgaggccccctag >gi568815582f:57145368_57352264|GENSCAN_predicted_peptide_3|105_aa MPSRGHPQLPSGWARKPQASATPDKERVRGLRTGLALTLTAAAAHLFRKEAGTQGFLFLT ATIESRPTSGDVTQSAELFLPGSPMEIRYPENEPERLDWAIHASG >gi568815582f:57145368_57352264|GENSCAN_predicted_CDS_3|318_bp atgccttcccgcgggcaccctcagctcccctcaggctgggcccgcaagccccaggcttcg gctactccggacaaggagcgggtgcgcggactgagaacaggcctggccctaaccctaaca gcagccgcagctcatctcttccgtaaggaagctggaacccagggcttcctgttcctcacc gccacaatagagtcccgccccacttccggcgacgtaacccaatccgcggagctcttcctc cccgggagcccgatggaaatccggtaccctgaaaacgagccggagagacttgattgggcc attcacgcctcaggatga >gi568815582f:57145368_57352264|GENSCAN_predicted_peptide_4|603_aa MGNSCICRDDSGTDDSVDTQQQQAENSAVPTADTRSQPRDPVRPPRRGRGPHEPRRKKQN VDGLVLDTLAVIRTLVDNDQEPPYSMITLHEMAETDEGWLDVVQSLIRVIPLEDPLGPAV ITLLLDECPLPTKDALQKLTEILNLNGEVACQDSSHPAKHRNTSAVLGCLAEKLAGENKL TISESSISDRLVTLESWANDPDYLKRQVGFCAQWSLDNLFLKEGRQLTYEKVNLSSIRAM LNSNDVSEYLKISPHGLEARCDASSFESVRCTFCVDAGVWYYEVTVVTSGVMQIGWATRD SKFLNHEGYGIGDDEYSCAYDGCRQLIWYNARRDTVGFLLDLNEKQMIFFLNGNQLPPEK QVFSSTVSGFFAAASFMSYQQCEFNFGAKPFKYPPSMKFSTFNDYAFLTAEEKIILPRHR RLALLKQVSIRENCCSLCCDEVADTQLKPCGHSSSASDAEFDAVVGYLEDIIMDDEFQLL QRNFMDKYYLEFEDTEENKLIYTPIFNEYISLVEKYIEEQLLQRIPEFNMAAFTTTLQHH KDEVAGDIFDMLLTFTDFLAFKEMFLDYRAEKEGRGLDLSSGLVVTSLCKSSSLPASQNN LRH >gi568815582f:57145368_57352264|GENSCAN_predicted_CDS_4|1812_bp atgggtaattcctgtatctgccgagatgacagtggaacagatgacagtgttgacacccaa cagcaacaggccgagaacagtgcagtacccactgctgacacaaggagccaaccacgggac cctgttcggccaccaaggaggggccgaggacctcatgagccaaggagaaagaaacaaaat gtggatgggctagtgttggacacactggcagtaatacggactcttgtagataatgatcag gaacctccctattcaatgataacattacacgaaatggcagaaacagatgaaggatggttg gatgttgtccagtctttaattagagttattccactggaagatccactgggaccagctgtt ataacattgttactagatgaatgtccattgcccactaaagatgcactccagaaattgact gaaattctcaatttaaatggagaagtagcttgccaggactcaagccatcctgccaaacac aggaacacatctgcagtcctaggctgcttggccgagaaactagcaggtgaaaataaattg actatttctgaatccagtattagtgaccggcttgtcacattggagtcctgggctaatgat cctgattatctgaaacgtcaagttggtttctgtgcccagtggagcttagacaatctcttt ttaaaagaaggtagacagctgacctatgagaaagtgaacttgagtagcattagggccatg ctgaatagcaatgatgtcagcgagtacctgaagatctcacctcatggcttagaggctcgc tgtgatgcctcctcttttgaaagtgtgcgttgcaccttttgtgtggatgccggggtatgg tactatgaagtaacagtggtcacttctggcgtcatgcagattggctgggccactcgagac agcaaattcctcaatcatgaaggctacggcattggggatgatgaatactcctgtgcgtat gatggctgccggcagctgatttggtacaatgccagaagagatacagtaggatttctgtta gacttgaatgaaaagcaaatgatcttctttttaaatggcaaccagctgcctcctgaaaag caagtcttttcatctactgtatctggattttttgctgcagctagtttcatgtcatatcaa caatgtgagttcaattttggagcaaaaccattcaaatacccaccatctatgaaatttagc acttttaatgactacgccttcctaacagctgaagaaaaaatcattttgccaaggcacagg cgtcttgctctgttgaagcaagtcagtatccgagaaaactgctgttccctttgttgtgat gaggtagcagacacacaattgaagccatgtggacacagctcctccgcctctgatgcagaa tttgatgctgtggttggatatttagaggacattatcatggatgacgagttccagttatta cagagaaatttcatggacaagtactacctggagtttgaagacacagaagagaataaactc atctacacacctatttttaatgaatacatttctttggtagaaaaatacattgaagaacag ctgctgcagcggattcctgagttcaacatggcagccttcaccacaacattacagcaccat aaggatgaagtggctggtgacatattcgacatgctgctcaccttcacagattttctggct tttaaagaaatgtttttggactacagagcagaaaaagaaggccgaggactggacttaagc agtggcttagtggtgacttcattgtgcaaatcatcttctctgccagcttcccagaacaat ctgcggcactag >gi568815582f:57145368_57352264|GENSCAN_predicted_peptide_5|311_aa MAEFPSKVSTRTSSPAQGAEASVSALRPDLGFVRSRLGALMLLQLGASRTDAPKIALHPS HHDTTPWPSLRAPPEYLGRQGVKGSAQGPSAYEELRECSSKGDLRAGLDFWGLLWEHHRL ICQGRVLVSVMVEEIGLLVPKHIVYLAYIMCRALLGTEDTAVNKTQNLPRRAGKVLGLLV WALIADTPYHLYPAYGWVMFVAVFLWLVTIVLFNLYLFQLHMKLYMVPWPLVLMIFNISA TVLYITAFIACSAAVDLTSLRGTRPYNQRAAASFFACLVMIAYGVSAFFSYQAWRGVGSN AATSQMAGGYA >gi568815582f:57145368_57352264|GENSCAN_predicted_CDS_5|936_bp atggccgagttcccgtcgaaagttagcacgcggaccagcagtcctgcgcagggcgccgaa gcctcggtgtcggcgctgcgcccggacctgggcttcgtgcgctcccgcctcggggcgctc atgctgctgcagctgggcgcttcccgcacggacgcccccaagatcgccctgcatccctct caccacgacactaccccgtggccgtcgctgcgagcgcctcctgagtatctgggacgacag ggggtgaaggggtctgcccaaggcccctcagcctatgaggagctgagggagtgcagctcc aagggagatctgagggctgggctggatttctggggcctcctgtgggagcaccaccgcctt atctgtcagggcagggttctggtctctgtcatggttgaagaaataggattgttagttcct aagcacattgtgtatttagcgtatattatgtgtcgggcactgttgggcactgaagataca gcagtgaacaaaacccagaatcttcctcggagagcagggaaggtgctggggctgctggtg tgggcgctgattgcggacaccccgtaccacctgtatccggcctatggctgggtgatgttc gtcgctgtcttcctctggctggtgacaatcgtcctcttcaacctctacctgtttcagctg cacatgaagttgtacatggttccctggccactggtgttaatgatctttaacatcagcgcc accgttctctacatcaccgccttcatcgcctgctctgcggcagttgacctgacatccctg aggggcacccggccttataaccagcgcgcggctgcctcgttctttgcgtgtttggtgatg atcgcctatggagtgagtgccttcttcagctaccaggcctggcgaggagtaggcagcaat gcggccaccagtcagatggctggcggctatgcctaa >gi568815582f:57145368_57352264|GENSCAN_predicted_peptide_6|88_aa MLYDRINDEKSRNRPFSQDGTKGKEGSSCPPKAEAKSKALKDKKAVLKGVHSHIKKKIHT SPIFPWPRRCDSRGSPDILRRAPQEKQA >gi568815582f:57145368_57352264|GENSCAN_predicted_CDS_6|267_bp atgctatatgatagaataaatgatgaaaaatctcgaaatagacccttttcacaagatggc accaaaggcaaagaaggaagctcctgcccccctaaagctgaagccaaatcgaaggctttg aaggacaagaaggcagtgctgaaaggtgtccacagccacataaaaaagaagatccacaca tcacccatcttcccgtggccaagacgctgcgactccagaggcagcccagatatcctcaga agagcaccccaggagaaacaagcttga >gi568815582f:57145368_57352264|GENSCAN_predicted_peptide_7|82_aa MQFWVLGVPNKELEETHMESKAKQQQETRSYQKTTPTCNCRICPLHFPCSSLKDKLVQPK ACGSLEAQDSFECGPTQMRKLS >gi568815582f:57145368_57352264|GENSCAN_predicted_CDS_7|249_bp atgcagttctgggttcttggtgtcccgaacaaagaactggaggagacacacatggaaagc aaagcaaagcagcaacaggaaacgagaagttaccagaagaccacacccacctgcaactgt cgcatctgcccactgcattttccttgttcctcactcaaggataagcttgtccaacccaag gcctgtgggtcgcttgaggcccaggacagctttgaatgtggcccaacacaaatgcgtaaa ctttcttaa