GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:54:58 Sequence gi568815582f:57104659_57338972 : 234314 bp : 46.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1389 1516 128 0 2 65 96 51 0.723 3.92 1.02 Intr + 6050 6264 215 0 2 134 78 182 0.490 20.23 1.03 Intr + 8630 8809 180 1 0 81 101 311 0.992 31.76 1.04 Intr + 10818 10892 75 2 0 130 33 76 0.984 6.01 1.05 Intr + 12838 12909 72 0 0 69 116 88 0.996 9.30 1.06 Intr + 14537 14620 84 1 0 118 90 76 0.999 10.82 1.07 Intr + 14903 14992 90 1 0 153 94 110 0.999 18.29 1.08 Intr + 16435 16533 99 0 0 86 97 216 0.998 22.61 1.09 Intr + 17016 17498 483 2 0 60 47 246 0.442 10.92 1.10 Intr + 18756 18815 60 2 0 85 75 42 0.597 1.53 1.11 Intr + 21202 21335 134 0 2 106 110 159 0.999 19.34 1.12 Intr + 23374 23539 166 1 1 122 97 -20 0.399 2.26 1.13 Intr + 30117 30168 52 2 1 108 109 88 0.999 11.38 1.14 Intr + 32491 32624 134 2 2 30 113 215 0.969 18.56 1.15 Intr + 41427 41663 237 2 0 81 92 468 0.123 44.31 1.16 Term + 42893 43000 108 1 0 112 49 117 0.998 8.71 1.17 PlyA + 43287 43292 6 1.05 2.08 PlyA - 43986 43981 6 1.05 2.07 Term - 49777 49632 146 2 2 75 29 241 0.679 14.97 2.06 Intr - 59407 59343 65 0 2 79 98 42 0.935 2.66 2.05 Intr - 62556 62435 122 0 2 -10 88 138 0.603 3.29 2.04 Intr - 67714 67566 149 2 2 82 67 147 0.991 11.95 2.03 Intr - 68216 68118 99 0 0 147 63 141 0.999 17.58 2.02 Intr - 69211 69070 142 1 1 104 69 147 0.812 14.33 2.01 Init - 69806 69783 24 2 0 45 90 2 0.220 -4.32 2.00 Prom - 70675 70636 40 -3.66 3.00 Prom + 76508 76547 40 -6.66 3.01 Init + 80965 81126 162 0 0 56 43 137 0.717 3.79 3.02 Term + 81356 81511 156 1 0 84 55 126 0.961 6.83 3.03 PlyA + 82975 82980 6 1.05 4.00 Prom + 91415 91454 40 -5.36 4.01 Init + 100118 100350 233 1 2 81 97 127 0.580 10.63 4.02 Intr + 103400 103452 53 2 2 98 68 -9 0.481 -3.35 4.03 Intr + 104417 104529 113 0 2 79 106 -4 0.610 0.60 4.04 Intr + 108314 108440 127 1 1 95 93 11 0.590 2.55 4.05 Intr + 112246 112377 132 2 0 97 84 45 0.964 5.62 4.06 Intr + 116074 116189 116 2 2 74 92 122 0.837 11.37 4.07 Intr + 116614 116757 144 0 0 76 84 168 0.999 15.68 4.08 Intr + 122684 122762 79 1 1 123 74 197 0.940 20.92 4.09 Intr + 126053 126155 103 0 1 106 75 5 0.733 0.23 4.10 Intr + 126509 126661 153 2 0 78 116 40 0.707 4.99 4.11 Intr + 130466 130570 105 2 0 65 110 23 0.222 1.53 4.12 Intr + 141422 141483 62 2 2 31 94 77 0.036 0.98 4.13 Intr + 143879 143985 107 0 2 36 102 144 0.986 10.53 4.14 Intr + 145109 145194 86 1 2 80 79 111 0.998 8.02 4.15 Intr + 145753 145849 97 1 1 78 92 113 0.739 10.71 4.16 Term + 147508 147609 102 0 0 74 42 26 0.324 -5.12 4.17 PlyA + 149628 149633 6 1.05 5.09 PlyA - 149962 149957 6 1.05 5.08 Term - 152371 152255 117 1 0 103 40 173 0.992 12.54 5.07 Intr - 153926 153804 123 2 0 96 109 164 0.999 20.08 5.06 Intr - 157412 157239 174 2 0 90 89 217 0.986 22.14 5.05 Intr - 159867 159715 153 0 0 66 73 45 0.300 1.07 5.04 Intr - 164477 164349 129 2 0 57 94 30 0.097 1.39 5.03 Intr - 174915 174891 25 2 1 65 88 20 0.011 -2.27 5.02 Intr - 179597 179518 80 2 2 86 49 85 0.235 2.75 5.01 Init - 179882 179748 135 2 0 122 85 274 0.996 30.64 5.00 Prom - 180542 180503 40 -3.96 6.00 Prom + 210270 210309 40 -3.16 6.01 Sngl + 216639 216905 267 2 0 30 47 199 0.581 5.23 6.02 PlyA + 217604 217609 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 41355 41663 309 2 0 40 92 571 0.856 48.01 S.002 Init + 141411 141483 73 2 1 51 94 83 0.904 6.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:57104659_57338972|GENSCAN_predicted_peptide_1|772_aa XPQGPVRLAAAGLAQRDPLFPPPAPTPDIMCSPFRGSRSFQPEDCQELLPPPPAPMAHIP SGGAPAAGAAPMGPQYCVCKVELSVSGQNLLDRDVTSKSDPFCVLFTENNGRWIEYDRTE TAINNLNPAFSKKFVLDYHFEEVQKLKFALFDQDKSSMRLDEHDFLGQFSCSLGTIVSSK KITRPLLLLNDKPAGKGLITIAAQELSDNRVITLSLAGRRLDKKDLFGKSDPFLEFYKPG DDGKWMLVHRTEVIKYTLDPVWKPFTVPLVSLCDGDMEKPIQVMCYDYDNDGGHDFIGEF QTSVSQMCEARDSVPLEFECINPKKQRKKKNYKNSGIIILRSCKVNQRGQARARGQVSGH TEGTDRALDLEPARPVFKSRLCHILALWPSELHLTRPQSPHLWNGDINEAEHVQGYSSLT FLCGGQGRHELTFQGLSWWENCHCELLPPTVGMQGAEDIGWGHPVLSGISVTVTPMINRD YSFLDYILGGCQLMFTVGIDFTASNGNPLDPSSLHYINPMGTNEYLSAIWAVGQIIQDYD RPRSFLKPDPWAGTEAVTGQHVMFSVYETHVEWGRQIGLPDVANQNTRHPVKFEFQVSHE FAINFNPTNPFCSGVDGIAQAYSACLPHIRFYGPTNFSPIVNHVARFAAQATQQRTATQY FILLIITDGVISDMEETRHAVVQASKLPMSIIIVGVGNADFAAMEFLDGDSRMLRSHTGE EAARDIVQFVPFREFRNAAKETLAKAVLAELPQQVVQYFKHKNLPPTNSEPA >gi568815582f:57104659_57338972|GENSCAN_predicted_CDS_1|2319_bp nngcctcagggtccagtacgcctggctgctgctgggctggcgcagagggacccactcttc cccccgcctgccccaactccagacatcatgtgctccccgtttcggggatccagaagcttc cagccagaggactgccaggaactcctgccaccgccaccggctcccatggcccacataccc agtgggggtgccccagcagcgggggcagcccccatgggcccccagtattgcgtgtgcaag gtggagctgtcagtgagtggccagaacctactggaccgggatgttacctccaagtccgac cccttctgtgtcctctttacagagaacaatggcagatggatcgagtacgacaggacagaa accgcgatcaacaacctcaaccccgccttctccaagaagttcgtgcttgactaccacttc gaggaggtacagaagctcaagttcgcgctctttgaccaggacaagtccagtatgcggctg gacgagcatgacttcctgggccagttctcctgcagcctgggcacgatcgtctccagcaag aagatcactaggcctctgctgctgctgaatgacaagcctgcggggaagggcttgattacg atcgctgcccaggagctgtccgacaaccgcgtcatcacactaagcctggcgggcaggagg ctggacaagaaggacctctttgggaagtcagacccctttctggagttttataagccagga gacgatggcaagtggatgctggtccacaggactgaggtgatcaagtacacactggaccct gtgtggaagccattcacagtgcccttggtgtccctgtgtgatggggacatggagaagccc atccaggtcatgtgctacgactatgacaatgacgggggccatgacttcatcggcgagttc cagacctcagtgtcacagatgtgtgaggctcgagacagcgtcccgctggagttcgagtgc atcaaccccaagaagcagaggaagaagaagaactataaaaactcgggcatcatcatcctg cgatcctgcaaggtgaaccagcgtgggcaagcacgagccaggggccaggtttcaggccac acagaagggacagacagagcactggatctggagccagcgaggcctgtgttcaaatcccgg ctctgccacatcctggccctatggccttctgagcttcacctcaccaggcctcagtctcct catctgtggaatggagatattaacgaggcagagcatgtccagggttactcttctctcaca ttcctgtgtggaggccagggaaggcacgagctgaccttccagggcctgtcctggtgggag aattgtcattgcgagttgctgccgcccacggtgggcatgcaaggggcagaggacattggg tggggacaccctgtcctctctggcatcagcgtcactgtgactcccatgataaaccgagac tactccttccttgactacatcctgggaggctgccagctcatgttcaccgttggaatagac tttacagcctccaacgggaatcccctcgacccttcctctttgcactatatcaaccctatg ggcaccaacgaatatctgtcggccatctgggctgttgggcagatcattcaggactacgac aggccacgctcatttctcaagcctgatccttgggcaggcactgaggctgtaacggggcag cacgttatgttctctgtttatgagacccatgtggagtgggggagacagataggattgcca gatgtcgcaaatcaaaatacaagacacccagttaagtttgaatttcaggtctcccatgag tttgccatcaacttcaaccccaccaaccccttctgctcaggtgtggatggtattgcccag gcgtactcagcttgcctgccccacatccgcttctacggtcctaccaatttctcccccatc gtcaaccacgtggcccggtttgcggcccaggccacacaacagcggacggccacgcagtac ttcatcctcctcatcatcacggacggggtcatcagtgacatggaggagacacggcatgcc gtggtgcaggcttccaagctgcccatgtccatcatcatcgtgggcgtgggcaatgcggac ttcgctgccatggagttcctggatggggacagccgcatgctgcgctcccacacgggggag gaggcagcccgcgatattgtgcagttcgttccctttcgagagttccgcaacgcagcaaaa gagaccttggccaaagctgtgctggcggagctgccccaacaagttgtgcagtatttcaag cataaaaacctgccccccaccaactcggagcccgcctga >gi568815582f:57104659_57338972|GENSCAN_predicted_peptide_2|248_aa MRFTMEHQIGCFIMDGGDDGNLIIKKRFVSEAELDERRKRRQEEWEKVRKPEDPEECPEE VYDPRSLYERLQEQKDRKQQEYEEQFKFKNMVRGLDEDETNFLDEVSRQQELIEKQRREE ELKELKEYRISFVLGGKPKVGISQENKKEVEKKLTVKPIETKNKFSQAKLLAGAVKHKSS ESGNSVKRLKPDPEPDDKNQVCIGILPGLGAYSGSSDSESSSDSEGTINATGKIVSSIFR TNTFLEAP >gi568815582f:57104659_57338972|GENSCAN_predicted_CDS_2|747_bp atgagatttactatggaacaccagattggttgtttcattatggatggaggggatgatggt aaccttattatcaaaaagaggtttgtgtctgaggcagaactagatgaacggcgcaaaagg aggcaagaagaatgggagaaagttcgaaaacctgaagatccagaagaatgtccagaggag gtttatgaccctcgatctctatatgaaaggctacaggaacagaaggacaggaagcagcag gagtacgaggaacagttcaaattcaaaaacatggtaagaggcttagatgaagatgagacc aacttccttgatgaggtttctcgacagcaggaactaatagaaaagcaacgaagagaagaa gaactgaaagaactgaaggaatacagaatatcctttgtacttggaggaaaacctaaggtt ggaatttctcaagagaacaagaaggaagtggaaaagaaactgactgtgaagcctatagaa accaagaacaagttctcccaggcgaagctgttggcaggagctgtgaagcataagagctca gagagtggcaacagtgtgaaaagactgaaaccggaccctgagccagatgacaagaatcaa gtatgtatcggcatcctcccaggcctgggtgcctactctgggagcagcgactccgagtcc agctcagacagcgaaggcaccatcaatgccaccggaaagattgtctcctccatcttccga accaacaccttcctcgaggccccctag >gi568815582f:57104659_57338972|GENSCAN_predicted_peptide_3|105_aa MPSRGHPQLPSGWARKPQASATPDKERVRGLRTGLALTLTAAAAHLFRKEAGTQGFLFLT ATIESRPTSGDVTQSAELFLPGSPMEIRYPENEPERLDWAIHASG >gi568815582f:57104659_57338972|GENSCAN_predicted_CDS_3|318_bp atgccttcccgcgggcaccctcagctcccctcaggctgggcccgcaagccccaggcttcg gctactccggacaaggagcgggtgcgcggactgagaacaggcctggccctaaccctaaca gcagccgcagctcatctcttccgtaaggaagctggaacccagggcttcctgttcctcacc gccacaatagagtcccgccccacttccggcgacgtaacccaatccgcggagctcttcctc cccgggagcccgatggaaatccggtaccctgaaaacgagccggagagacttgattgggcc attcacgcctcaggatga >gi568815582f:57104659_57338972|GENSCAN_predicted_peptide_4|603_aa MGNSCICRDDSGTDDSVDTQQQQAENSAVPTADTRSQPRDPVRPPRRGRGPHEPRRKKQN VDGLVLDTLAVIRTLVDNDQEPPYSMITLHEMAETDEGWLDVVQSLIRVIPLEDPLGPAV ITLLLDECPLPTKDALQKLTEILNLNGEVACQDSSHPAKHRNTSAVLGCLAEKLAGENKL TISESSISDRLVTLESWANDPDYLKRQVGFCAQWSLDNLFLKEGRQLTYEKVNLSSIRAM LNSNDVSEYLKISPHGLEARCDASSFESVRCTFCVDAGVWYYEVTVVTSGVMQIGWATRD SKFLNHEGYGIGDDEYSCAYDGCRQLIWYNARRDTVGFLLDLNEKQMIFFLNGNQLPPEK QVFSSTVSGFFAAASFMSYQQCEFNFGAKPFKYPPSMKFSTFNDYAFLTAEEKIILPRHR RLALLKQVSIRENCCSLCCDEVADTQLKPCGHSSSASDAEFDAVVGYLEDIIMDDEFQLL QRNFMDKYYLEFEDTEENKLIYTPIFNEYISLVEKYIEEQLLQRIPEFNMAAFTTTLQHH KDEVAGDIFDMLLTFTDFLAFKEMFLDYRAEKEGRGLDLSSGLVVTSLCKSSSLPASQNN LRH >gi568815582f:57104659_57338972|GENSCAN_predicted_CDS_4|1812_bp atgggtaattcctgtatctgccgagatgacagtggaacagatgacagtgttgacacccaa cagcaacaggccgagaacagtgcagtacccactgctgacacaaggagccaaccacgggac cctgttcggccaccaaggaggggccgaggacctcatgagccaaggagaaagaaacaaaat gtggatgggctagtgttggacacactggcagtaatacggactcttgtagataatgatcag gaacctccctattcaatgataacattacacgaaatggcagaaacagatgaaggatggttg gatgttgtccagtctttaattagagttattccactggaagatccactgggaccagctgtt ataacattgttactagatgaatgtccattgcccactaaagatgcactccagaaattgact gaaattctcaatttaaatggagaagtagcttgccaggactcaagccatcctgccaaacac aggaacacatctgcagtcctaggctgcttggccgagaaactagcaggtgaaaataaattg actatttctgaatccagtattagtgaccggcttgtcacattggagtcctgggctaatgat cctgattatctgaaacgtcaagttggtttctgtgcccagtggagcttagacaatctcttt ttaaaagaaggtagacagctgacctatgagaaagtgaacttgagtagcattagggccatg ctgaatagcaatgatgtcagcgagtacctgaagatctcacctcatggcttagaggctcgc tgtgatgcctcctcttttgaaagtgtgcgttgcaccttttgtgtggatgccggggtatgg tactatgaagtaacagtggtcacttctggcgtcatgcagattggctgggccactcgagac agcaaattcctcaatcatgaaggctacggcattggggatgatgaatactcctgtgcgtat gatggctgccggcagctgatttggtacaatgccagaagagatacagtaggatttctgtta gacttgaatgaaaagcaaatgatcttctttttaaatggcaaccagctgcctcctgaaaag caagtcttttcatctactgtatctggattttttgctgcagctagtttcatgtcatatcaa caatgtgagttcaattttggagcaaaaccattcaaatacccaccatctatgaaatttagc acttttaatgactacgccttcctaacagctgaagaaaaaatcattttgccaaggcacagg cgtcttgctctgttgaagcaagtcagtatccgagaaaactgctgttccctttgttgtgat gaggtagcagacacacaattgaagccatgtggacacagctcctccgcctctgatgcagaa tttgatgctgtggttggatatttagaggacattatcatggatgacgagttccagttatta cagagaaatttcatggacaagtactacctggagtttgaagacacagaagagaataaactc atctacacacctatttttaatgaatacatttctttggtagaaaaatacattgaagaacag ctgctgcagcggattcctgagttcaacatggcagccttcaccacaacattacagcaccat aaggatgaagtggctggtgacatattcgacatgctgctcaccttcacagattttctggct tttaaagaaatgtttttggactacagagcagaaaaagaaggccgaggactggacttaagc agtggcttagtggtgacttcattgtgcaaatcatcttctctgccagcttcccagaacaat ctgcggcactag >gi568815582f:57104659_57338972|GENSCAN_predicted_peptide_5|311_aa MAEFPSKVSTRTSSPAQGAEASVSALRPDLGFVRSRLGALMLLQLGASRTDAPKIALHPS HHDTTPWPSLRAPPEYLGRQGVKGSAQGPSAYEELRECSSKGDLRAGLDFWGLLWEHHRL ICQGRVLVSVMVEEIGLLVPKHIVYLAYIMCRALLGTEDTAVNKTQNLPRRAGKVLGLLV WALIADTPYHLYPAYGWVMFVAVFLWLVTIVLFNLYLFQLHMKLYMVPWPLVLMIFNISA TVLYITAFIACSAAVDLTSLRGTRPYNQRAAASFFACLVMIAYGVSAFFSYQAWRGVGSN AATSQMAGGYA >gi568815582f:57104659_57338972|GENSCAN_predicted_CDS_5|936_bp atggccgagttcccgtcgaaagttagcacgcggaccagcagtcctgcgcagggcgccgaa gcctcggtgtcggcgctgcgcccggacctgggcttcgtgcgctcccgcctcggggcgctc atgctgctgcagctgggcgcttcccgcacggacgcccccaagatcgccctgcatccctct caccacgacactaccccgtggccgtcgctgcgagcgcctcctgagtatctgggacgacag ggggtgaaggggtctgcccaaggcccctcagcctatgaggagctgagggagtgcagctcc aagggagatctgagggctgggctggatttctggggcctcctgtgggagcaccaccgcctt atctgtcagggcagggttctggtctctgtcatggttgaagaaataggattgttagttcct aagcacattgtgtatttagcgtatattatgtgtcgggcactgttgggcactgaagataca gcagtgaacaaaacccagaatcttcctcggagagcagggaaggtgctggggctgctggtg tgggcgctgattgcggacaccccgtaccacctgtatccggcctatggctgggtgatgttc gtcgctgtcttcctctggctggtgacaatcgtcctcttcaacctctacctgtttcagctg cacatgaagttgtacatggttccctggccactggtgttaatgatctttaacatcagcgcc accgttctctacatcaccgccttcatcgcctgctctgcggcagttgacctgacatccctg aggggcacccggccttataaccagcgcgcggctgcctcgttctttgcgtgtttggtgatg atcgcctatggagtgagtgccttcttcagctaccaggcctggcgaggagtaggcagcaat gcggccaccagtcagatggctggcggctatgcctaa >gi568815582f:57104659_57338972|GENSCAN_predicted_peptide_6|88_aa MLYDRINDEKSRNRPFSQDGTKGKEGSSCPPKAEAKSKALKDKKAVLKGVHSHIKKKIHT SPIFPWPRRCDSRGSPDILRRAPQEKQA >gi568815582f:57104659_57338972|GENSCAN_predicted_CDS_6|267_bp atgctatatgatagaataaatgatgaaaaatctcgaaatagacccttttcacaagatggc accaaaggcaaagaaggaagctcctgcccccctaaagctgaagccaaatcgaaggctttg aaggacaagaaggcagtgctgaaaggtgtccacagccacataaaaaagaagatccacaca tcacccatcttcccgtggccaagacgctgcgactccagaggcagcccagatatcctcaga agagcaccccaggagaaacaagcttga