GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:25:05 Sequence gi568815582f:57258817_57463585 : 204769 bp : 48.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 3254 3081 174 2 0 90 89 217 0.782 22.14 1.05 Intr - 5709 5557 153 0 0 66 73 45 0.243 1.07 1.04 Intr - 10319 10191 129 2 0 57 94 30 0.079 1.39 1.03 Intr - 20757 20733 25 2 1 65 88 20 0.010 -2.27 1.02 Intr - 25439 25360 80 2 2 86 49 85 0.235 2.75 1.01 Init - 25724 25590 135 2 0 122 85 274 0.996 30.64 1.00 Prom - 26384 26345 40 -3.96 2.00 Prom + 56112 56151 40 -3.16 2.01 Sngl + 62481 62747 267 2 0 30 47 199 0.581 5.23 2.02 PlyA + 63446 63451 6 1.05 3.03 PlyA - 64502 64497 6 1.05 3.02 Term - 84559 84390 170 2 2 79 41 57 0.322 -1.86 3.01 Init - 89634 89556 79 0 1 83 90 54 0.806 6.32 3.00 Prom - 93349 93310 40 -4.06 4.00 Prom + 93359 93398 40 -4.06 4.01 Init + 100001 100073 73 1 1 68 94 141 0.999 11.94 4.02 Term + 101621 101748 128 0 2 73 48 193 0.432 12.24 4.03 PlyA + 103041 103046 6 1.05 5.00 Prom + 107046 107085 40 -7.46 5.01 Init + 113753 113822 70 1 1 94 119 130 0.993 15.92 5.02 Intr + 120818 120938 121 0 1 75 53 74 0.728 2.15 5.03 Term + 123214 124216 1003 1 1 99 43 634 0.853 51.57 5.04 PlyA + 126205 126210 6 1.05 6.07 PlyA - 126962 126957 6 1.05 6.06 Term - 133617 133505 113 1 2 56 42 75 0.204 -1.58 6.05 Intr - 135259 135170 90 2 0 120 92 -33 0.158 0.27 6.04 Intr - 139214 139026 189 0 0 90 80 91 0.971 8.06 6.03 Intr - 140549 140404 146 1 2 138 94 33 0.688 8.83 6.02 Intr - 145662 145578 85 1 1 94 80 20 0.464 0.68 6.01 Init - 149519 149474 46 2 1 35 92 41 0.149 0.05 6.00 Prom - 150214 150175 40 -9.75 7.00 Prom + 151783 151822 40 -6.86 7.01 Init + 155117 155186 70 1 1 120 94 197 0.999 22.71 7.02 Intr + 156265 156382 118 2 1 122 86 149 0.822 17.62 7.03 Term + 158416 158515 100 1 1 63 41 102 0.438 0.60 7.04 PlyA + 159973 159978 6 1.05 8.09 PlyA - 162151 162146 6 1.05 8.08 Term - 170464 170354 111 1 0 115 49 119 0.990 9.26 8.07 Intr - 171523 171442 82 0 1 88 46 70 0.933 2.44 8.06 Intr - 172654 172335 320 1 2 85 105 180 0.798 14.16 8.05 Intr - 173744 173671 74 0 2 32 89 153 0.991 8.93 8.04 Intr - 175396 175228 169 1 1 59 110 129 0.955 11.72 8.03 Intr - 177916 177840 77 2 2 78 94 31 0.993 1.93 8.02 Intr - 180518 180366 153 0 0 72 82 40 0.750 1.84 8.01 Init - 182112 181956 157 0 1 76 101 179 0.999 18.08 8.00 Prom - 185395 185356 40 -8.16 9.00 Prom + 186337 186376 40 -6.16 9.01 Init + 188690 188762 73 1 1 93 93 129 0.990 13.13 9.02 Intr + 192224 192392 169 0 1 69 96 98 0.987 7.70 9.03 Intr + 193985 194120 136 2 1 63 89 226 0.981 20.77 9.04 Intr + 197688 197830 143 2 2 76 83 75 0.881 4.95 9.05 Intr + 198115 198199 85 1 1 110 48 33 0.890 1.32 9.06 Intr + 199430 199534 105 1 0 106 76 122 0.999 13.21 9.07 Intr + 200749 200904 156 0 0 25 96 261 0.993 20.81 9.08 Intr + 201235 201288 54 0 0 76 113 37 0.912 4.18 9.09 Intr + 203867 203994 128 1 2 29 60 206 0.237 11.38 9.10 Intr + 204213 204262 50 0 2 91 87 51 0.843 3.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_1|232_aa MAEFPSKVSTRTSSPAQGAEASVSALRPDLGFVRSRLGALMLLQLGASRTDAPKIALHPS HHDTTPWPSLRAPPEYLGRQGVKGSAQGPSAYEELRECSSKGDLRAGLDFWGLLWEHHRL ICQGRVLVSVMVEEIGLLVPKHIVYLAYIMCRALLGTEDTAVNKTQNLPRRAGKVLGLLV WALIADTPYHLYPAYGWVMFVAVFLWLVTIVLFNLYLFQLHMKLYMVPWPLV >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_1|696_bp atggccgagttcccgtcgaaagttagcacgcggaccagcagtcctgcgcagggcgccgaa gcctcggtgtcggcgctgcgcccggacctgggcttcgtgcgctcccgcctcggggcgctc atgctgctgcagctgggcgcttcccgcacggacgcccccaagatcgccctgcatccctct caccacgacactaccccgtggccgtcgctgcgagcgcctcctgagtatctgggacgacag ggggtgaaggggtctgcccaaggcccctcagcctatgaggagctgagggagtgcagctcc aagggagatctgagggctgggctggatttctggggcctcctgtgggagcaccaccgcctt atctgtcagggcagggttctggtctctgtcatggttgaagaaataggattgttagttcct aagcacattgtgtatttagcgtatattatgtgtcgggcactgttgggcactgaagataca gcagtgaacaaaacccagaatcttcctcggagagcagggaaggtgctggggctgctggtg tgggcgctgattgcggacaccccgtaccacctgtatccggcctatggctgggtgatgttc gtcgctgtcttcctctggctggtgacaatcgtcctcttcaacctctacctgtttcagctg cacatgaagttgtacatggttccctggccactggtg >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_2|88_aa MLYDRINDEKSRNRPFSQDGTKGKEGSSCPPKAEAKSKALKDKKAVLKGVHSHIKKKIHT SPIFPWPRRCDSRGSPDILRRAPQEKQA >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_2|267_bp atgctatatgatagaataaatgatgaaaaatctcgaaatagacccttttcacaagatggc accaaaggcaaagaaggaagctcctgcccccctaaagctgaagccaaatcgaaggctttg aaggacaagaaggcagtgctgaaaggtgtccacagccacataaaaaagaagatccacaca tcacccatcttcccgtggccaagacgctgcgactccagaggcagcccagatatcctcaga agagcaccccaggagaaacaagcttga >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_3|82_aa MQFWVLGVPNKELEETHMESKAKQQQETRSYQKTTPTCNCRICPLHFPCSSLKDKLVQPK ACGSLEAQDSFECGPTQMRKLS >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_3|249_bp atgcagttctgggttcttggtgtcccgaacaaagaactggaggagacacacatggaaagc aaagcaaagcagcaacaggaaacgagaagttaccagaagaccacacccacctgcaactgt cgcatctgcccactgcattttccttgttcctcactcaaggataagcttgtccaacccaag gcctgtgggtcgcttgaggcccaggacagctttgaatgtggcccaacacaaatgcgtaaa ctttcttaa >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_4|66_aa MDRLQTALLVVLVLLAVALQATEAGPYGANMEDSVCCRDYVRYRLPLRVVKHFYWTSDSC PRPGVV >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_4|201_bp atggatcgcctacagactgcactcctggttgtcctcgtcctccttgctgtggcgcttcaa gcaactgaggcaggcccctacggcgccaacatggaagacagcgtctgctgccgtgattac gtccgttaccgtctgcccctgcgcgtggtgaaacacttctactggacctcagactcctgc ccgaggcctggcgtggtgtga >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_5|397_aa MAPISLSWLLRLATFCHLTVLLAGQHHGVTKCNITCSKMTSKIPVALLIHYQQNQASCGK RAIILETRQHRLFCADPKEQWVKDAMQHLDRQAAALTRNGGTFEKQIGEVKPRTTPAAGG MDESVVLEPEATGESSSLEPTPSSQEAQRALGTSPELPTGVTGSSGTRLPPTPKAQDGGP VGTELFRVPPVSTAATWQSSAPHQPGPSLWAEAKTSEAPSTQDPSTQASTASSPAPEENA PSEGQRVWGQGQSPRPENSLEREEMGPVPAHTDAFQDWGPGSMAHVSVVPVSSEGTPSRE PVASGSWTPKAEEPIHATMDPQRLGVLITPVPDAQAATRRQAVGLLAFLGLLFCLGVAMF TYQSLQGCPRKMAGEMAEGLRYIPRSCGSNSYVLVPV >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_5|1194_bp atggctccgatatctctgtcgtggctgctccgcttggccaccttctgccatctgactgtc ctgctggctggacagcaccacggtgtgacgaaatgcaacatcacgtgcagcaagatgaca tcaaagatacctgtagctttgctcatccactatcaacagaaccaggcatcatgcggcaaa cgcgcaatcatcttggagacgagacagcacaggctgttctgtgccgacccgaaggagcaa tgggtcaaggacgcgatgcagcatctggaccgccaggctgctgccctaactcgaaatggc ggcaccttcgagaagcagatcggcgaggtgaagcccaggaccacccctgccgccggggga atggacgagtctgtggtcctggagcccgaagccacaggcgaaagcagtagcctggagccg actccttcttcccaggaagcacagagggccctggggacctccccagagctgccgacgggc gtgactggttcctcagggaccaggctccccccgacgccaaaggctcaggatggagggcct gtgggcacggagcttttccgagtgcctcccgtctccactgccgccacgtggcagagttct gctccccaccaacctgggcccagcctctgggctgaggcaaagacctctgaggccccgtcc acccaggacccctccacccaggcctccactgcgtcctccccagccccagaggagaatgct ccgtctgaaggccagcgtgtgtggggtcagggacagagccccaggccagagaactctctg gagcgggaggagatgggtcccgtgccagcgcacacggatgccttccaggactgggggcct ggcagcatggcccacgtctctgtggtccctgtctcctcagaagggacccccagcagggag ccagtggcttcaggcagctggacccctaaggctgaggaacccatccatgccaccatggac ccccagaggctgggcgtccttatcactcctgtccctgacgcccaggctgccacccggagg caggcggtggggctgctggccttccttggcctcctcttctgcctgggggtggccatgttc acctaccagagcctccagggctgccctcgaaagatggcaggagagatggcggagggcctt cgctacatcccccggagctgtggtagtaattcatatgtcctggtgcccgtgtga >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_6|222_aa MSEWMDGGWMMDGRVAPSFRLLGPETWALSLTPLPLAPNFPSVGPPLLLPWWLPTPDTLE RGPQPCGWDFSFIDLMVVVAMYKHLETGQTGPEDGEPIEHDCQQIAAQTYATQEDLLEVP LANPDLNLYADGSSFAENGIQRAGYAIVTDVTVLERSAVVTGTQRETGHIHAFPHRSFRN TVGQADIPRNNTLHPIKLTFNINHHNIRAIKTLEHQRISEKL >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_6|669_bp atgagtgaatggatggatggaggatggatgatggatggacgtgtggccccttccttccgg ttactagggccagaaacctgggcgctatccctgacgcctcttcctctcgcccccaacttc ccatctgttgggccaccactgctgttgccctggtggttacctacccctgacactctagaa agagggccccagccatgtggatgggatttctccttcattgatctgatggtggtcgtggca atgtataagcacctggaaacaggtcagacaggtccagaggatggggaaccaattgagcat gactgccaacaaattgcagcccagacttatgccacccaagaggatctcttggaagtcccc ttagctaatcctgaccttaacctatatgctgatggaagttcatttgcggagaatgggata caaagggcaggttatgccatagttactgatgtaacagtacttgaaagatctgcagtggtc actgggacacaaagggaaacagggcacatccatgccttccctcacagatctttcagaaac acagtggggcaggcagacatacctaggaacaatactttgcatccaatcaagttgacattc aacattaaccatcacaacatccgagccattaaaactctcgagcaccaacgtatttcagag aaactgtaa >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_7|95_aa MAPLKMLALVTLLLGASLQHIHAARGTNVGRECCLEYFKGAIPLRKLKTWYQTSEDCSRD AIVQLIAGQMIEPKFTFQYGHSDNEVDKTSELACL >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_7|288_bp atggccccactgaagatgctggccctggtcaccctcctcctgggggcttctctgcagcac atccacgcagctcgagggaccaatgtgggccgggagtgctgcctggagtacttcaaggga gccattccccttagaaagctgaagacgtggtaccagacatctgaggactgctccagggat gccatcgttcagctcattgcagggcagatgatagaacccaagtttactttccagtacggg cacagcgataacgaggttgataaaaccagtgaactggcttgcctgtga >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_8|380_aa MADFGISAGQFVAVVWDKSSPVEALKGLVDKLQALTGNEGRVSVENIKQLLQSAHKESSF DIILSGLVPGSTTLHSAEILAEIARILRPGGCLFLKEPVETAVDNNSKVKTASKLCSALT LSGLVEVKELQREPLTPEEVQSVREHLGHESDNLLFVQITGKKPNFEVGSSRQLKLSITK KSSPSVKPAVDPAAAKLWTLSANDMEDDSMCIFCGCSLTHRWPLEHVVRLNMMINQKEDR VDTFFTLDSKFPLEACSHFSFSLAETTTVSLIALNTLQDLIDSDELLDPEDLKKPDPASL RAASCGEGKKRKACKNCTCGLAEELEKEKSREQMSSQPKSACGNCYLGDAFRCASCPYLG MPAFKPGEKVLLSDSNLHDA >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_8|1143_bp atggcagattttgggatctctgctggccagtttgtggcagtggtctgggataagtcatcc ccagtggaggctctgaaaggtctggtggataagcttcaagcgttaaccggcaatgagggc cgcgtgtctgtggaaaacatcaagcagctgttgcaatctgcccacaaagaatccagcttt gacattattttgtcaggtttagtcccaggaagcaccactctgcacagtgctgagattttg gctgaaatcgcccggatccttcggcctggtggatgtctttttctgaaggagccagtagag acagctgtagataacaatagcaaagtgaagacagcatctaagctgtgttcagccctgact ctttctggtcttgtggaagtgaaagagctgcagcgggagcccctaacccctgaggaagta cagtctgttcgagaacaccttggtcatgaaagtgacaacctgctgtttgttcagatcaca ggcaaaaaaccaaactttgaagtgggttcttctaggcagcttaagctttccatcaccaag aagtcttctccttcagtgaaacctgctgtggaccctgctgctgccaagctgtggaccctc tcagccaacgatatggaggacgacagcatgtgcatcttctgtggatgtagtttaactcac cgttggcctcttgagcatgtggtcaggttgaacatgatgatcaaccaaaaggaggacagg gtggacaccttctttaccctggactccaagtttcctctcgaagcctgcagtcactttagc ttttcattagcagagaccacgactgtatcactcattgctttgaacactctccaggatctc attgactcagatgagctgctggatccagaagatttgaagaagccagatccagcttccctg cgggctgcttcttgtggggaagggaaaaagaggaaggcctgtaagaactgcacctgtggc cttgccgaagaactggaaaaagagaagtcaagggaacagatgagctcccaacccaagtca gcttgtggaaactgctacctgggcgatgccttccgctgtgccagctgcccctaccttggg atgccagccttcaaacctggggaaaaggtgcttctgagtgatagcaatcttcatgatgcc tag >gi568815582f:57258817_57463585|GENSCAN_predicted_peptide_9|367_aa MAAAAVSGALGRAGWRLLQLRCLPVARCRQALVPRAFHASAVGLRSSDEQKQQPPNSFSQ QHSETQGAEKPDPESSHSPPRYTDQGGEEEEDYESEEQLQHRILTAALEFVPAHGWTAEA IAEGAQSLGLSSAAASMFGKDGSELILHFVTQCNTRLTRVLEEEQKLVQLGQAEKRKTDQ FLRDAVETRLRMLIPYIEHWPRALSILMLPHNIPSSLSLLTSMVDDMWHYAGDQSTDFNW YTRRAMLAAIYNTTELVMMQDSSPDFEDTWRFLENRVNDAMNMGHTAKQVKSTGEALVQG LMGAAVTSPRSRRGGWWPLGEMPYANQPTVRITELTDENVKFIIENTDLAVANSIRRVFI AEVPIIX >gi568815582f:57258817_57463585|GENSCAN_predicted_CDS_9|1101_bp atggcggcggcggcggtatctggtgcgcttggccgggcgggctggaggctcctgcagctg cgatgcctgcccgtggcccgttgccgacaagccctggtgccgcgtgccttccatgcttca gctgtggggctaaggtcttcagatgagcagaagcagcagcctcccaactcattttctcag cagcattctgagacacagggggcagaaaaacctgatccagagtcttctcattcacccccc aggtatacagaccagggcggcgaggaggaggaggactatgaaagtgaggagcagttgcag caccgcatcctgacggcagcccttgagtttgtgcccgcccacgggtggacagcagaggcg attgcagaaggagcccagtctctgggtctctccagtgcagcagccagcatgttcgggaag gatggcagtgagctaatactgcattttgtgacccagtgcaatacccggctcacacgtgtg ctagaagaggagcagaagctggtacagttgggccaggcggagaagaggaagacagaccag ttcctgagggatgcagtggaaaccagactgagaatgctgatcccatacattgagcactgg ccccgggccctcagcatcctcatgctccctcacaacatcccgtccagcctgagcctgctc accagcatggtggatgacatgtggcattacgctggggaccagtccactgattttaactgg tacacccgccgagccatgctggctgccatctacaacacaacagagctggtgatgatgcag gactcctctccagactttgaggacacttggcgcttcctggaaaaccgggttaatgatgca atgaacatgggccacactgccaagcaggtaaagtccacaggagaggcactggtgcaagga ctcatgggtgcagcagtgacgtcaccgcggagcagacgcggaggctggtggcccctgggc gagatgccgtacgccaaccagcctaccgtgcggatcacggagctcactgacgagaatgtc aagttcatcatcgagaacaccgacctggcggtggccaattcgattcggagggtcttcatc gctgaggttcccataatagnn