GENSCAN 1.0	Date run: 7-Nov-116	Time: 01:13:38

Sequence gi568815582f:57272569_57483029 : 210461 bp : 48.10% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1431   1426    6                               1.05
 1.02 Term -  11687  11604   84  2  0   86   53    76 0.604   1.55
 1.01 Init -  11972  11838  135  2  0  122   85   274 0.996  30.64
 1.00 Prom -  12632  12593   40                              -3.96

 2.00 Prom +  42360  42399   40                              -3.16
 2.01 Sngl +  48729  48995  267  2  0   30   47   199 0.581   5.23
 2.02 PlyA +  49694  49699    6                               1.05

 3.03 PlyA -  50750  50745    6                               1.05
 3.02 Term -  70807  70638  170  2  2   79   41    57 0.322  -1.86
 3.01 Init -  75882  75804   79  0  1   83   90    54 0.806   6.32
 3.00 Prom -  79597  79558   40                              -4.06

 4.00 Prom +  79607  79646   40                              -4.06
 4.01 Init +  86249  86321   73  1  1   68   94   141 0.999  11.94
 4.02 Term +  87869  87996  128  0  2   73   48   193 0.432  12.24
 4.03 PlyA +  89289  89294    6                               1.05

 5.00 Prom +  93294  93333   40                              -7.46
 5.01 Init + 100001 100070   70  1  1   94  119   130 0.993  15.92
 5.02 Intr + 107066 107186  121  0  1   75   53    74 0.728   2.15
 5.03 Term + 109462 110464 1003  1  1   99   43   634 0.853  51.57
 5.04 PlyA + 112453 112458    6                               1.05

 6.07 PlyA - 113210 113205    6                               1.05
 6.06 Term - 119865 119753  113  1  2   56   42    75 0.204  -1.58
 6.05 Intr - 121507 121418   90  2  0  120   92   -33 0.158   0.27
 6.04 Intr - 125462 125274  189  0  0   90   80    91 0.971   8.06
 6.03 Intr - 126797 126652  146  1  2  138   94    33 0.688   8.83
 6.02 Intr - 131910 131826   85  1  1   94   80    20 0.464   0.68
 6.01 Init - 135767 135722   46  2  1   35   92    41 0.149   0.05
 6.00 Prom - 136462 136423   40                              -9.75

 7.00 Prom + 138031 138070   40                              -6.86
 7.01 Init + 141365 141434   70  1  1  120   94   197 0.999  22.71
 7.02 Intr + 142513 142630  118  2  1  122   86   149 0.822  17.62
 7.03 Term + 144664 144763  100  1  1   63   41   102 0.438   0.60
 7.04 PlyA + 146221 146226    6                               1.05

 8.09 PlyA - 148399 148394    6                               1.05
 8.08 Term - 156712 156602  111  1  0  115   49   119 0.990   9.26
 8.07 Intr - 157771 157690   82  0  1   88   46    70 0.933   2.44
 8.06 Intr - 158902 158583  320  1  2   85  105   180 0.798  14.16
 8.05 Intr - 159992 159919   74  0  2   32   89   153 0.991   8.93
 8.04 Intr - 161644 161476  169  1  1   59  110   129 0.955  11.72
 8.03 Intr - 164164 164088   77  2  2   78   94    31 0.993   1.93
 8.02 Intr - 166766 166614  153  0  0   72   82    40 0.750   1.84
 8.01 Init - 168360 168204  157  0  1   76  101   179 0.999  18.08
 8.00 Prom - 171643 171604   40                              -8.16

 9.00 Prom + 172585 172624   40                              -6.16
 9.01 Init + 174938 175010   73  1  1   93   93   129 0.990  13.13
 9.02 Intr + 178472 178640  169  0  1   69   96    98 0.987   7.70
 9.03 Intr + 180233 180368  136  2  1   63   89   226 0.981  20.77
 9.04 Intr + 183936 184078  143  2  2   76   83    75 0.881   4.95
 9.05 Intr + 184363 184447   85  1  1  110   48    33 0.890   1.32
 9.06 Intr + 185678 185782  105  1  0  106   76   122 0.999  13.21
 9.07 Intr + 186997 187152  156  0  0   25   96   261 0.993  20.81
 9.08 Intr + 187483 187536   54  0  0   76  113    37 0.911   4.18
 9.09 Intr + 190115 190242  128  1  2   29   60   206 0.253  11.38
 9.10 Intr + 190461 190510   50  0  2   91   87    51 0.989   3.62
 9.11 Intr + 193385 193453   69  0  0   91   97    35 0.934   3.85
 9.12 Intr + 193607 193659   53  0  2  110   79    92 0.937   9.13
 9.13 Intr + 196597 196725  129  0  0  119   77   162 0.997  19.09
 9.14 Intr + 197142 197193   52  2  1   74  115   136 0.999  13.38
 9.15 Intr + 197393 197561  169  0  1  131   80   292 0.999  31.70
 9.16 Intr + 197712 197786   75  0  0   76  116   179 0.999  18.13
 9.17 Term + 198407 198551  145  2  1   75   36   116 0.986   2.48
 9.18 PlyA + 199094 199099    6                               1.05

10.09 PlyA - 199377 199372    6                               1.05
10.08 Term - 200927 200809  119  0  2  117   47    71 0.947   4.60
10.07 Intr - 201168 201045  124  0  1  112  105   197 0.999  23.96
10.06 Intr - 201471 201333  139  2  1   99   72   219 0.876  21.87
10.05 Intr - 202414 202225  190  2  1   88   74   329 0.999  30.14
10.04 Intr - 202651 202532  120  2  0   28   94    95 0.506   4.57
10.03 Intr - 203052 202938  115  0  1   80   94   218 0.999  21.62
10.02 Intr - 203389 203282  108  1  0  102   72   168 0.863  17.08
10.01 Init - 206939 206874   66  2  0   96   84   133 0.395  14.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_1|72_aa
MAEFPSKVSTRTSSPAQGAEASVSALRPDLGFVRSRLGALMLLQLGASRTDAPKIALHPS
HHDTTPWPSLRA

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_1|219_bp
atggccgagttcccgtcgaaagttagcacgcggaccagcagtcctgcgcagggcgccgaa
gcctcggtgtcggcgctgcgcccggacctgggcttcgtgcgctcccgcctcggggcgctc
atgctgctgcagctgggcgcttcccgcacggacgcccccaagatcgccctgcatccctct
caccacgacactaccccgtggccgtcgctgcgagcgtga

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_2|88_aa
MLYDRINDEKSRNRPFSQDGTKGKEGSSCPPKAEAKSKALKDKKAVLKGVHSHIKKKIHT
SPIFPWPRRCDSRGSPDILRRAPQEKQA

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_2|267_bp
atgctatatgatagaataaatgatgaaaaatctcgaaatagacccttttcacaagatggc
accaaaggcaaagaaggaagctcctgcccccctaaagctgaagccaaatcgaaggctttg
aaggacaagaaggcagtgctgaaaggtgtccacagccacataaaaaagaagatccacaca
tcacccatcttcccgtggccaagacgctgcgactccagaggcagcccagatatcctcaga
agagcaccccaggagaaacaagcttga

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_3|82_aa
MQFWVLGVPNKELEETHMESKAKQQQETRSYQKTTPTCNCRICPLHFPCSSLKDKLVQPK
ACGSLEAQDSFECGPTQMRKLS

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_3|249_bp
atgcagttctgggttcttggtgtcccgaacaaagaactggaggagacacacatggaaagc
aaagcaaagcagcaacaggaaacgagaagttaccagaagaccacacccacctgcaactgt
cgcatctgcccactgcattttccttgttcctcactcaaggataagcttgtccaacccaag
gcctgtgggtcgcttgaggcccaggacagctttgaatgtggcccaacacaaatgcgtaaa
ctttcttaa

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_4|66_aa
MDRLQTALLVVLVLLAVALQATEAGPYGANMEDSVCCRDYVRYRLPLRVVKHFYWTSDSC
PRPGVV

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_4|201_bp
atggatcgcctacagactgcactcctggttgtcctcgtcctccttgctgtggcgcttcaa
gcaactgaggcaggcccctacggcgccaacatggaagacagcgtctgctgccgtgattac
gtccgttaccgtctgcccctgcgcgtggtgaaacacttctactggacctcagactcctgc
ccgaggcctggcgtggtgtga

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_5|397_aa
MAPISLSWLLRLATFCHLTVLLAGQHHGVTKCNITCSKMTSKIPVALLIHYQQNQASCGK
RAIILETRQHRLFCADPKEQWVKDAMQHLDRQAAALTRNGGTFEKQIGEVKPRTTPAAGG
MDESVVLEPEATGESSSLEPTPSSQEAQRALGTSPELPTGVTGSSGTRLPPTPKAQDGGP
VGTELFRVPPVSTAATWQSSAPHQPGPSLWAEAKTSEAPSTQDPSTQASTASSPAPEENA
PSEGQRVWGQGQSPRPENSLEREEMGPVPAHTDAFQDWGPGSMAHVSVVPVSSEGTPSRE
PVASGSWTPKAEEPIHATMDPQRLGVLITPVPDAQAATRRQAVGLLAFLGLLFCLGVAMF
TYQSLQGCPRKMAGEMAEGLRYIPRSCGSNSYVLVPV

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_5|1194_bp
atggctccgatatctctgtcgtggctgctccgcttggccaccttctgccatctgactgtc
ctgctggctggacagcaccacggtgtgacgaaatgcaacatcacgtgcagcaagatgaca
tcaaagatacctgtagctttgctcatccactatcaacagaaccaggcatcatgcggcaaa
cgcgcaatcatcttggagacgagacagcacaggctgttctgtgccgacccgaaggagcaa
tgggtcaaggacgcgatgcagcatctggaccgccaggctgctgccctaactcgaaatggc
ggcaccttcgagaagcagatcggcgaggtgaagcccaggaccacccctgccgccggggga
atggacgagtctgtggtcctggagcccgaagccacaggcgaaagcagtagcctggagccg
actccttcttcccaggaagcacagagggccctggggacctccccagagctgccgacgggc
gtgactggttcctcagggaccaggctccccccgacgccaaaggctcaggatggagggcct
gtgggcacggagcttttccgagtgcctcccgtctccactgccgccacgtggcagagttct
gctccccaccaacctgggcccagcctctgggctgaggcaaagacctctgaggccccgtcc
acccaggacccctccacccaggcctccactgcgtcctccccagccccagaggagaatgct
ccgtctgaaggccagcgtgtgtggggtcagggacagagccccaggccagagaactctctg
gagcgggaggagatgggtcccgtgccagcgcacacggatgccttccaggactgggggcct
ggcagcatggcccacgtctctgtggtccctgtctcctcagaagggacccccagcagggag
ccagtggcttcaggcagctggacccctaaggctgaggaacccatccatgccaccatggac
ccccagaggctgggcgtccttatcactcctgtccctgacgcccaggctgccacccggagg
caggcggtggggctgctggccttccttggcctcctcttctgcctgggggtggccatgttc
acctaccagagcctccagggctgccctcgaaagatggcaggagagatggcggagggcctt
cgctacatcccccggagctgtggtagtaattcatatgtcctggtgcccgtgtga

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_6|222_aa
MSEWMDGGWMMDGRVAPSFRLLGPETWALSLTPLPLAPNFPSVGPPLLLPWWLPTPDTLE
RGPQPCGWDFSFIDLMVVVAMYKHLETGQTGPEDGEPIEHDCQQIAAQTYATQEDLLEVP
LANPDLNLYADGSSFAENGIQRAGYAIVTDVTVLERSAVVTGTQRETGHIHAFPHRSFRN
TVGQADIPRNNTLHPIKLTFNINHHNIRAIKTLEHQRISEKL

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_6|669_bp
atgagtgaatggatggatggaggatggatgatggatggacgtgtggccccttccttccgg
ttactagggccagaaacctgggcgctatccctgacgcctcttcctctcgcccccaacttc
ccatctgttgggccaccactgctgttgccctggtggttacctacccctgacactctagaa
agagggccccagccatgtggatgggatttctccttcattgatctgatggtggtcgtggca
atgtataagcacctggaaacaggtcagacaggtccagaggatggggaaccaattgagcat
gactgccaacaaattgcagcccagacttatgccacccaagaggatctcttggaagtcccc
ttagctaatcctgaccttaacctatatgctgatggaagttcatttgcggagaatgggata
caaagggcaggttatgccatagttactgatgtaacagtacttgaaagatctgcagtggtc
actgggacacaaagggaaacagggcacatccatgccttccctcacagatctttcagaaac
acagtggggcaggcagacatacctaggaacaatactttgcatccaatcaagttgacattc
aacattaaccatcacaacatccgagccattaaaactctcgagcaccaacgtatttcagag
aaactgtaa

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_7|95_aa
MAPLKMLALVTLLLGASLQHIHAARGTNVGRECCLEYFKGAIPLRKLKTWYQTSEDCSRD
AIVQLIAGQMIEPKFTFQYGHSDNEVDKTSELACL

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_7|288_bp
atggccccactgaagatgctggccctggtcaccctcctcctgggggcttctctgcagcac
atccacgcagctcgagggaccaatgtgggccgggagtgctgcctggagtacttcaaggga
gccattccccttagaaagctgaagacgtggtaccagacatctgaggactgctccagggat
gccatcgttcagctcattgcagggcagatgatagaacccaagtttactttccagtacggg
cacagcgataacgaggttgataaaaccagtgaactggcttgcctgtga

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_8|380_aa
MADFGISAGQFVAVVWDKSSPVEALKGLVDKLQALTGNEGRVSVENIKQLLQSAHKESSF
DIILSGLVPGSTTLHSAEILAEIARILRPGGCLFLKEPVETAVDNNSKVKTASKLCSALT
LSGLVEVKELQREPLTPEEVQSVREHLGHESDNLLFVQITGKKPNFEVGSSRQLKLSITK
KSSPSVKPAVDPAAAKLWTLSANDMEDDSMCIFCGCSLTHRWPLEHVVRLNMMINQKEDR
VDTFFTLDSKFPLEACSHFSFSLAETTTVSLIALNTLQDLIDSDELLDPEDLKKPDPASL
RAASCGEGKKRKACKNCTCGLAEELEKEKSREQMSSQPKSACGNCYLGDAFRCASCPYLG
MPAFKPGEKVLLSDSNLHDA

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_8|1143_bp
atggcagattttgggatctctgctggccagtttgtggcagtggtctgggataagtcatcc
ccagtggaggctctgaaaggtctggtggataagcttcaagcgttaaccggcaatgagggc
cgcgtgtctgtggaaaacatcaagcagctgttgcaatctgcccacaaagaatccagcttt
gacattattttgtcaggtttagtcccaggaagcaccactctgcacagtgctgagattttg
gctgaaatcgcccggatccttcggcctggtggatgtctttttctgaaggagccagtagag
acagctgtagataacaatagcaaagtgaagacagcatctaagctgtgttcagccctgact
ctttctggtcttgtggaagtgaaagagctgcagcgggagcccctaacccctgaggaagta
cagtctgttcgagaacaccttggtcatgaaagtgacaacctgctgtttgttcagatcaca
ggcaaaaaaccaaactttgaagtgggttcttctaggcagcttaagctttccatcaccaag
aagtcttctccttcagtgaaacctgctgtggaccctgctgctgccaagctgtggaccctc
tcagccaacgatatggaggacgacagcatgtgcatcttctgtggatgtagtttaactcac
cgttggcctcttgagcatgtggtcaggttgaacatgatgatcaaccaaaaggaggacagg
gtggacaccttctttaccctggactccaagtttcctctcgaagcctgcagtcactttagc
ttttcattagcagagaccacgactgtatcactcattgctttgaacactctccaggatctc
attgactcagatgagctgctggatccagaagatttgaagaagccagatccagcttccctg
cgggctgcttcttgtggggaagggaaaaagaggaaggcctgtaagaactgcacctgtggc
cttgccgaagaactggaaaaagagaagtcaagggaacagatgagctcccaacccaagtca
gcttgtggaaactgctacctgggcgatgccttccgctgtgccagctgcccctaccttggg
atgccagccttcaaacctggggaaaaggtgcttctgagtgatagcaatcttcatgatgcc
tag

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_9|596_aa
MAAAAVSGALGRAGWRLLQLRCLPVARCRQALVPRAFHASAVGLRSSDEQKQQPPNSFSQ
QHSETQGAEKPDPESSHSPPRYTDQGGEEEEDYESEEQLQHRILTAALEFVPAHGWTAEA
IAEGAQSLGLSSAAASMFGKDGSELILHFVTQCNTRLTRVLEEEQKLVQLGQAEKRKTDQ
FLRDAVETRLRMLIPYIEHWPRALSILMLPHNIPSSLSLLTSMVDDMWHYAGDQSTDFNW
YTRRAMLAAIYNTTELVMMQDSSPDFEDTWRFLENRVNDAMNMGHTAKQVKSTGEALVQG
LMGAAVTSPRSRRGGWWPLGEMPYANQPTVRITELTDENVKFIIENTDLAVANSIRRVFI
AEVPIIAIDWVQIDANSSVLHDEFIAHRLGLIPLISDDIVDKLQYSRDCTCEEFCPECSV
EFTLDVRCNEDQTRHVTSRDLISNSPRVIPVTSRNRDNDPNDYVEQDDILIVKLRKGQEL
RLRAYAKKGFGKEHAKWNPTAGVAFEYDPDNALRHTVYPKPEEWPKSEYSELDEDESQAP
YDPNGKPERFYYNVESCGSLRPETIVLSALSGLKKKLSDLQTQLSHEIQSDVLTIN

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_9|1791_bp
atggcggcggcggcggtatctggtgcgcttggccgggcgggctggaggctcctgcagctg
cgatgcctgcccgtggcccgttgccgacaagccctggtgccgcgtgccttccatgcttca
gctgtggggctaaggtcttcagatgagcagaagcagcagcctcccaactcattttctcag
cagcattctgagacacagggggcagaaaaacctgatccagagtcttctcattcacccccc
aggtatacagaccagggcggcgaggaggaggaggactatgaaagtgaggagcagttgcag
caccgcatcctgacggcagcccttgagtttgtgcccgcccacgggtggacagcagaggcg
attgcagaaggagcccagtctctgggtctctccagtgcagcagccagcatgttcgggaag
gatggcagtgagctaatactgcattttgtgacccagtgcaatacccggctcacacgtgtg
ctagaagaggagcagaagctggtacagttgggccaggcggagaagaggaagacagaccag
ttcctgagggatgcagtggaaaccagactgagaatgctgatcccatacattgagcactgg
ccccgggccctcagcatcctcatgctccctcacaacatcccgtccagcctgagcctgctc
accagcatggtggatgacatgtggcattacgctggggaccagtccactgattttaactgg
tacacccgccgagccatgctggctgccatctacaacacaacagagctggtgatgatgcag
gactcctctccagactttgaggacacttggcgcttcctggaaaaccgggttaatgatgca
atgaacatgggccacactgccaagcaggtaaagtccacaggagaggcactggtgcaagga
ctcatgggtgcagcagtgacgtcaccgcggagcagacgcggaggctggtggcccctgggc
gagatgccgtacgccaaccagcctaccgtgcggatcacggagctcactgacgagaatgtc
aagttcatcatcgagaacaccgacctggcggtggccaattcgattcggagggtcttcatc
gctgaggttcccataatagccattgactgggttcagattgatgccaattcctcagttctt
catgatgaattcattgctcacaggcttggattaattcccctcattagtgatgacattgtg
gacaagctgcagtactctcgggactgcacatgtgaggagttctgccccgagtgctcggtg
gagttcaccctcgatgtgcggtgcaatgaagaccagacgcgacatgtcacgtctcgagac
ctcatctccaacagcccccgggtcattccggtgacatcccggaaccgagataatgacccc
aatgactacgtggagcaggatgacatcctcatcgtcaagttgagaaagggccaggagctg
agacttcgagcctatgccaaaaagggctttggcaaggagcatgccaagtggaaccctact
gcaggggtggcttttgaatacgatccagacaatgccctgaggcacacagtgtaccccaag
cccgaggaatggccaaagagtgagtactcggagctggatgaggatgagtcgcaggctccc
tatgaccccaacggcaagccagaaaggttttactacaatgtggagtcctgtggctctctg
cgtcctgaaaccattgtcctgtcagccctctcaggattgaagaagaaactgagtgattta
caaactcaattaagccacgagatccagagtgatgtgctaaccataaattaa

>gi568815582f:57272569_57483029|GENSCAN_predicted_peptide_10|326_aa
MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET
DLLNRFILLKPKPSQGDSSEAKTPSQ

>gi568815582f:57272569_57483029|GENSCAN_predicted_CDS_10|981_bp
atggcgaccaatttcagtgacatcgtcaagcaaggctacgtgaagatgaagagcaggaag
ctcgggatctaccggaggtgctggctggtgttccggaaatcctccagcaaggggccccag
cggctggagaagtatccagatgagaagtcggtgtgcctccggggctgccccaaggtgact
gagatcagcaacgtcaagtgtgttacgcggctccccaaggagaccaagcggcaggcggtg
gccatcatattcactgatgactcggcacgtaccttcacctgcgactcagagctagaggca
gaggagtggtacaagacactatctgtggagtgtctggggtcccgcctcaacgacatcagt
ctgggagaacctgacctcctggccccaggggtgcagtgtgaacagacagatcgcttcaat
gtcttcctgctgccctgccccaacctggacgtgtatggcgagtgcaagctgcagatcacc
cacgagaacatctacctctgggacatccacaacccccgtgtgaagctcgtctcgtggccc
ctctgctcactgcgccgctatggccgggatgccacacgctttaccttcgaggctggccgg
atgtgtgatgctggggaaggactctataccttccagacacaagagggggagcagatttac
cagcgcgtccacagtgccaccctggccatcgcagagcagcacaagcgggtcctgctggaa
atggagaagaacgtgaggctgctgaacaagggcacggaacattactcgtatccctgcaca
cccacgaccatgctgccgcgcagtgcctactggcaccacatcactggttcccagaacatc
gccgaagcctccagctatgctggtgaggggtatggggcagcccaggccagctcggaaaca
gacctcctcaacagattcatcctgctaaagccaaagcccagccagggggacagcagtgag
gccaagaccccatcccagtga