GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:10:29 Sequence gi568815596f:74734581_74990938 : 256358 bp : 44.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 20051 20270 220 1 1 82 84 164 0.955 14.19 1.02 Term + 35145 35272 128 1 2 54 44 142 0.620 4.84 1.03 PlyA + 36523 36528 6 1.05 2.00 Prom + 41042 41081 40 0.34 2.01 Init + 45954 46100 147 2 0 79 94 100 0.429 7.89 2.02 Intr + 85246 85355 110 0 2 39 80 87 0.038 2.08 2.03 Intr + 99379 99657 279 1 0 99 16 118 0.184 2.29 2.04 Intr + 100000 100063 64 1 1 48 115 96 0.218 7.02 2.05 Intr + 119713 119875 163 0 1 134 84 191 0.913 22.85 2.06 Intr + 127415 127508 94 0 1 38 84 95 0.042 3.12 2.07 Intr + 130723 130880 158 1 2 89 -46 120 0.357 -1.65 2.08 Intr + 133056 133204 149 1 2 109 80 202 0.988 21.45 2.09 Intr + 137720 137839 120 1 0 88 94 116 0.999 12.89 2.10 Intr + 138696 138791 96 2 0 82 84 64 0.977 5.61 2.11 Intr + 139264 139363 100 0 1 92 92 205 0.999 20.98 2.12 Intr + 139686 139869 184 1 1 62 74 384 0.999 33.25 2.13 Intr + 142586 142741 156 2 0 103 84 242 0.999 24.43 2.14 Intr + 144108 144341 234 0 0 106 81 491 0.970 47.00 2.15 Intr + 145685 145989 305 2 2 10 86 527 0.869 40.93 2.16 Intr + 147131 147279 149 0 2 78 83 259 0.999 24.35 2.17 Intr + 147540 147659 120 2 0 128 77 185 0.746 22.09 2.18 Intr + 150914 151009 96 1 0 77 75 176 0.984 15.41 2.19 Intr + 151714 151813 100 0 1 104 87 168 0.999 17.98 2.20 Intr + 151910 152093 184 0 1 54 80 340 0.999 28.65 2.21 Intr + 153323 153478 156 2 0 95 78 189 0.999 17.73 2.22 Intr + 154665 154898 234 0 0 97 70 273 0.997 23.20 2.23 Term + 156217 156361 145 1 1 92 42 201 0.889 13.28 2.24 PlyA + 157723 157728 6 1.05 3.03 PlyA - 157897 157892 6 1.05 3.02 Term - 160554 160502 53 1 2 87 50 34 0.577 -2.91 3.01 Init - 163735 163666 70 1 1 105 47 65 0.497 5.31 3.00 Prom - 165507 165468 40 -2.76 4.12 PlyA - 166166 166161 6 1.05 4.11 Term - 172724 172576 149 0 2 89 44 100 0.276 3.76 4.10 Intr - 174713 174584 130 2 1 16 59 88 0.301 -1.03 4.09 Intr - 176723 176648 76 1 1 101 66 56 0.738 4.22 4.08 Intr - 180958 180823 136 2 1 99 100 108 0.575 12.73 4.07 Intr - 185675 185620 56 1 2 88 9 94 0.179 0.12 4.06 Intr - 188048 187975 74 2 2 51 70 41 0.023 -3.20 4.05 Intr - 192441 192359 83 1 2 76 94 46 0.232 3.36 4.04 Intr - 193676 193597 80 1 2 92 47 44 0.079 -0.11 4.03 Intr - 194159 194088 72 1 0 54 103 75 0.154 4.12 4.02 Intr - 199029 198837 193 1 1 130 7 50 0.317 -0.35 4.01 Init - 200166 200085 82 0 1 75 53 88 0.619 5.13 4.00 Prom - 201945 201906 40 -2.66 5.00 Prom + 216174 216213 40 -0.46 5.01 Init + 224100 224312 213 2 0 102 62 126 0.810 10.28 5.02 Intr + 224761 224845 85 0 1 119 15 70 0.827 2.19 5.03 Intr + 225525 225566 42 1 0 96 113 46 0.819 6.11 5.04 Term + 235049 235143 95 0 2 32 43 136 0.244 1.49 5.05 PlyA + 235354 235359 6 1.05 6.02 PlyA - 235405 235400 6 1.05 6.01 Sngl - 238865 236580 2286 2 0 43 41 794 0.942 63.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:74734581_74990938|GENSCAN_predicted_peptide_1|115_aa MRANPTSFYIEEPNGAFYIADDGPARDVHEFHAHLCALSLRAGPAQDLGDPGQLDRLHTA GVHDGGAATMWRRVTSQVHQFTHLLKAEDDDADEEDDEDEDDGTHLLRVTEGELN >gi568815596f:74734581_74990938|GENSCAN_predicted_CDS_1|348_bp atgcgggcgaacccaacgtccttttacattgaagaacccaacggggccttttacattgcg gatgatggaccggctcgtgacgtccatgaatttcacgcgcacctgtgtgcactgtccctg agagccggtcctgcccaggaccttggtgaccctggccagcttgataggctgcacacggct ggtgttcatgatggcggcgcagcgacgatgtggcggagagttaccagccaggtgcatcag tttactcatctgttaaaagcagaggatgatgatgctgatgaagaggatgatgaagatgaa gatgatggtacccatctcctcagggttacagaaggtgaattgaattga >gi568815596f:74734581_74990938|GENSCAN_predicted_peptide_2|1180_aa MAMTSRLAGLQVAPAGRLSQLVMREVHTVGQSASRRFKQLGSDENPGRQLSANYYKTGPD TFKEHWSEMSRHHILLSVKPHNHPVRLPAPVSERPRPRAVSDDWLRHGGGRSVGAHTLPA QPMGVRTSLIRRPAGRQPLNKPHCCMKLRRRSPGLPLATSCHPAKKIRGPEPRACESERM IASHLLAYFFTELNHDQVQKVDQYLYHMRLSDETLLEISKRFRKEMEKGLGATTHPTAAV KMLPTFVRSTPDGTARLKPSREQGTMMLMVTVMMMMIKIVDIVSASPSENPAHHPFLVEL WPLSPAGLASDDDPPGAASPPRSPVVTDSADRVYPGLEKHGEFLALDLGGTNFRVLWVKV TDNGLQKVEMENQIYAIPEDIMRGSGTQLFDHIAECLANFMDKLQIKDKKLPLGFTFSFP CHQTKLDESFLVSWTKGFKSSGVEGRDVVALIRKAIQRRGDFDIDIVAVVNDTVGTMMTC GYDDHNCEIGLIVGTGSNACYMEEMRHIDMVEGDEGRMCINMEWGAFGDDGSLNDIRTEF DQEIDMGSLNPGKQLFEKMISGMYMGELVRLILVKMAKEELLFGGKLSPELLNTGRFETK DISDIEGEKDGIRKAREVLMRLGLDPTQEDCVATHRICQIVSTRSASLCAATLAAVLQRI KENKGEERLRSTIGVDGSVYKKHPHFAKRLHKTVRRLVPGCDVRFLRSEDGSGKGAAMVT AVAYRLADQHRARQKTLEHLQLSHDQLLEVKRRMKVEMERGLSKETHASAPVKMLPTYVC ATPDGTEKGDFLALDLGGTNFRVLLVRVRNGKWGGVEMHNKIYAIPQEVMHGTGDELFDH IVQCIADFLEYMGMKGVSLPLGFTFSFPCQQNSLDESILLKWTKGFKASGCEGEDVVTLL KEAIHRREEFDLDVVAVVNDTVGTMMTCGFEDPHCEVGLIVGTGSNACYMEEMRNVELVE GEEGRMCVNMEWGAFGDNGCLDDFRTEFDVAVDELSLNPGKQRFEKMISGMYLGEIVRNI LIDFTKRGLLFRGRISERLKTRGIFETKFLSQIESDCLALLQVRAILQHLGLESTCDDSI IVKEVCTVVARRAAQLCGAGMAAVVDRIRENRGLDALKVTVGVDGTLYKLHPHFAKVMHE TVKDLAPKCDVSFLQSEDGSGKGAALITAVACRIREAGQR >gi568815596f:74734581_74990938|GENSCAN_predicted_CDS_2|3543_bp atggccatgaccagcagactggctggattgcaggtggccccagcagggagactctcacag ttggtgatgagagaagttcacactgtaggtcagtcagcatcaagaaggttcaagcagctg ggaagtgatgaaaaccctggtaggcagctcagtgccaactactataaaacaggtcctgat acttttaaggaacactggtctgagatgtctagacaccacatcctgctctcagttaagccc cacaaccatcctgtgaggctgccggctccggtgtctgagcggccgcgcccgcgagccgtg agcgatgattggctgcgccacggcggcgggcggtccgtgggcgcacacaccctccccgcg cagccaatgggcgtgcgcacgtcactgatccggaggcccgcgggccggcagcccctcaat aagccacattgttgcatgaaactccggcgcaggagtcccgggctgccgctggcaacatcg tgtcacccagctaagaaaatccgcgggcccgagccacgcgcctgtgaatcggagaggatg attgcctcgcatctgcttgcctacttcttcacggagctcaaccatgaccaagtgcagaag gttgaccagtatctctaccacatgcgcctctctgatgagaccctcttggagatctctaag cggttccgcaaggagatggagaaagggcttggagccaccactcaccctactgcagcagtg aagatgctgcccacctttgtgaggtccactccagatgggacagccagacttaaaccctct agagagcagggaaccatgatgctgatggtgacggtgatgatgatgatgataaaaattgta gacatcgttagtgccagtccatcagaaaatcctgcacaccacccgttcttggtggagctg tggcctctgagtccagctgggctggcctcggatgatgacccaccaggggcagcctcgcca cctcggtcccctgtggtcactgattcagcggatcgcgtgtatcctggacttgagaaacac ggagagttcctggctctggatcttggagggaccaacttccgtgtgctttgggtgaaagta acggacaatgggctccagaaggtggagatggagaatcagatctatgccatccctgaggac atcatgcgaggcagtggcacccagctgtttgaccacattgccgaatgcctggctaacttc atggataagctacaaatcaaagacaagaagctcccactgggttttaccttctcgttcccc tgccaccagactaaactagacgagagtttcctggtctcatggaccaagggattcaagtcc agtggagtggaaggcagagacgttgtggctctgatccggaaggccatccagaggagaggg gactttgatatcgacattgtggctgtggtgaatgacacagttgggaccatgatgacctgt ggttatgatgaccacaactgtgagattggtctcattgtgggcacgggcagcaacgcctgc tacatggaagagatgcgccacatcgacatggtggaaggcgatgaggggcggatgtgtatc aatatggagtggggggccttcggggacgatggctcgctcaacgacattcgcactgagttt gaccaggagattgacatgggctcactgaacccgggaaagcaactgtttgagaagatgatc agtgggatgtacatgggggagctggtgaggcttatcctggtgaagatggccaaggaggag ctgctctttggggggaagctcagcccagagcttctcaacaccggtcgctttgagaccaaa gacatctcagacattgaaggggagaaggatggcatccggaaggcccgtgaggtcctgatg cggttgggcctggacccgactcaggaggactgcgtggccactcaccggatctgccagatc gtgtccacacgctccgccagcctgtgcgcagccaccctggccgccgtgctgcagcgcatc aaggagaacaaaggcgaggagcggctgcgctctactattggggtcgacggttccgtctac aagaaacacccccattttgccaagcgtctacataagaccgtgcggcggctggtgcccggc tgcgatgtccgcttcctccgctccgaggatggcagtggcaaaggtgcagccatggtgaca gcagtggcttaccggctggccgatcaacaccgtgcccgccagaagacattagagcatctg cagctgagccatgaccagctgctggaggtcaagaggaggatgaaggtagaaatggagcga ggtctgagcaaggagactcatgccagtgcccccgtcaagatgctgcccacctacgtgtgt gctaccccggacggcacagagaaaggggacttcttggccttggaccttggaggaacaaat ttccgggtcctgctggtccgtgttcggaatgggaagtggggtggagtggagatgcacaac aagatctacgccatcccgcaggaggtcatgcacggcaccggggacgagctctttgaccac attgtccagtgcatcgcggacttcctcgagtacatgggcatgaagggcgtgtccctgcct ctgggttttaccttctccttcccctgccagcagaacagcctggacgagagcatcctcctc aagtggacaaaaggcttcaaggcatctggctgcgagggcgaggacgtggtgaccctgctg aaggaagcgatccaccggcgagaggagtttgacctggatgtggttgctgtggtgaacgac acagtcggaactatgatgacctgtggctttgaagaccctcactgtgaagttggcctcatt gttggcacgggcagcaatgcctgctacatggaggagatgcgcaacgtggaactggtggaa ggagaagaggggcggatgtgtgtgaacatggaatggggggccttcggggacaatggatgc ctagatgacttccgcacagaatttgatgtggctgtggatgagctttcactcaaccccggc aagcagaggttcgagaaaatgatcagtggaatgtacctgggtgagattgtccgtaacatt ctcatcgatttcaccaagcgtggactactcttccgaggccgcatctcagagcggctcaag acaaggggcatctttgaaaccaagttcttgtctcagattgagagtgactgcctggccctg ctgcaagtccgagccatcctgcaacacttagggcttgagagcacctgtgacgacagcatc attgttaaggaggtgtgcactgtggtggcccggcgggcagcccagctctgtggcgcaggc atggccgctgtggtggacaggatacgagaaaaccgtgggctggacgctctcaaagtgaca gtgggtgtggatgggaccctctacaagctacatcctcactttgccaaagtcatgcatgag acagtgaaggacctggctccgaaatgtgatgtgtctttcctgcagtcagaggatggcagc gggaagggggcggcgctcatcactgctgtggcctgccgcatccgtgaggctggacagcga tag >gi568815596f:74734581_74990938|GENSCAN_predicted_peptide_3|40_aa MATYPDPRIQLLKTKDISLQQDVYPSSNTVNYEICDMRYG >gi568815596f:74734581_74990938|GENSCAN_predicted_CDS_3|123_bp atggccacctatccagaccccagaatacagctgctgaaaaccaaagatataagcttgcag caagatgtgtacccctcctccaacactgtgaattatgagatatgtgacatgagatatggg tag >gi568815596f:74734581_74990938|GENSCAN_predicted_peptide_4|376_aa MQLEHQQKMLGLSDSGTNYASTAAPEKGPRPKEQPYCGTFSSHGRQQKIKRDEPKYALSH NLVLRTGTLSLPPMFHGSKQQLRPSPTSVEQGCFGEQTCQRSLPTANISQHQPLDMVMTS NSIRREERVVMKHAPLVLLDSEEPHMEPAGHRAVWFAKFQPQNHRVEYRQLRTPASAGIP EDRFSVIVNHTTWSSCHSRQQTADSLTSKHRGLVMGSHKKWQSKSTARVVGILLETAGTC SRWNRKASAAFQCQPVSERDRFVGFPAMLTPESLDHIPQSQDFQLCRPGTAKCQPAQWRV RVVAPALLAPEFSFGVQEESAHTNRLKVGKKYQEWGIAEKIPENVEVTLEPGNRQRLEQF EGLRRRQGKVEKFGTS >gi568815596f:74734581_74990938|GENSCAN_predicted_CDS_4|1131_bp atgcagcttgagcatcagcagaagatgctgggtctttctgactccggaaccaactatgcc tctactgctgcccctgaaaagggtcccaggcctaaagagcaaccatactgtgggacattc tcttctcatggcagacagcagaagatcaagagggacgaaccaaaatatgccttgtctcat aaccttgtgcttagaactggcactttgtcgcttccacccatgttccacggatcaaagcaa caattaaggccaagccccacatcagtggagcaggggtgtttcggggaacagacctgtcag aggtccttgccaacagccaacatcagtcagcatcaaccactagacatggtgatgacatca aactcaattcggagagaggaaagggtagttatgaagcatgctccactggtgctgctggat agtgaggagccccacatggagccagctggccacagagcagtgtggttcgccaagttccag ccccagaatcacagggtagagtacagacagttgaggacaccagcctctgcagggatccct gaagacagattctcagtcattgtcaaccacaccacctggtccagctgtcacagccgccag cagacagccgactccctcaccagtaaacaccgcggcttagtgatgggcagccacaagaag tggcagagcaagagcacagccagagttgtcgggatcctgctggaaacagcaggcacatgc tcccgttggaacaggaaggcatcagctgcttttcagtgccagccagtgagtgaaagggat aggtttgtgggattccctgcgatgctgacacctgagagcctggaccacatcccacagtcc caggattttcagctctgccgtccaggtacggccaagtgccaaccagctcagtggagagtc agggtggtagcccctgccctcttggcacctgagttctcgtttggcgtccaggaagaatca gctcacacgaaccgtttgaaagtgggtaagaaataccaggagtggggtattgctgaaaag atacctgaaaatgtggaagtgactttggaaccgggtaacaggcagagattggaacagttt gaagggctcagaagaagacaaggaaaagtggaaaagtttggaacttcctag >gi568815596f:74734581_74990938|GENSCAN_predicted_peptide_5|144_aa MAAAAAAGSGTPREEEGPAGEAAASQPQAPTSVPGARLSRLPLARVKALVKADPDVTLAG QEAIFILARAAELFVETIAKDAYCCAQQGKRKTLQRRDLDNAIEAVDEFAFLEAWMQLLL LRAEMKKVFCISGFLNDEDQNKGF >gi568815596f:74734581_74990938|GENSCAN_predicted_CDS_5|435_bp atggcggcggcggcggcggcaggaagcgggacgccccgagaggaggagggacctgctggg gaggcagcggcctcgcagccccaggccccaacgagtgtgcctggggctcgtctctcgagg ttgcctctggcgcgagtgaaggccttggtgaaggcagatcccgacgtgacgctagcggga caggaagccatcttcattctggcacgagccgcggaactgtttgtggagaccattgcaaaa gatgcctactgttgcgctcagcagggaaaaaggaaaacccttcagaggagagacttggat aatgcaatagaagctgtggatgaatttgcttttctggaagcctggatgcagctgctgctg cttagagcagagatgaagaaagtgttctgcataagtggcttcctgaatgatgaggaccag aataaaggtttttga >gi568815596f:74734581_74990938|GENSCAN_predicted_peptide_6|761_aa MKRELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLWDIFK AVCRGKFIALNAHKRKQERSKTDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIET QKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIR EYYKHLYANKLEDLEEMDKFLNTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPG PDGFTAEFNQRYKEELVPFLLKLFQSIEKEGILPNSLDEASIILIPKPGRDTTKKENFRP ISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHITRTKDK NHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANSILNGQKLEAFP LKTGTRQGCPLSPLLFNIVLEVLARAMRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPI VSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLRIQ LTRDVKDLFKENYKPLVKEIKEDTNKWKNIPCSWVGRISIVKMAIPPKVIYRFNAIPIKL PMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQ NRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPF LTPYTKINSRWIKDLNIRPKTTKTLEENLGITILHGQGLHV >gi568815596f:74734581_74990938|GENSCAN_predicted_CDS_6|2286_bp atgaaacgagaactcaggattaagaatctcactcaaaaccgctcaactacatggaaactg aacaacctgctcctgaatgactactgggtacataacgaaatgaaggcagaaataaagatg ttctttgaaaccaacgagaacaaagacacaacataccagaatctctgggacatattcaaa gcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaagatcc aaaactgacaccctaacatcacaattaaaagaactagaaaagcaagagcaaacacattca aaagctagcagaaggcaagaaataactaagatcagagcagaactgaaggaaatagagaca caaaaaacccttcaaaaaattaacgaatccaggagctggttttttgaaaggatcaacaaa attgatagaccgctagcaagactaataaagaaaaagagagagaagaatcaaatagacgca ataaaaaatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatcaga gaatactacaaacacctctacgcaaataaactagaagatctagaagaaatggataaattc ctcaacacatacactctcccaagactaaaccaggaagaagttgaatctctgaatagacca ataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtccagga ccagatggattcacagctgaattcaaccagaggtacaaggaggaactggtaccattcctt ctgaaactattccaatcaatagaaaaagagggaatcctccctaactcacttgatgaggcc agcatcatcctgataccaaagccaggcagagacacaaccaaaaaagagaattttagacca atatccctgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccag cagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggc tggttcaatatacgcaaatcaataaatgtaatccagcatataaccagaaccaaagacaaa aaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttc atgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaatcataagagct atctatgacaaacccacagccaatagcatactgaatgggcaaaaactggaagcattccct ttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacatagtgttg gaagttctggccagggcaatgaggcaggagaaggaaataaagggtattcagctaggaaaa gaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccatt gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaagaatccaa cttacaagggatgtgaaagacctcttcaaggagaactacaaaccactggtcaaggaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcagtatt gtgaaaatggccataccgcccaaggtaatttacagattcaatgccatccccatcaagcta ccaatgactttctttacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgcta cctgacttcaaactatactataaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagacttaaacattagacctaaa accacaaaaaccctagaagaaaacctaggcattaccattctgcatgggcaaggacttcat gtctaa