GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:20:40 Sequence gi568815581f:66944661_67156268 : 211608 bp : 45.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13668 13763 96 2 0 77 41 97 0.494 4.21 1.02 Term + 16161 16373 213 2 0 99 47 134 0.609 7.63 1.03 PlyA + 16743 16748 6 -0.45 2.00 Prom + 16814 16853 40 -6.86 2.01 Init + 20252 20471 220 1 1 107 102 509 0.966 50.99 2.02 Term + 22978 23111 134 2 2 74 49 55 0.434 -1.55 2.03 PlyA + 26246 26251 6 1.05 3.06 PlyA - 27186 27181 6 1.05 3.05 Term - 27795 27709 87 0 0 62 48 123 0.127 3.36 3.04 Intr - 32742 32634 109 2 1 46 69 78 0.114 1.99 3.03 Intr - 49952 49775 178 0 1 56 72 80 0.147 2.18 3.02 Intr - 53256 53191 66 1 0 123 64 16 0.523 1.58 3.01 Init - 54062 54014 49 2 1 56 80 57 0.622 2.88 3.00 Prom - 54799 54760 40 -4.86 4.00 Prom + 54933 54972 40 -4.26 4.01 Init + 57843 57909 67 2 1 63 89 83 0.949 7.21 4.02 Intr + 71700 71804 105 1 0 84 49 71 0.781 2.99 4.03 Intr + 73529 73612 84 0 0 119 75 169 0.970 18.49 4.04 Intr + 76992 77123 132 1 0 77 84 18 0.365 0.92 4.05 Intr + 79543 79569 27 2 0 72 75 77 0.873 2.89 4.06 Intr + 80200 80340 141 2 0 77 99 285 0.999 28.82 4.07 Term + 85806 86344 539 1 2 140 52 685 0.848 64.51 4.08 PlyA + 87317 87322 6 1.05 5.04 PlyA - 87753 87748 6 -0.45 5.03 Term - 88152 87847 306 0 0 -15 53 268 0.703 8.12 5.02 Intr - 88401 88365 37 2 1 127 75 -31 0.330 -2.24 5.01 Init - 89858 89740 119 2 2 83 78 81 0.305 6.27 5.00 Prom - 90883 90844 40 -2.36 6.00 Prom + 91008 91047 40 -6.16 6.01 Init + 94266 94303 38 2 2 90 78 -3 0.310 -1.45 6.02 Intr + 94948 95033 86 1 2 65 49 99 0.359 3.16 6.03 Intr + 95829 95926 98 1 2 105 37 41 0.270 0.43 6.04 Intr + 99950 100229 280 1 1 66 49 472 0.394 38.15 6.05 Intr + 104678 104812 135 0 0 73 81 21 0.243 0.44 6.06 Intr + 109336 109695 360 2 0 136 48 233 0.963 19.29 6.07 Intr + 110443 110580 138 2 0 84 74 186 0.999 17.24 6.08 Term + 111385 111611 227 2 2 101 40 340 0.999 27.44 6.09 PlyA + 112118 112123 6 1.05 7.03 PlyA - 114262 114257 6 1.05 7.02 Term - 123135 123085 51 0 0 83 43 59 0.296 -1.67 7.01 Init - 128047 127940 108 1 0 78 96 62 0.804 5.27 7.00 Prom - 129466 129427 40 -4.06 8.17 PlyA - 133112 133107 6 1.05 8.16 Term - 133926 133592 335 1 2 101 37 298 0.998 20.77 8.15 Intr - 152018 151762 257 1 2 11 80 140 0.029 2.59 8.14 Intr - 164066 163832 235 0 1 66 61 66 0.131 -1.55 8.13 Intr - 164672 164456 217 2 1 78 97 74 0.587 5.38 8.12 Intr - 164849 164745 105 2 0 89 66 58 0.624 4.11 8.11 Intr - 169743 169664 80 1 2 42 85 94 0.968 3.77 8.10 Intr - 175952 175745 208 2 1 60 115 55 0.318 4.05 8.09 Intr - 178500 178310 191 1 2 90 95 81 0.965 8.30 8.08 Intr - 179354 179303 52 2 1 70 111 33 0.979 2.28 8.07 Intr - 184195 183991 205 0 1 89 98 124 0.994 12.70 8.06 Intr - 191538 191310 229 1 1 76 91 60 0.875 2.03 8.05 Intr - 193454 193271 184 2 1 87 115 138 0.999 15.76 8.04 Intr - 201230 201083 148 1 1 96 67 60 0.967 4.94 8.03 Intr - 204054 203909 146 0 2 106 58 137 0.976 11.58 8.02 Intr - 205325 205207 119 0 2 69 111 49 0.719 5.48 8.01 Intr - 206564 206386 179 1 2 100 72 54 0.529 4.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_1|102_aa MLINRASSRTDSSKRGRLQGTVHTGIRFANTQDQVQLPVGVRKQKGKLRSSRWGPPELIG NNNDTAQQYLTMAKRLFSVLGKTLALKSLRFELRSDPTILQV >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_1|309_bp atgctcatcaaccgggctagttccagaacagattctagcaagagaggccgcctgcagggc acagtccatactgggatccggtttgcaaacacacaggaccaagtgcagctgccagtgggt gtccgcaaacagaaaggcaaactgcggagctcccgttggggtccccctgaattaataggt aataataatgacacagcccagcagtacttaacaatggcaaagaggctgtttagcgtcctt ggaaaaactctggccttgaagtcactgagatttgagctaagatctgaccccaccatcctc caggtgtga >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_2|117_aa MVRCDRGLQMLLTTAGAFAAFSLMAIAIGTDYWLYSSAHICNGTNLTMDDGPPPRRARGD LTHSGLWRVCCIEEKYSKALPPKQNEQKKMKKSNLWAETKETLQQAANKTPRIKAEL >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_2|354_bp atggtgcgatgcgaccgcgggctgcagatgctgctgaccacggccggagccttcgccgcc ttctcgctcatggccatcgccatcggcaccgactactggctgtactccagcgcgcacatc tgcaacggcaccaacctgaccatggacgacgggcccccgccccgccgcgcccgcggcgac ctcacccactctggtctgtggcgggtgtgctgcatcgaagaaaagtattccaaagcatta cctccaaagcaaaatgaacagaagaaaatgaaaaaatcaaacctctgggctgaaactaaa gaaacactgcagcaggcagctaataagactccgaggataaaggcagaactctga >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_3|162_aa MNGQPQSVPKAQGLALGVCSFSSVLNFHSHAPVGDLCTGHAFYLHDTQLPGSVVTRTGPL GGQGGILADSSMGCLLNPSVPHLPALNSQDNDKIYLPGLSSDVIFDSLLRTKLLESQKVG RAGSVGRAGSMGLQLEKEERYWFQCGEARYAAKHLQHRGQLL >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_3|489_bp atgaatgggcagccccagagcgtgcccaaggcccagggtctggccctgggggtgtgcagc ttctcctctgtcctcaatttccattctcatgctcctgttggtgacctctgcacagggcat gctttctacttgcatgacacccagctcccaggctccgtggtcaccaggactgggcccctg gggggccagggtggcattctggctgacagcagcatgggctgcttgcttaatccctctgtg ccgcaccttcctgctctgaatagccaggataatgacaagatctatctcccaggacttagt tcagatgtcatcttcgactcccttctgcgcacgaagctcctggagagtcagaaagtgggg cgggctggatccgtggggcgggctggatccatggggcttcagctggagaaggaggagcgc tactggttccagtgcggggaggccaggtatgctgctaaacacctacagcacagaggacag ctcctgtga >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_4|364_aa MHVCSLILAWLPESTGDLADLAGPMDLQESGLSLNTPAPWRQQLKGPNGGKSLGTAAGIY KGHCFRINHFPEDNDYDHDSSEYLLRLPAHLALGLVPLSKSAFPVRWITSVPGRRGLVSG LLGVHSVPGGADDDDDDQGIVRASSVFPILSTILLLLGGLCIGAGRIYSRKNNIVLSAGI LFVAAGLSNIIGIIVYISSNTGDPSDKRDEDKKNHYNYGWSFYFGALSFIVAETVGVLAV NIYIEKNKELRFKTKREFLKASSSSPYARMPSYRYRRRRSRSSSRSTEASPSRDVSPMGL KITGAIPMGELSMYTLSREPLKVTTAASYSPDQEASFLQVHDFFQQDLKEGFHVSMLNRR TTPV >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_4|1095_bp atgcatgtgtgctccctcatcctggcctggttgccagaaagcactggtgacctggcagac ctggcagggcccatggaccttcaggaatcagggttgtcactgaacactccagctccttgg aggcagcagctgaaggggcccaatggaggaaagagcttggggacggcagctgggatctat aaagggcactgcttccggatcaatcacttcccagaggacaatgactacgaccacgacagc tcggagtacctcctccggcttcctgcccaccttgccctgggcctggtgcccctctccaag tctgcatttccagttaggtggataacttctgtgccaggccgcagggggctggtgtcaggg ctgctgggagtccacagtgtgcccgggggggctgatgatgatgatgatgatcaaggcatc gtgcgagcctccagcgtcttccccatcctcagcaccatcctgctcctgctgggtggcctg tgcatcggtgctggcaggatctacagccgcaagaacaacatcgtcctcagtgccggcatc ctcttcgtggctgcaggcctcagtaacatcatcggtatcatcgtctacatttccagcaac acaggtgacccgagtgacaagcgggacgaagacaaaaagaaccattacaactacggctgg tctttttactttggagctctgtctttcattgtggctgagaccgtgggcgtcctggctgta aacatttacattgagaaaaataaagagttgaggtttaagaccaaacgggaattccttaag gcgtcttcctcttctccttatgccaggatgccgagctacaggtaccggcgacggcgctcg aggtccagctcaaggtccaccgaggcctcgccctccagggacgtgtcgcccatgggcctg aagatcacaggggccatccccatgggggagctgtccatgtacacgctgtccagggagccc ctcaaggtgaccaccgcagccagctacagccccgaccaggaggccagcttcctgcaggtg catgactttttccagcaggacctgaaggaaggtttccacgtcagcatgctgaaccgacgg acgacccctgtgtga >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_5|153_aa MEESGALETVVREENDGPEDYTGTVRGDDKVKVGDKSQWRWALLTLPGTGQEGWPRAGEG LELWCFSIRCWAHLDMPSDDANPDGSEGSGAQHTGALAGLISDPQHTATVSWGRYRWMRP FLGAKEDGMDDDIITRPNGPGSEPQGAESTHGL >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_5|462_bp atggaggaatcgggagcattggagactgtggtcagagaggagaatgatggccctgaagat tacactgggacagttaggggtgatgacaaggtcaaggttggagataaaagtcaatggagg tgggccctgcttactctgcctgggactgggcaagagggctggcctagagcaggtgaaggc cttgagctctggtgcttcagcatccgctgctgggcacacttggacatgccatcagatgat gccaaccctgatgggagcgaaggctctggagcacagcacacaggggccctggcaggcctc atttcagatccccagcacacggcgacagtcagctggggccgctaccggtggatgcgtcct ttcctgggggcaaaggaagatgggatggacgatgacatcatcactaggccgaacggacct gggtctgaaccccagggggccgagtccacccatgggctgtga >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_6|453_aa MHKVSPAREAHLRKSSSLSFDSDLLNKMFPAAAQAMGTNTAGVYRAHFENPSCGLKSPEG PMSCLELLFSTPLHETQPPDPAQGTHASATTMSQTKMLKVRVTLFCILAGIVLAMTAVVT DHWAVLSPHMEHHNTTCEAAHFGLWRICTKRIPMDDSKTCGPITLPGDKENQPLRVEIFD KVTQVMSRAARTHTQTLTFTVLVNTLTTLQQPKKNCSYFRHFNPGESSEIFEFTTQKGEP GWKGVWGDKRGLLCCAPLGQKVIGKSTDFLTPDEKKPVVHLNNSLVSSPLRRPQAAAFHL QGPELPHLGARERLQKQHLLVGPYTRAAVVTGEYSISAAAIAIFSLGFIILGSLCVLLSL GKKRDYLLRPASMFYAFAGLCILVSVEVMRQSVKRMIDSEDTVWIEYYYSWSFACACAAF ILLFLGGLALLLFSLPRMPRNPWESCMDAEPEH >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_6|1362_bp atgcacaaagtatcgccagccagggaagctcatctgagaaaatcatcatcattatctttt gactctgacctgctcaataagatgtttcctgctgctgcccaagccatgggcaccaacact gcaggggtgtacagggcgcactttgagaaccccagctgcgggctcaagtctcccgagggg cccatgtcctgcctggagctcctattctccacacctctccacgagacgcagccgccggac cctgcccagggcacccacgcctcggcgaccaccatgtcccagaccaaaatgctgaaggtc cgcgtgaccctcttctgcatcctggcaggcatcgtgctggccatgacagccgtggtaacc gaccactgggctgtgctgagcccccacatggagcaccacaacactacctgcgaggcggcc cacttcggcctctggcggatttgtaccaagcgcatccccatggacgacagcaagacctgc gggcccatcaccctgcccggggataaagaaaatcaaccactgagagttgagatatttgac aaggtaacacaggtgatgagcagagcagccaggactcacactcagaccctaacattcacc gttttagtcaatacattaactactttacaacaacccaagaagaactgttcctacttcagg cattttaaccccggcgagagctcggagatcttcgaattcaccactcagaagggtgagcct gggtggaaaggggtctggggtgacaagaggggacttctgtgttgtgcaccactgggccag aaagtcattggcaaaagcactgacttcctcacgcctgatgagaagaagcctgtggtgcat ttgaacaacagccttgtcagctccccgctccggaggccccaggcagccgccttccatctc cagggcccggagcttccacacctcggggccagggagcgtttgcagaagcagcacctcctg gtaggcccctacacaagggcagctgttgtgacgggagagtacagcatctcggcagccgcc atcgccatcttcagccttggcttcatcatcctgggcagcctctgtgtcctcctgtccctc gggaagaagagggactatctgctgcgacccgcgtccatgttctatgcctttgcaggtctc tgcatcctcgtctcggtggaggtcatgcggcagtcggtgaagcgcatgattgacagtgag gacaccgtctggatcgagtactattactcctggtcctttgcctgcgcctgtgccgccttc atcctcctctttctcggcggtctcgccctcctgctgttctccctgcctcgaatgccccgg aacccatgggagtcctgcatggatgctgagcccgagcactaa >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_7|52_aa MERQALWIGHHPSSLSPLLGSALLFRSRVKELFGFRAKDTREPKWKVRKSQQ >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_7|159_bp atggagcgacaggccctttggattggccaccacccctcctcactctcacccctgctgggg tctgcactgctcttccgcagtagggtgaaggagctctttggttttagggcaaaagataca cgggagcctaaatggaaggtccggaagtcacagcagtag >gi568815581f:66944661_67156268|GENSCAN_predicted_peptide_8|963_aa XVYFRNRWVKTVHPVVHQYCLISSAHSTFQMPQKEDILKHRVVVVTLNTSQYLCQLDLEP GFFTHILLDEAAQAMECETIMPLALATQNTRIVLAGDHMQLSPFVYSEFARERNLHVSLL DRLYEHYPAEFPCRILLCENYRSHEAIINYTSELFYEGKLMASGKQPAHKDFYPLTFFTA RGEDVQEKNSTAFYNNAEVFEVVERVEELRRKWPVAWGKLDDGSIGVVTPYADQVFRIRA ELRKKRLSDVNVERVLNVQGKQFRVLFLSTVRTRHTCKHKQTPIKKKEQLLEDSTEDLDY GFLSNYKLLNTAITRAQSLVAVVGDPIALCSIGRCRKFWERFIALCHENSSLHGITFEQI KAQLEALELKKTYVLNPLAPEFIPRALRLQHSGSTNKQQQSPPKGKSLHHTQNDHFQNDG IVQPNPSVLIGNPIRAYTPPPPLGPHPNLGKSPSPVQRIDPHTGTSILYVPAVYGGNVVM SVPLPVPWTGYQGRFAVDPRIITHQAAMAYNMNLLQTHGRGSPIPYGLGHHPPVTIGQPQ NQHQEKDQHEQNRNGKSDTNNSGPEINKIRTPEKKPTEPKQLPRPPFPIPQQHTLLNQQQ NNLPEQPNQIPPQPNQAGPNNAFFNSAVAHRPQSPPAEAVIPEQQPPPMLQEGHSPLRAI AQPGPILPSHLNSFIDENPSGLPIGEALDRIHGSVALETLRQQQARFQQWSEHHAFLSQG SAPYPHHHHPHLQHLPQPPLGLHQPPVRADWKLTSSAEDEVETTYSRLEIDEEAGEEKLE ASRGWFMRFKERSHPHNIKVQGETASANIEAAASYPEDLAKIIDEDGYTKLQIFCVDKTA SYWKKMLSRTFSAPPKTVKPPEDQLKSENLEVSSSFNYSVLQHLGQFPPLMPNKQIAESA NSSSPQSSAGGKPAMSYASALRAPPKPRPPPEQAKKSSDPLSLFQELSLGSSSGSNGFYS YFK >gi568815581f:66944661_67156268|GENSCAN_predicted_CDS_8|2892_bp nnggtatatttcagaaatcgctgggtaaagactgtccacccagttgtgcatcagtactgt ttgatctcaagcgcacattccacctttcagatgccccagaaagaagatattcttaaacat cgagtggtggttgttaccttgaatacttcccagtacctctgtcagttggaccttgaacct gggttttttacacacattctattagatgaagctgcccaggccatggagtgtgaaaccatt atgcctctagcattagcaactcaaaacactcggattgtcttggctggtgatcacatgcag ctcagtccttttgtttacagcgagtttgccagggagagaaaccttcacgtttcattactt gaccgactctatgagcattaccctgctgagttcccatgtaggattctcctgtgtgagaac taccgctcccatgaagctatcatcaattatacctctgagcttttctatgagggcaaactg atggccagtgggaagcagccagcacacaaagatttctacccactaactttctttacagca cgaggagaagatgtacaagaaaaaaatagcacagctttttataataatgcagaggtgttt gaagtggtggaacgtgtagaagagttaagaaggaagtggccagtagcgtgggggaagtta gatgatggcagtattggtgtggtgactccatatgctgatcaagtgtttagaatacgtgct gaacttcgaaaaaagagattatctgatgttaatgtagaaagggtgctaaatgttcaagga aagcaattcagagttttgtttcttagcacagtacgtacaagacatacttgtaaacataaa cagacaccaattaaaaagaaagagcaacttctggaagattccacagaggacttagattat ggttttttatctaactacaagcttctcaatactgccatcacaagagcacaatccctggtt gctgtggtgggtgatcccattgctctgtgctctattggaagatgcaggaaattttgggaa cggtttattgccctgtgtcatgaaaacagtagcctacatggaatcacttttgaacagatc aaagcccagttagaggctttagaactaaagaagacatatgtgttgaatccgctggcacct gaatttatcccccgggctctaagactgcagcattcaggaagtaccaacaaacagcagcaa tcaccacccaaggggaaaagtcttcatcatacccagaatgatcacttccagaatgatgga attgttcagcccaatccttctgtacttattggcaatcctattagagcatatactcctcca ccccctcttggacctcacccaaatttgggaaaatctccaagccctgttcaaagaatagat cctcacactgggacaagtattctttatgtacctgctgtctatggagggaatgtagttatg tcggtgcctttacctgtaccatggacaggataccagggtaggtttgcagttgatcctcga attattacacatcaggcagcaatggcctataacatgaacctattacagacacatggacga ggatctcctattccttatggccttggacatcacccacctgtcaccataggccagccacaa aatcagcatcaggagaaggatcaacatgagcaaaatcgaaatggtaaaagtgatacaaat aattccggacctgaaattaataagattcgaacaccagagaaaaagccaacagaaccaaaa cagctaccaagaccaccctttccaattccacagcagcacaccttgttaaatcagcagcag aataatttgcctgaacaaccaaatcagataccacctcagccaaatcaggcgggacccaac aatgctttttttaatagtgcagttgctcatcggccacagtctcctcctgcagaagctgta attccggagcagcagccccctcccatgctgcaagaaggccacagtcctctgagagccatt gcacaacccggccccattcttccttcacatctgaatagcttcattgatgagaacccctcg ggattacctataggggaggctttagatcgtatacatgggagtgtcgctctggaaacatta aggcagcagcaggcacggttccagcagtggagcgagcatcatgcctttctcagtcagggc agcgctccatacccacaccatcaccatcctcacctccagcatcttcctcagccgcccctg ggattacatcagccgccagtgagggcagactggaagctcaccagcagtgccgaagatgaa gtggagaccacatactcaaggctggagatagatgaagaagctggagaagaaaagttggaa gctagcagaggttggttcatgaggtttaaggaaagaagccatccccataacataaaagtg caaggtgaaacagcaagtgctaatatagaagctgcagcaagttacccagaagatctagct aagatcattgatgaagatggctacaccaaactacagattttctgtgtagacaaaacagcc tcttattggaagaagatgctgtctaggactttctcagcgccacccaagactgtcaaaccc cctgaggatcaactgaagtcggagaacctcgaggtgtccagttccttcaactacagtgtg ctgcagcatcttggccagtttccaccccttatgcctaacaagcagatcgcggagtcggcc aatagcagtagcccccagagctctgcggggggcaagcccgccatgtcctatgccagcgct ctgcgggcccctccaaagcccaggccccctcctgagcaggccaagaagagtagcgaccct ctgtctctcttccaggaactgagcctagggagctcatctggcagcaatggcttttactca tattttaaataa