GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:24:05 Sequence gi568815575f:37680078_37910914 : 230837 bp : 38.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5885 6129 245 1 2 73 96 256 0.143 20.15 1.02 Intr + 14209 14471 263 1 2 113 62 239 0.925 20.01 1.03 Intr + 18606 18643 38 1 2 90 102 7 0.766 -0.64 1.04 Term + 20060 20248 189 1 0 77 54 122 0.590 4.17 1.05 PlyA + 20845 20850 6 -0.45 2.00 Prom + 21290 21329 40 -7.25 2.01 Sngl + 28070 28366 297 1 0 97 47 232 0.978 15.49 2.02 PlyA + 28644 28649 6 1.05 3.00 Prom + 29114 29153 40 -2.75 3.01 Init + 30422 30487 66 1 0 53 90 27 0.185 -1.48 3.02 Intr + 30713 30851 139 1 1 45 27 115 0.162 0.22 3.03 Intr + 31300 31350 51 2 0 109 82 32 0.052 2.66 3.04 Term + 47559 48385 827 1 2 148 39 402 0.354 33.54 3.05 PlyA + 50308 50313 6 1.05 4.03 PlyA - 51518 51513 6 1.05 4.02 Term - 67094 66633 462 2 0 72 38 161 0.273 3.57 4.01 Init - 75676 75509 168 1 0 86 78 109 0.458 9.28 4.00 Prom - 76603 76564 40 -7.45 5.00 Prom + 85489 85528 40 -6.55 5.01 Sngl + 89561 90577 1017 1 0 88 43 744 0.953 66.67 5.02 PlyA + 90805 90810 6 1.05 6.00 Prom + 91285 91324 40 -10.75 6.01 Sngl + 91926 92957 1032 2 0 69 32 351 0.979 24.44 6.02 PlyA + 93014 93019 6 1.05 7.00 Prom + 94203 94242 40 -3.65 7.01 Init + 100001 100045 45 1 0 121 87 15 0.937 5.46 7.02 Intr + 102011 102106 96 1 0 76 98 28 0.527 1.89 7.03 Intr + 103413 103523 111 2 0 85 95 38 0.728 3.86 7.04 Intr + 111898 111982 85 0 1 60 119 15 0.726 0.27 7.05 Intr + 113588 113733 146 0 2 116 121 62 0.999 11.38 7.06 Intr + 115086 115142 57 2 0 107 81 8 0.524 0.16 7.07 Intr + 115874 116064 191 1 2 71 61 128 0.955 5.86 7.08 Intr + 118878 119007 130 0 1 68 80 66 0.993 3.58 7.09 Intr + 121179 121271 93 2 0 77 79 48 0.798 2.04 7.10 Intr + 123800 124053 254 1 2 79 78 196 0.999 13.01 7.11 Intr + 124929 125091 163 0 1 119 115 160 0.999 20.86 7.12 Intr + 126310 126456 147 0 0 79 110 181 0.999 18.91 7.13 Intr + 129490 129614 125 0 2 95 69 132 0.598 10.46 7.14 Intr + 141680 141759 80 2 2 95 99 55 0.188 5.58 7.15 Intr + 145249 145418 170 2 2 18 85 113 0.442 2.74 7.16 Term + 154720 154848 129 0 0 43 42 119 0.038 0.00 7.17 PlyA + 155270 155275 6 1.05 8.07 PlyA - 155680 155675 6 1.05 8.06 Term - 160574 160498 77 0 2 111 36 113 0.998 5.42 8.05 Intr - 161028 160951 78 1 0 81 95 46 0.887 3.20 8.04 Intr - 161828 161705 124 2 1 118 111 94 0.999 13.94 8.03 Intr - 164798 164619 180 2 0 56 93 36 0.507 0.04 8.02 Intr - 166281 166240 42 0 0 112 105 47 0.988 6.32 8.01 Init - 167013 166966 48 0 0 92 52 110 0.866 6.80 8.00 Prom - 170956 170917 40 -6.75 9.00 Prom + 174420 174459 40 -3.25 9.01 Init + 180358 180403 46 0 1 89 98 58 0.960 7.80 9.02 Intr + 183604 183636 33 2 0 115 48 57 0.305 1.68 9.03 Intr + 199657 199775 119 2 2 36 71 127 0.065 5.06 9.04 Term + 221190 221417 228 2 0 77 46 199 0.586 10.25 9.05 PlyA + 221874 221879 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 98727 98533 195 0 0 35 44 210 0.836 7.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_1|244_aa MKFPASVLASVFLFVAETTAALSLSSTYRSGGDRMWQALTLLFSLLPCALVQLTLLFVHR DLSRDRPLVLLLHLLQLGPLFRCFEVFCIYFQSGNNEEPYVSITKKRQMPKNGLSEEIEK EVGQAEGKLITHRSAFSRASVIQAFLGSAPQLTLQLYISVMQQDVTVGRNPVLVSEDKVV NKYLMANILEQAGKKAQNNCRVRITPEHIERAMHKDKQLRCLLEDVPTIRLMRCPSPRRR DAWV >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_1|735_bp atgaaattcccggcctcggtgctggcgtccgtgttcctgttcgtggccgagacaacggcg gcgctcagcctgagcagcacctaccgctcgggcggggaccgcatgtggcaggcgctgacg ttgcttttctcgctactgccttgcgcgctcgtgcagctcacgcttctcttcgtacaccgc gacctcagccgcgaccgcccgctcgtactgctgctgcacctgctgcaacttgggcccctt ttcaggtgttttgaagtcttctgcatctactttcagtcaggcaacaatgaagagccttat gtcagtatcaccaagaagaggcaaatgccaaaaaatggcctctcagaggagattgagaag gaggtgggccaggcagaaggcaaactaatcacccaccgatcagcgttcagccgggcgtcg gtgatccaggctttcttgggctcagccccccagctgaccctacagctgtacataagtgtc atgcagcaggacgtcactgttggaagaaaccctgtgctggtctctgaggataaagtggtg aataagtatctgatggccaacatcctggagcaggcaggcaagaaggcccagaacaattgc agggtgcgcatcacaccagaacacatagagagggccatgcacaaggataagcagctcaga tgcctcttggaggatgtacccaccatcaggttgatgaggtgccccagcccgagaaggagg gatgcttgggtctga >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_2|98_aa MAAVQSSFGSASEGDCGKRGRGRPWGEGEGEGKCSYKGEAAAGLITEEEDNVRSETRCCT AGFEDGGKRQEPRNAVVEAGKGKKMDSLLDPLEGTCPC >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_2|297_bp atggcagcagtacagtccagcttcggctcggcatcagagggagactgtggaaagagaggg agagggagaccgtggggagagggagagggagagggcaaatgttcttataagggagaggca gcagcaggcttgatcacggaggaagaagacaatgtgaggagtgaaacaagatgctgcact gctggctttgaagatggagggaagaggcaggagccaaggaatgcagttgtagaagctgga aaaggcaagaaaatggattctctccttgatcctctggagggaacatgcccctgttga >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_3|360_aa MNRCYGRAWWLTPVISALWEAEYVSVTCQIAYILELKNNKTSNNNRSYIAPQNVEMPVHR PECHRFFRGLSPLAGNAAQVLMRSVGLLMTISLLSIVYGALRCNILAIKIKYDEYEVKVK PLAYVCIFLWRSFEIATRVVVLVLFTSVLKTWVVVIILINFFSFFLYPWILFWCSGSPFP ENIEKALSRVGTTIVLCFLTLLYTGINMFCWSAVQLKIDSPDLISKSHNWYQLLVYYMIR FIENAILLLLWYLFKTDIYMYVCAPLLVLQLLIGYCTAILFMLVFYQFFHPCKKLFSSSV SEGFQRWLRCFCWACRQQKPCEPIGKEDLQSSRDRDETPSSSKTSPEPGQFLNAEDLCSA >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_3|1083_bp atgaacagatgttatggccgggcgtggtggctcacacctgtaatctcagcactctgggag gctgagtatgtatcggtgacttgtcagatagcctatatcctggagttgaaaaataataag accagcaacaacaacaggagttacatcgccccacagaatgtggaaatgccggtgcacagg cctgagtgccaccgattctttaggggtctttctcctctggcagggaatgcagctcaggta ctgatgagaagtgtgggtctcctcatgaccatatccctgttgtccattgtgtatggagcc ttgcgctgcaacatcctagccatcaaaatcaagtacgatgagtatgaagtcaaagtgaag cctctggcctatgtctgtatcttcctgtggaggagctttgagattgccactcgagttgta gtcctggtcctctttacctccgtcctgaagacctgggtggtggttataatactcatcaac ttcttcagtttcttcttgtacccctggatcctcttctggtgcagtggttccccattccct gagaacatagagaaggccctcagtagagtgggcaccaccattgtactatgctttctaact ttactctatactggtatcaacatgttctgctggtctgctgtacagctgaaaattgacagc cctgacctcatcagcaagtcccataattggtaccagctactggtgtattacatgataaga ttcatcgagaatgccatcctcctcctcctgtggtatcttttcaagactgacatctatatg tatgtgtgcgcacctctgttggtcctgcagctgctcattgggtactgcacagccattctc ttcatgcttgtattctatcagttcttccacccttgcaaaaagctcttttcttccagtgtt tctgaaggctttcagaggtggctcaggtgtttttgctgggcctgcaggcagcaaaaaccc tgtgagccgataggaaaggaagatctacagtcatccagagatagagatgagacaccttct agcagtaaaacaagtcctgagcctggtcagttcttgaatgctgaagatctctgctctgct taa >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_4|209_aa MGLGNYWSLRYYTFDAKSVNKNGLISDPWFNNKKLTVTPVRGLLENNEGSQMLWVEVNEI SDAVNQNYHGNLPLSFFFIRPGNAPHNESNRNVTSLDSALTEKPGQFLQEWFVHIPHPGC RRPDPFPHLNSHSDENLFLYGSFEELPSPRVWRITDDSETPNTSSESGDMPPLSLPSGFP TCHKSTRLLQPCWRCEVPRLKPSYPLPLG >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_4|630_bp atggggcttggcaactactggagtctcaggtactacacatttgatgccaagagtgtaaat aaaaatgggctgataagtgacccttggtttaacaataagaagttgacagtgaccccagta aggggcctactggaaaacaatgagggaagccagatgttgtgggttgaggtcaatgaaatt agtgatgcagtcaaccaaaactatcatggcaatctgcccctctctttcttcttcatcaga cctgggaatgcaccacataatgaaagcaacaggaatgtgacatctctcgacagtgctctc acagagaaaccaggccagtttttgcaagaatggtttgttcatatacctcacccaggctgc agaagacctgaccccttccctcacctcaacagccactcagatgagaatttatttctttat gggtcctttgaggagttgccctcccccagggtgtggagaattactgatgactcagagacg ccaaacaccagctctgaatcaggggacatgccacctctaagcctcccttcgggtttcccc acatgccacaagtccaccaggctcctgcagccctgctggcggtgtgaggtccctcgcctt aagcccagctaccctttgccactgggctga >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_5|338_aa MGKKQSRKTGNSKNQSASPPPKGCSSSPAMEQSWMENDFDELREEGFRRSNYSKLKEDVR THGKEVKNLEKKLDEWLTRITNAEKSLKDLMELKTKARELSEECRSLRSRCDQLEERVSV MEDEVNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNVARQANIQIQEIQRTPQRYSSRRATPRYITVRFTKVEMKEKTLRVAREKGRV THKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRNFVTTGPALKELLKEALNMERNKWYQPLQKHAKL >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_5|1017_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagcgcctctccccct ccaaagggatgcagctcctcaccagcaatggaacaaagctggatggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaactactccaagctaaaggaggacgttcga acccatggcaaagaagttaaaaaccttgaaaaaaaattagacgaatggctaactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccaaggctcgagaacta agtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagtg atggaagatgaagtgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat atcatccaggagaacttccccaatgtagcaaggcaggccaacattcagattcaggaaata cagagaaccccacaaagatactcctcgagaagagcaactccaagatacataactgtcaga ttcaccaaagttgaaatgaaggaaaaaacgttaagggtagccagagagaaaggtcgggtt acccacaaagggaagcccatcagactaacagcggatctctcagcagaaactctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagaaactttgtcaccaccgggcctgccctaaaagagctcctgaaggaagca ctaaacatggaaaggaacaaatggtaccagccactgcaaaaacatgccaaattgtaa >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_6|343_aa MDKFFDTYTLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDRFTVKFYQRYKEEL VPFLLKLFQLIEKEGILPNSFYEASIILTPKPGRDTTKKENFRPISLMNIDAKILNKILT NRIQQHIKKLIHHDQVGFIPGMQGWFNIYKSINVIQHINRTNDKNHMIISTDAEKAFDKI QQPFMLKTLNKLGIDEIYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIFSARNLLKLISNFSKV SGYKINVQKSQTFLYTNKRQTESQIMSELPFTVASKRIKYLGI >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_6|1032_bp atggataaattcttcgacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaggaccagatagattcacagtcaaattctaccagaggtacaaggaggagctg gtaccattccttctgaaactattccaattaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgacaccaaagcctggcagagacacaaccaaaaaagag aattttagaccaatatccctgatgaacattgatgcaaaaatcctcaataaaatactgaca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaacatatacaaatcaataaatgtaatccagcatataaacaga accaacgacaaaaaccatatgattatctcaacagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtattgatgagatttatctcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaacaggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta gaaaaccccatcttctcagcccgaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacaaaaatcacaaacattcttatacaccaataaaagacaa acagagagccaaatcatgagtgaactcccattcacagttgcttcaaagagaataaaatac ctaggaatctaa >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_7|673_aa MGNWAVNEGLSIFVILVWLGLNVFLFVWYYRVYDIPPKFFYTRKLLGSALALARAPAACL NFNCMLILLPVCRNLLSFLRGSSACCSTRVRRQLDRNLTFHKMVAWMIALHSAIHTIAHL FNVEWCVNARVNNSDPYSVALSELGDRQNESYLNFARKRIKRTALGLLKSLFSQAQRSWK NPEGGLYLAVTLLAGITGVVITLCLILIITSSTKTIRRSYFEVFWYTHHLFVIFFIGLAI HGAERIVRGQTAESLAVHNITVCEQKISEWGKIKECPIPQFAGNPPMTWKWIVGPMFLYL CERLVRFWRSQQKVVITKVVTHPFKTIELQMKKKGFKMEVGQYIFVKCPKVSKLEWHPFT LTSAPEEDFFSIHIRIVGDWTEGLFNACGCDKQEFQDAWKLPKIAVDGPFGTASEDVFSY EVVMLVGAGIGVTPFASILKSVWYKYCNNATNLKLKKIYFYWLCRDTHAFEWFADLLQLL ESQMQERNNAGFLSYNIYLTGWDESQANHFAVHHDEEKDVITGLKQKTLYGRPNWDNEFK TIASQHPKDMDEPGSHHSQQTNTGTENQTPHVLTRGQNEQRAIILNLRESTNFEAYIHNT INGILLSPGGHTAKLHEEIKSLKIKQNNNFKYIMLSATIAADTIISTAAGVSSIEAWTVF PGLESGRETRGHL >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_7|2022_bp atggggaactgggctgtgaatgaggggctctccatttttgtcattctggtttggctgggg ttgaacgtcttcctctttgtctggtattaccgggtttatgatattccacctaagttcttt tacacaagaaaacttcttgggtcagcactggcactggccagggcccctgcagcctgcctg aatttcaactgcatgctgattctcttgccagtctgtcgaaatctgctgtccttcctcagg ggttccagtgcgtgctgctcaacaagagttcgaagacaactggacaggaatctcaccttt cataaaatggtggcatggatgattgcacttcactctgcgattcacaccattgcacatcta tttaatgtggaatggtgtgtgaatgcccgagtcaataattctgatccttattcagtagca ctctctgaacttggagacaggcaaaatgaaagttatctcaattttgctcgaaagagaata aagaggactgccctggggctactaaagtcccttttctcacaggctcagagatcttggaag aaccctgaaggaggcctgtacctggctgtgaccctgttggcaggcatcactggagttgtc atcacgctgtgcctcatattaattatcacttcctccaccaaaaccatccggaggtcttac tttgaagtcttttggtacacacatcatctctttgtgatcttcttcattggccttgccatc catggagctgaacgaattgtacgtgggcagaccgcagagagtttggctgtgcataatata acagtttgtgaacaaaaaatctcagaatggggaaaaataaaggaatgcccaatccctcag tttgctggaaaccctcctatgacttggaaatggatagtgggtcccatgtttctgtatctc tgtgagaggttggtgcggttttggcgatctcaacagaaggtggtcatcaccaaggtggtc actcaccctttcaaaaccatcgagctacagatgaagaagaaggggttcaaaatggaagtg ggacaatacatttttgtcaagtgcccaaaggtgtccaagctggagtggcacccttttaca ctgacatccgcccctgaggaagacttctttagtatccatatccgcatcgttggggactgg acagaggggctgttcaatgcttgtggctgtgataagcaggagtttcaagatgcgtggaaa ctacctaagatagcggttgatgggccctttggcactgccagtgaagatgtgttcagctat gaggtggtgatgttagtgggagcagggattggggtcacacccttcgcatccattctcaag tcagtctggtacaaatattgcaataacgccaccaatctgaagctcaaaaagatctacttc tactggctgtgccgggacacacatgcctttgagtggtttgcagatctgctgcaactgctg gagagccagatgcaggaaaggaacaatgccggcttcctcagctacaacatctacctcact ggctgggatgagtctcaggccaatcactttgctgtgcaccatgatgaggagaaagatgtg atcacaggcctgaaacaaaagactttgtatggacggcccaactgggataatgaattcaag acaattgcaagtcaacaccctaaggacatggatgaacctggaagccatcattctcagcaa actaacacaggaacagaaaaccaaacaccgcatgttctcactcggggtcaaaatgagcag agagccatcattttgaatcttagggagagcacaaacttcgaggcttacattcacaataca atcaatggcatccttctgtcccctggtggccacacagctaaactgcatgaagagataaag tctttgaagataaaacaaaacaataatttcaagtacatcatgctctctgccacaattgct gctgataccattatcagtactgctgctggtgttagtagtattgaagcttggacagtcttc cctggattggagagtggtagagagacgagagggcatctgtag >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_8|182_aa MARSLGVLVALPFPLPVGFNAEEAHNIVKEFSPGKSHPRESLSQQRETLVQAGAFHWFGW RGKEKLKPKFNTFGRLPNEMLEQWLSPTNQCVDGVLGGEDYNHNNINQWTASIVEQSLTH LVKLGKAYKYIVTCAVVQKSAYGFHTASSCFWDTTSDGTCTVRWENRTMNCIVNVFAIAI VL >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_8|549_bp atggctcggagtctcggggtcctggtggcactgccattcccgctcccggttggcttcaat gctgaggaagcccacaatattgtcaaagagttcagccctgggaagagtcatccaagagaa agtttgagtcagcagagagaaacacttgttcaggctggtgcctttcattggtttgggtgg agagggaaggagaaactaaagcccaagttcaacacatttgggagactcccaaatgagatg ttagaacaatggttatcacccaccaaccagtgtgtagatggggttttaggtggtgaagat tataatcacaacaacatcaaccagtggactgcaagcatagtggaacaatccttaacacac ctggttaagttgggaaaagcctataaatatattgtgacctgtgcagtggtccagaagagc gcatatggctttcacacagccagctcctgtttttgggataccacatctgatggaacctgt accgtaagatgggagaaccggaccatgaactgtattgtcaacgtttttgccattgctatt gttctttaa >gi568815575f:37680078_37910914|GENSCAN_predicted_peptide_9|141_aa MDAAKLPTMHMTVPYSKDMDDVDVEQEIAIATPAFSSHHPDQSVATGRGKSLYQQKDYDL LKAQMIVIRASQRSYLIEAVVEAIAGRGIDAALSLSKKPGLLTLPCQQTSLQLGLRLSQD ARQAQDSYEGQLSNRKTQNLH >gi568815575f:37680078_37910914|GENSCAN_predicted_CDS_9|426_bp atggatgctgccaaacttccaacaatgcacatgacagtcccctacagtaaagacatggat gatgtagatgttgaacaagaaattgccatagccactccagccttcagcagccaccaccct gatcagtcagtagctactggtagaggcaaaagcctctaccagcaaaaagattatgacctg ctaaaggctcagatgatcgtgatccgggcatcccaaagatcttacctcatagaagctgtt gtagaagcaattgcaggaagaggcattgatgcagcgttgtcattaagcaaaaagccagga ctgctgacactaccctgccagcagacaagcctgcagctgggcctgaggctctcacaggat gcaaggcaagcacaggattcatatgagggacaactatctaaccggaaaacacaaaacctc cattaa