GENSCAN 1.0	Date run: 6-Nov-116	Time: 14:41:48

Sequence gi568815586f:120396422_120598207 : 201786 bp : 46.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  41706  41808  103  2  1   85   84    74 0.818   5.65
 1.02 Intr +  41958  42100  143  1  2   85  107   240 0.942  25.67
 1.03 Term +  44359  44442   84  0  0   92   44    51 0.665  -1.25
 1.04 PlyA +  46979  46984    6                               1.05

 2.03 PlyA -  47570  47565    6                               1.05
 2.02 Term -  48534  48451   84  0  0   97   43   101 0.996   4.15
 2.01 Init -  49951  49805  147  1  0   97   99   298 0.636  31.89
 2.00 Prom -  49999  49960   40                             -16.83

 3.00 Prom +  50017  50056   40                             -16.89
 3.01 Init +  50060  50140   81  1  0   53  103   115 0.999   8.49
 3.02 Intr +  50236  50408  173  0  2  121   97   359 0.997  38.84
 3.03 Intr +  60655  60758  104  1  2   70   98   142 0.974  13.22
 3.04 Term +  62206  62213    8  2  2  121   42     0 0.606  -3.27
 3.05 PlyA +  63280  63285    6                              -0.45

 4.05 PlyA -  64976  64971    6                               1.05
 4.04 Term -  65741  65598  144  2  0  117   53    47 0.825   2.01
 4.03 Intr -  67686  67529  158  1  2   51   90   191 0.677  15.33
 4.02 Intr -  69366  69275   92  2  2   98   47   103 0.594   6.84
 4.01 Init -  73188  73001  188  0  2   34   93   527 0.997  46.33
 4.00 Prom -  75373  75334   40                              -4.16

 5.00 Prom +  88620  88659   40                              -4.46
 5.01 Init + 100001 100132  132  1  0   76  100   232 0.897  23.34
 5.02 Term + 101652 101789  138  2  0  116   42   130 0.993   9.16
 5.03 PlyA + 102052 102057    6                               1.05

 6.08 PlyA - 102980 102975    6                               1.05
 6.07 Term - 107464 107363  102  1  0  124   31   107 0.999   6.88
 6.06 Intr - 107660 107549  112  1  1   89   80    94 0.951   9.08
 6.05 Intr - 108562 108474   89  1  2   60   52    24 0.638  -5.23
 6.04 Intr - 113702 113596  107  0  2   86   80   177 0.842  16.63
 6.03 Intr - 120367 120146  222  2  0   96  105   172 0.998  17.80
 6.02 Intr - 125942 125793  150  0  0   48   95   110 0.989   7.83
 6.01 Init - 126570 126204  367  0  1   77   50   262 0.583  18.49
 6.00 Prom - 127281 127242   40                              -9.46

 7.00 Prom + 129027 129066   40                              -3.06
 7.01 Init + 129887 130245  359  1  2   60   33   289 0.175  17.18
 7.02 Intr + 137914 137968   55  1  1   60   57    69 0.437  -0.02
 7.03 Term + 138237 138536  300  2  0   92   40   340 0.837  24.82
 7.04 PlyA + 139175 139180    6                               1.05

 8.00 Prom + 141628 141667   40                              -6.36
 8.01 Init + 147524 147575   52  1  1   64  105    22 0.674   0.82
 8.02 Intr + 149984 150180  197  0  2  106   64   136 0.981  12.13
 8.03 Intr + 156078 156277  200  2  2  109   89    25 0.987   2.85
 8.04 Intr + 158297 158387   91  2  1  111   89    90 0.963  11.40
 8.05 Intr + 160861 161045  185  0  2   54   78    10 0.516  -4.81
 8.06 Intr + 161125 161261  137  1  2   39  105    74 0.578   4.41
 8.07 Intr + 164305 164465  161  2  2   70  100   230 0.997  22.11
 8.08 Intr + 166926 167202  277  2  1   12  100   199 0.628  10.59
 8.09 Intr + 167389 167522  134  2  2   44   78   210 0.999  15.96
 8.10 Intr + 168651 168768  118  2  1   76   94    15 0.984   0.94
 8.11 Intr + 169007 169108  102  0  0   57   84   152 0.999  11.85
 8.12 Intr + 170404 170559  156  2  0   87   97    74 0.898   8.18
 8.13 Intr + 174719 174870  152  0  2   93   99    -7 0.684   0.78
 8.14 Intr + 179210 179267   58  1  1   88  100    19 0.702   1.56
 8.15 Intr + 179371 179529  159  2  0   44   95   137 0.993   9.96
 8.16 Term + 180169 180245   77  2  2   99   47    56 0.934   0.60
 8.17 PlyA + 180446 180451    6                               1.05

 9.07 PlyA - 182649 182644    6                               1.05
 9.06 Term - 182877 182759  119  1  2   46   44   109 0.959   1.00
 9.05 Intr - 182991 182926   66  1  0   81   94    83 0.942   7.08
 9.04 Intr - 183230 183093  138  0  0   17  121    61 0.758   2.74
 9.03 Intr - 183502 183353  150  2  0   63   75    98 0.971   6.13
 9.02 Intr - 184836 184694  143  2  2   94   23   267 0.666  20.80
 9.01 Init - 184941 184922   20  0  2  106  115    27 0.548   5.50
 9.00 Prom - 186020 185981   40                              -9.46

10.00 Prom + 186253 186292   40                              -4.66
10.01 Init + 188408 188528  121  1  1   85   83   120 0.155   9.55
10.02 Intr + 193091 193180   90  0  0   35  102    60 0.082   2.07
10.03 Term + 193291 193361   71  2  2   90   48    31 0.167  -2.60
10.04 PlyA + 194996 195001    6                              -0.45

11.02 PlyA - 197535 197530    6                               1.05
11.01 Term - 197907 197591  317  1  2   16   32   299 0.896  12.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_1|109_aa
MAVVGVSSVSRLLGRSRPQLGRPMSSGAHGEEGSARMWKTLTFFVALPGVAVSMLNVYLK
SHHGEHERPEFIAYPHLRIRTKKLAFELQVSVQVEKKLALVISVSTVKV

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_1|330_bp
atggcggtagttggtgtgtcctcggtttctcggctgctgggtcggtcccgcccacagctg
gggcggcctatgtcgagtggcgcccatggcgaagagggctcagctcgcatgtggaagact
ctcaccttcttcgtcgcgctccccggggtggcagtcagcatgctgaatgtgtacctgaag
tcgcaccacggagagcacgagagacccgagttcatcgcctacccccatctccgcatcagg
accaagaaattagcatttgagcttcaagtcagtgtccaagttgaaaagaaattggcatta
gttatttctgtttccacagtgaaggtctag

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_2|76_aa
MNSVGEACTDMKREYDQCFNRWFAEKFLKGDSSGDPCTDLFKRYQQCVQKAIKEKEIPIE
GLEFMGHGKEKPENSS

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_2|231_bp
atgaacagtgtgggggaggcatgcacggacatgaagcgcgagtacgaccagtgcttcaat
cgctggttcgccgagaaatttctcaagggggacagctccggggacccgtgcaccgacctc
ttcaagcgctaccagcagtgtgttcagaaagcaataaaggagaaagagattcctattgaa
ggactggagttcatgggccatggcaaagaaaagcctgaaaattcttcttga

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_3|121_aa
MWSRLVWLGLRAPLGGRQGFTSKADPQGSGRITAAVIEHLERLALVDFGSREAVARLEKA
IAFADRLRAVDTDGVEPMESVLEDRCLYLRSDNVVEGNCADELLQNSHRVVEEYFVAPPG
R

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_3|366_bp
atgtggtcgcggttggtgtggctgggccttcgggcccctctgggcgggcgccagggcttc
acctccaaggcggatcctcagggcagtggccggatcacggctgcggtgatcgagcacctg
gagcgtctagcgcttgtggacttcggcagccgcgaggcagtggcgcgactggagaaagct
atcgccttcgccgaccggctacgcgccgtggacacagacggggtggagcccatggaatcg
gtcctggaggacagatgtctatacctgagatccgacaatgtggtagaaggcaactgtgct
gatgaattactacaaaactcccatcgcgtcgtggaggagtactttgtggcccccccaggt
aggtga

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_4|193_aa
MSGWADERGGEGDGRIYVGNLPTDVREKDLEDLFYKYGRIREIELKNRHGLVPFAFVRFE
DPRDAEDAIYGRNGYDYGQCRLRVEFPRTYGGRGSWQDLKDHMREAGDVCYADVQKDGVG
MVEYLRKEDMEYALRKLDDTKFRSHEGETSYIRVYPERSTSYGYSRSRSGSRGRDSPYQS
RGSPHYFSPFRPY

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_4|582_bp
atgtcgggctgggcggacgagcgcggcggcgagggcgacgggcgcatctacgtggggaac
cttccgaccgacgtgcgcgagaaggacttggaggacctgttctacaagtacggccgcatc
cgcgagatcgagctcaagaaccggcacggcctcgtgcccttcgccttcgtgcgcttcgag
gacccccgagatgcagaggatgctatttatggaagaaatggttatgattatggccagtgt
cggcttcgtgtggagttccccaggacttatggaggtcggggcagctggcaggacctgaag
gatcacatgcgagaagctggggatgtctgttatgctgatgtgcagaaggatggagtgggg
atggtcgagtatctcagaaaagaagacatggaatatgccctgcgtaaactggatgacacc
aaattccgctctcatgagggtgaaacttcctacatccgagtttatcctgagagaagcacc
agctatggctactcacggtctcggtctgggtcaaggggccgtgactctccataccaaagc
aggggttccccacactacttctctcctttcaggccctactga

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_5|89_aa
MCDRKAVIKNADMSEEMQQDSVECATQALEKYNIEKDIAAHIKKEFDKKYNPTWHCIVGR
NFGSYVTHETKHFIYFYLGQVAILLFKSG

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_5|270_bp
atgtgcgaccgaaaggccgtgatcaaaaatgcggacatgtcggaagagatgcaacaggac
tcggtggagtgcgctactcaggcgctggagaaatacaacatagagaaggacattgcggct
catatcaagaaggaatttgacaagaagtacaatcccacctggcattgcatcgtggggagg
aacttcggtagttatgtgacacatgaaaccaaacacttcatctacttctacctgggccaa
gtggccattcttctgttcaaatctggttaa

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_6|382_aa
MRFAKKHNKKGLKKMQANSARATSARADAIKALVKPKEVKPEIPKGVSRKLDPLAYIAHP
QAWEMCSFLHCQGAQAMSVKGQGQRSNQGPGCSSSLGSQRCPDTYEGFRVDISVCQCEDR
RTVYQVFESVAKKYDVMNDMMSLGIHRVWKDLLLWKMHPLPGTQLLDVAGGTGDIAFRFL
NYVQSQHQRKQKRQLRAQQNLSWEEIAKEYQNEEDSLGGSRVVVCDINKEMLKVGKQKAL
AQGYRAGLAWVLGDAEELPFDDDKFDIYTIAFGIRNVTHIDQALQEAHRVLKPGGRFLCL
EFSQVNNPLISRLYDLYSFQVIPVLGEVIAGDWKSYQYLVESIRRFPSQEEFKDMIEDAG
FHKVTYESLTSGIVAIHSGFKL

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_6|1149_bp
atgcgctttgccaagaagcacaacaagaaaggcctaaaaaagatgcaggccaacagtgcc
agggccacgagtgcacgtgctgacgctatcaaggcccttgtaaagcccaaggaggttaag
cccgagatcccaaagggtgtcagccgcaagctcgatccacttgcctacattgcccacccc
caggcttgggaaatgtgctcattcctgcattgccaaggggctcaagctatgtcggtcaaa
ggccaaggccaaagatcaaaccaaggcccaggctgcagctccagcttaggctcccaaagg
tgcccggacacctacgaaggcttcagagtagatatctctgtctgccaatgtgaggacaga
aggactgtctatcaggtgtttgaaagtgtggctaagaagtatgatgtgatgaatgatatg
atgagtcttggtatccatcgtgtttggaaggatttgctgctctggaagatgcacccgctt
cctgggacccagctgcttgatgttgctggaggcacaggtgacattgcattccggttcctt
aattatgttcagtcccagcatcagagaaaacagaagaggcagttaagggcccaacaaaat
ttatcctgggaagaaattgccaaagagtaccagaatgaagaagattccttgggcgggtct
cgtgtcgtggtgtgtgacatcaacaaggagatgctaaaggttggaaagcagaaagccttg
gctcaaggatacagagctggacttgcatgggtattaggagatgctgaagaactgcccttt
gatgatgacaagtttgatatttacaccattgcctttgggatccggaatgtcacacacatt
gatcaggcactccaggaagctcatcgggtgctgaaaccaggaggacggtttctctgtctg
gaatttagccaagtgaacaatcccctcatatccaggctttatgatctatatagcttccag
gtcatccctgtcctgggagaggtcatcgctggagactggaagtcctatcagtaccttgta
gagagtatccgaaggtttccgtctcaggaagagttcaaggacatgatagaagatgcaggc
tttcacaaggtgacttacgaaagtctaacatcaggcattgtggccattcattctggcttc
aaactttaa

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_7|237_aa
MKMLIMLEVKFKLSVSGVIGEIDDCGNIDTAATGETLDIQPEELQEDELTNMNKKWSCDK
KVEDVPEEVMPAKTFTLKEFLEVFHNIESTKDKISEVDKNLERNMAICQGIEKIFHTTRS
PSGFFEMLFGDSSPFPEQFEKPRKETGKNVAMKAENRCRRRPPPALNAMSLGPRRARSAP
TAVAAEAPVDAAELPQRRRHRLRHGQEQRLQQLLRLFGQQQRATAAPLRLGGASRRV

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_7|714_bp
atgaaaatgctgataatgctggaagtgaaattcaaattgagtgtcagtggagttatagga
gaaatagatgactgtgggaatattgacactgctgccactggagagactctagacatacag
ccagaggaactgcaggaagatgaacttaccaacatgaacaagaaatggagctgtgacaaa
aaggttgaagatgtcccagaagaagtgatgccagcaaaaactttcacattaaaggagttc
ttggaggtatttcataacattgaaagcacaaaggataaaatatcagaagtagataaaaac
ttagaaagaaatatggcaatttgccaaggcatagaaaagatatttcatacgacaagaagt
ccatccgggttctttgagatgctgtttggcgactcgtcgccattcccggagcagtttgag
aagccaaggaaggaaacagggaaaaatgtcgccatgaaggccgagaaccgctgccgccgc
cgacccccgccggccctgaacgccatgagcctgggtccccgccgcgcccgctccgctccg
actgccgtcgccgccgaggcccccgttgatgccgctgagctcccccaacgccgccgccac
cgcctccgacatggacaagaacagcggctccaacagctcctccgcctcttcgggcagcag
caaagggcaacagccgccccgctccgcctcggcggggccagccggcgagtctaa

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_8|751_aa
MLGMVAHACNPSTLGGRDGKNSSGSKRYNRKRELSYPKNESFNNQSRRSSSQKSKTFNKM
PPQRGGGSSKLFSSSFNGGRRDEVAEAQRAEFSPAQFSGPKKINLNHLLNFTFEPRGQTG
HFEGSGHGSWGKRNKWGHKPFNKELFLQANCQFVVSEDQDYTAHFADPDTLVNWDFVEQV
RICSHEVPSCPICLYPPTAAKITRCGHIFCWACILHYLSLSEKTWSKCPICYSSVHKKDL
KSVVATESHQYVVGDTITMQLMKREKGVLVALPKSKWMNVDHPIHLGDEQHSQYSKLLLA
SKEQVLHRVVLEEKVALEQQLAEEKHTPESCFIEAAIQELKGVLEYLSAFDEETTEVCSL
DTPSRPLALPLVEEEEAVSEPEPEGLPEACDDLELADDNLKEGTICTESSQQEPITKSGF
TRLSSSPCYYFYQAEDGQHMFLHPVNVRCLVREYGSLERSPEKISATVVEIAGYSMSEDV
RQRHRYLSHLPLTCEFSICELALQPPVVSKETLEMFSDDIEKRKRQRQKKAREERRRERR
IEIEENKKQGKYPEVHIPLENLQQFPAFNSYTCSSDSALGPTSTEGHGALSISPLSRSPG
SHAGHPLERDEYNQVSGSLSDFLLTPLSPTASQGSPSFCVGSLEEDSPFPSFAQMLRVGK
AKADVWPKTAPKKDENSLVPPAPVDSDGESDNSDRVPVPSFQNSFSQAIEAAFMKLDTPA
TSDPLSEEKGGKKRKKQKQKLLFSTSVVHTK

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_8|2256_bp
atgctgggcatggtggctcatgcctgtaatcccagtactttgggaggccgagatggaaag
aactccagtggatccaagcgttataatcgcaaacgtgaactttcctaccccaaaaatgaa
agttttaacaaccagtcccgtcgctccagttcacagaaaagcaagacttttaacaagatg
cctcctcaaaggggcggcggcagcagcaaactctttagctcttcttttaatggtggaaga
cgagatgaggtagcagaggctcaacgggcagagtttagccctgcccagttctctggtcct
aagaagatcaacctgaaccacttgttgaatttcacttttgaaccccgtggccagacgggt
cactttgaaggcagtggacatggtagctggggaaagaggaacaagtggggacataagcct
tttaacaaggaactctttttacaggccaactgccaatttgtggtgtctgaagaccaagac
tacacagctcattttgctgatcctgatacattagttaactgggactttgtggaacaagtg
cgcatttgtagccatgaagtgccatcttgcccaatatgcctctatccacctactgcagcc
aagataacccgttgtggacacatcttctgctgggcatgcatcctgcactatctttcactg
agtgagaagacgtggagtaaatgtcccatctgttacagttctgtgcataagaaggatctc
aagagtgttgttgccacagagtcacatcagtatgttgttggtgataccattacgatgcag
ctgatgaagagggagaaaggggtgttggtggctttgcccaaatccaaatggatgaatgta
gaccatcccattcatctaggagatgaacagcacagccagtactccaagttgctgctggcc
tctaaggagcaggtgctgcaccgggtagttctggaggagaaagtagcactagagcagcag
ctggcagaggagaagcacactcccgagtcctgctttattgaggcagctatccaggagctc
aagggtgtgctggagtatctgtctgccttcgatgaagaaaccacggaagtttgttctctg
gacactccttctagacctcttgctctccctctggtagaagaggaggaagcagtgtctgaa
ccagagcctgaggggttgccagaggcctgtgatgacttggagttagcagatgacaatctt
aaagaggggaccatttgcactgagtccagccagcaggaacccatcaccaagtcaggcttc
acacgcctcagcagctctccttgttactacttttaccaagcggaagatggacagcatatg
ttcctgcaccctgtgaatgtgcgctgcctcgtgcgggagtacggcagcctggagaggagc
cccgagaagatctcagcaactgtggtggagattgctggctactccatgtctgaggatgtt
cgacagcgtcacagatatctctctcacttgccactcacctgtgagttcagcatctgtgaa
ctggctttgcaacctcctgtggtctctaaggaaaccctagagatgttctcagatgacatt
gagaagaggaaacgtcagcgccaaaagaaggctcgggaggaacgccgccgagagcgcagg
attgagatagaggagaacaagaaacagggcaagtacccagaagtccacattcccctcgag
aatctacagcagtttcctgccttcaattcttatacctgctcctctgattctgctttgggt
cccaccagcaccgagggccatggggccctctccatttctcctctcagcagaagtccaggt
tcccatgcaggacaccccttggaacgtgacgaatataatcaggtctctggttccctttca
gactttctgctgacccctctgtcacccactgccagtcagggcagtccctcattctgcgtt
gggagtctggaagaagactctcccttcccttcctttgcccagatgctgagggttggaaaa
gcaaaagcagatgtgtggcccaaaactgctccaaagaaagatgagaacagcttagttcct
cctgcccctgtggacagcgacggggagagtgataattcagaccgtgttcctgtgcccagt
tttcaaaattccttcagccaagctattgaagcagccttcatgaaactggacacaccagct
acttcagatcccctctctgaagagaaaggaggaaagaaaagaaaaaaacagaaacagaag
ctcctgttcagcacctcagtcgtccacaccaagtga

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_9|211_aa
MVRFKHRYLLCELVSDDPRCRLSLDDRVLSSLVRDTIARVHGTFGAAACSIGFAVRYLNA
YTGIVLLRCRKEFYQLVWSALPFITYLENKGHRYPCFFNTLHVGVCKVEIFHKLLTTVTH
MPGTIRTCQKFLIQYNRRQLLILLQNCTDEGEREAIQKSVTRSCLLEEEEESAGYTAQAS
GPLVEQNNLGSNSIFHSFPNWIAANQQTLPT

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_9|636_bp
atggtgcggttcaagcacaggtacctgctctgcgaactggtgtctgacgacccccgctgc
cgcctaagcctcgatgaccgagttctgagcagcctcgtacgggacacgatcgccagggtg
cacggaactttcggcgcagccgcctgctccatcggcttcgcggttcgatatctcaatgcc
tatactggaatagtgctacttcgatgcagaaaagaattctatcagcttgtgtggtcagct
cttcccttcatcacatacttggagaacaaaggacaccgttacccatgctttttcaacaca
ttacatgtgggagtctgtaaggtcgagatctttcacaagctgttgaccacagtaactcat
atgccaggtacaataagaacatgtcagaagttcctaattcagtacaacaggagacagctg
ttgatcttgttgcagaactgcactgatgaaggagagcgggaagctatccagaagtctgtg
acaagaagctgcttattagaggaggaggaggagtcagctggctacactgctcaggcttca
ggcccacttgttgaacagaacaatctgggtagcaacagcatcttccacagttttccaaac
tggatagctgccaaccagcagacattacccacttga

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_10|93_aa
MPAAKGQRRGLTPQPLLGSLPSRAAEPLSSPGEEADEALRGGKTKAQRKVHMPEIMQLIC
LVSSSQTDRKGLNFIIYKMGTRRVPTSFNGCKD

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_10|282_bp
atgccggccgcgaaggggcagcggcgggggctcacaccccaacccctcctcggttctctc
cccagccgggccgcagagccattatcctccccaggtgaggaagccgatgaggctctgaga
ggtgggaaaactaaggcccagagaaaggttcacatgcctgagatcatgcagctgatctgt
ttggtgagcagcagccagactgacaggaaaggcctcaatttcatcatctataaaatgggc
acaaggagagtacccacctcctttaatggttgtaaagactga

>gi568815586f:120396422_120598207|GENSCAN_predicted_peptide_11|105_aa
XQEAKAEEILEKGLEVWEYELRKNNFSDTGNFDFGIQEHINLGIKYDPSVGVYGLDFSAV
LSRPGFGITDKKHRTGCTGAKHRISKEETMHWLQQKCDGIMLPCK

>gi568815586f:120396422_120598207|GENSCAN_predicted_CDS_11|318_bp
nttcaagaggccaaagcagaagaaatcctggagaagggtctagaggtatgggagtatgag
ttgagaaaaaataacttctcagatactggaaactttgattttgggatccaggaacacatc
aatctgggtatcaaatatgacccaagtgtgggtgtctatggcctggacttctctgcggtg
ctgagtaggccgggtttcggcatcacagacaagaagcacaggacaggctgcactggggcc
aaacacagaatcagcaaagaggagaccatgcactggctccaacagaagtgtgatgggatc
atgcttccttgcaaataa