GENSCAN 1.0	Date run: 4-Nov-116	Time: 13:34:19

Sequence gi568815597r:53760067_53983278 : 223212 bp : 42.35% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 Intr -    511    481   31  0  1   89   74    45 0.002   0.41
 1.06 Intr -  12423  12263  161  0  2   88  100   140 0.765  13.06
 1.05 Intr -  36738  36623  116  1  2   40   91    47 0.061  -0.55
 1.04 Intr -  37078  36833  246  2  0   83   85    52 0.079   0.91
 1.03 Intr -  40782  40627  156  1  0   54   89    96 0.275   5.36
 1.02 Intr -  43943  43862   82  2  1   75   72    32 0.246  -1.41
 1.01 Init -  44376  44257  120  0  0   33   73    97 0.703   2.94
 1.00 Prom -  45446  45407   40                              -4.05

 2.00 Prom +  49183  49222   40                              -7.65
 2.01 Init +  55540  55925  386  0  2   54   26   235 0.064  10.26
 2.02 Intr +  78148  78233   86  1  2   35   20   196 0.008   6.04
 2.03 Intr +  78256  78486  231  2  0   61   43   228 0.130  12.42
 2.04 Intr +  88259  88419  161  0  2   82   49    59 0.000   0.19
 2.05 Intr +  95097  95283  187  2  1   52   92   166 0.978  11.74
 2.06 Term +  96932  97071  140  0  2   37   41   178 0.910   5.14
 2.07 PlyA +  97610  97615    6                               1.05

 3.12 PlyA -  98399  98394    6                               1.05
 3.11 Term - 100087  99998   90  1  0  143   39    87 0.970   6.04
 3.10 Intr - 106316 106134  183  2  0  125   52   147 0.525  13.86
 3.09 Intr - 106858 106692  167  2  2   79   92    71 0.098   5.36
 3.08 Intr - 107881 107828   54  2  0   70   77    51 0.058   0.23
 3.07 Intr - 111422 111306  117  0  0  118   81    82 0.480  10.02
 3.06 Intr - 113184 113068  117  1  0   94   34   103 0.265   5.02
 3.05 Intr - 118336 118204  133  1  1   55   92    33 0.132  -0.30
 3.04 Intr - 123210 123047  164  1  2   45   20   201 0.314   7.77
 3.03 Intr - 129788 129647  142  2  1   43   78   171 0.450  10.61
 3.02 Intr - 134387 134130  258  2  0   54   63   260 0.927  16.84
 3.01 Init - 135830 135681  150  2  0   62   60   154 0.887  10.09
 3.00 Prom - 140015 139976   40                              -6.75

 4.00 Prom + 144226 144265   40                              -0.95
 4.01 Init + 144650 144743   94  1  1   -8   92    80 0.280  -0.61
 4.02 Intr + 146029 146228  200  2  2   92   36   171 0.497  10.45
 4.03 Intr + 150421 150554  134  0  2   -8   92   131 0.072   2.42
 4.04 Intr + 152871 152964   94  0  1   51   68    51 0.097  -1.55
 4.05 Term + 153028 153198  171  0  0   50   54   159 0.278   5.54
 4.06 PlyA + 153293 153298    6                               1.05

 5.00 Prom + 155816 155855   40                              -4.15
 5.01 Init + 157576 157578    3  0  0  117   81     0 0.145   2.25
 5.02 Intr + 186383 186483  101  1  2  107   96    40 0.746   4.69
 5.03 Intr + 187722 187759   38  0  2  117   72    29 0.128   1.19
 5.04 Intr + 191920 192406  487  2  1   85   94   438 0.110  35.24
 5.05 Intr + 198083 198214  132  2  0   84   52   129 0.991   7.74
 5.06 Intr + 200290 200408  119  1  2   86  103    48 0.988   5.29
 5.07 Intr + 201968 202056   89  0  2   55  116    37 0.987   1.97
 5.08 Intr + 202230 202343  114  2  0   82   78   104 0.991   8.52
 5.09 Intr + 203078 203143   66  1  0   68  115    14 0.101   0.28
 5.10 Intr + 206230 206314   85  0  1   48   28   132 0.110   1.67
 5.11 Term + 207599 207873  275  0  2  109   38   285 0.863  20.25
 5.12 PlyA + 208079 208084    6                               1.05

 6.04 PlyA - 208388 208383    6                               1.05
 6.03 Term - 215398 215000  399  1  0   10   48   439 0.377  26.23
 6.02 Intr - 216001 215840  162  1  0   38   40   166 0.205   6.05
 6.01 Init - 216033 216028    6  0  0   89   83     0 0.466   0.53
 6.00 Prom - 216484 216445   40                             -10.45

 7.00 Prom + 216803 216842   40                              -1.95
 7.01 Sngl + 219789 220139  351  2  0   67   42   233 0.995  12.40
 7.02 PlyA + 221275 221280    6                               1.05

 8.00 Prom + 221773 221812   40                              -9.15
 8.01 Sngl + 222540 222743  204  2  0   98   53   238 0.573  14.89
 8.02 PlyA + 222970 222975    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  16092  16090    3  0  0  113   81     0 0.811   1.85
S.002 Sngl +  55540  55929  390  0  0   54   32   233 0.839  10.27
S.003 Init +  78149  78233   85  1  1   37   20   196 0.851   8.96
S.004 Term +  78256  78515  260  2  2   61   34   247 0.826  11.33
S.005 Init +  93686  93755   70  1  1   61   97    67 0.973   6.17
S.006 Intr +  93839  93903   65  0  2  120   96   -18 0.962  -0.28
S.007 Init + 191856 192406  551  2  2   84   94   454 0.836  40.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_1|304_aa
MVASKIFRFLVGRTVWWVESDGTSKHSITIHDVSAIVESYYLALQDLMLLSQYSPSRRQE
VFSLSQPGGHPHNWTAISRECLNLLNGMTQKLILYQEAAATNGRVSSSYPVEPKKLNSPE
ETAFQTPKSSQMPRPSVPPLVKTSLFSSKLSTPDVVSPFGTPFGSSVMNRMAGIFDVNTC
YGSPQSPQLIRRGPRLWTSASDQQMTEFSNPSPSTSISAEGKTMRQPSVIYSWIQNKREQ
AVDKYFKLPHASSKPPRISGSLVDTSYKTLRFAFRASLKTAIYRITTTFGEHLKIVRLPQ
RHGT
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_1|912_bp
atggtagcaagtaagattttccgttttcttgtgggccgcacagtttggtgggtggaatca
gatgggacaagtaaacattcaattacaatacatgatgttagtgccattgttgagagctat
tatttagccttgcaggacctgatgttgctttctcaatattctccttcacgaagacaagaa
gttttcagcctcagccaaccaggtggacatccccacaattggacagccatttcaagggag
tgtttgaatcttttaaatggtatgactcagaaactgattctctatcaagaagctgctgct
acgaatgggagagtgtcttcatcttacccagtggaacctaagaaattaaattctccagaa
gaaactgcttttcagacaccaaaatctagccagatgcctcggccttcagtgccaccatta
gttaaaacatcactgttttcttcaaaattatctacacctgatgttgtgagcccatttggg
accccatttggctctagtgtaatgaatcggatggctggaatttttgatgtaaacacctgc
tatgggtcaccgcaaagtcctcagctaataagaagggggccaagattgtggacatcagct
tctgatcagcaaatgactgaattttctaatccttctccatctacctctattagtgctgag
ggtaagacaatgagacaacccagtgtgatttattcatggattcagaataaacgtgaacag
gcagtcgacaagtactttaagcttcctcatgcttccagtaaaccaccccggatttcagga
agccttgtggacacttcatataaaacattaagatttgcattcagagcatcactgaaaact
gccatctatcgaataactactacatttggtgaacatctgaagattgtgaggctcccccag
cgacatggaact
>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_2|396_aa
MRQEKEFKGIQISKKEVKLSLFADDMIVYLKNPKGSSRKFLELIKEFSKVSSYKINAHKS
VALLYTNSDQVENQIKNSNPFRIAAKKIKYLRVYLTKESKDLYKENYKTLLKEIRQRKQM
ETHPMLMDGMSRDLPAQGRLTAVAMEMAAPSLGRTGDLQRQEGPRRTLLGLPQTLPERSP
AQAPTHPCEASSGQAQQARTAAQTRPANAAVTPGTALWRPNGPFEAGSSGAPEADTLLLS
CGSFHKNADTMSVLALPEDSLCLNFWVCGDTFSPFVVDADLFLILLSQEDLMTKDSPEAL
SASLIKALITLRYDYLLMALLRTAPPVTAPGKLTPIWMRSQALKPKQGFPDSIASLSLYG
FDEVSDHTERPTWQATEGGLQMTVSKKALILRSHRN
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_2|1191_bp
atgagacaagagaaagaatttaaaggcatccaaatcagtaaaaaggaagtcaaactgtca
ctgtttgctgatgatatgatcgtttaccttaaaaaccctaagggctcctccagaaagttc
ctagaactgataaaagaattcagcaaagtttccagctacaagattaatgcacacaaatca
gtagctcttctatacaccaacagtgaccaagtggagaatcaaatcaagaactcaaaccct
tttagaatagctgcaaaaaaaataaaatacttaagagtatacctaaccaaggagtcgaaa
gatctctacaaggaaaactacaaaacattgctgaaagaaatcaggcaacgcaaacaaatg
gaaacacatcccatgctcatggatggtatgtcccgcgacctgccggcgcagggccggctc
acggccgtggccatggagatggcggcccctagtctagggcgtacaggagaccttcaaagg
caggaaggccccaggaggaccctgctgggtctgcctcaaacgctgcctgagcgaagcccg
gcccaggctccaacccatccctgcgaggcttcgtctggacaagctcaacaggccaggact
gccgcgcagacacgtcccgcaaacgcggctgtaactccaggaaccgccctctggcgacca
aacggtcccttcgaagcagggtccagcggggcgccagaagcagataccctgttgctttcc
tgtggttctttccataaaaatgctgataccatgtcagtcttggcgctgccagaagattcc
ctctgcttgaatttttgggtttgtggggataccttttcaccttttgttgtagatgccgat
ttgtttttgattttgctatcccaggaggatttaatgacaaaggactctccagaagccctc
tctgcatcattaattaaagccctgatcacactgcgatatgattatctgctgatggccctc
ctccggacagccccacctgtcactgcccctggaaagctcaccccaatctggatgcggtct
caagccctcaagccaaagcagggcttcccagactcgattgcctctctctccctttatggc
tttgatgaagtgagtgatcatacagagaggcccacgtggcaagcaactgagggtggtctc
caaatgaccgtcagcaagaaggccctcattctacgaagccacaggaattga
>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_3|524_aa
MNEGDIDDDGEGKGILDQPMSAKAKTRESTRDVQGPTVCPKEMSVAGSSESLPTDLQDEP
EILGAEKGWYPVVVAEMGVPGHTRLLAHGQDVPLDSVWKQYQKHFTHDHMHSHLQEHPEP
LQPQPWLGQPHLGKAREAGLFPEEAERFRPPDLRLPATGLDRNRRGATEARAFSGPGGRL
TPREFGNAATSLTANPDATTVNIEDPGETPKHQPGSPRGSGREEDDELLGNDDSDKTEVF
DRIKGSLLPIPGKNFVRLYIRSNPDLYGMFSTDLFTVFQLNLGTLYYLLRLDILLIVSPE
CMGALEAVTVSLLPIIVSPAEGPFWICATLVFAIAISGNLSNFLIHLGEKTYHYVPEFRK
EEGKGILQGVLLLEEEERVSIAATIIYAYAWLVPLALWGFLMWRNSKVMNIVSYSFLEIV
CVYGYSLFIYIPTAILWIIPQKAVRWILVMIALGISGSLLAMTFWPAVREDNRRVALATI
VTIVLLHMLLSVGCLAYFFDAPEMDHLPTTTATPNQTVAAAKSS
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_3|1575_bp
atgaatgagggggatatagacgacgatggagaggggaagggcattcttgaccagccgatg
tcagcaaaggcaaagacaagggaaagtacaagagatgtacaaggcccaacagtttgtccg
aaagaaatgagtgtggcggggagctcagagtcgctgccaacggaccttcaagacgaacca
gaaatactgggtgctgaaaaaggttggtatccagttgtcgtggctgaaatgggggttcct
ggtcatacccgtcttctcgcccatggccaggatgttccgcttgactctgtctggaaacaa
tatcagaagcactttacccacgaccacatgcacagccacctccaagagcacccagagcct
cttcagccacagccctggctggggcagccccatctcggcaaagccagagaggcggggctc
ttcccggaggaggcggagaggttccgccccccggatctccggcttccggcaactgggctg
gaccgaaaccggcgcggagcaactgaggcccgagccttctcgggacccgggggacgccta
accccgcgagaatttggcaatgcagccacttctctgacagcaaacccagatgccaccaca
gtaaacattgaggatcctggtgaaaccccaaaacatcagccaggatccccaagaggctca
ggaagagaagaagatgatgagttactgggaaatgatgactctgacaaaactgaggtcttt
gacagaattaaaggatctcttttgccaatacccgggaaaaactttgtgaggttatatatc
cgcagcaatccagatctctatggtatgtttagtactgatttgtttaccgtatttcagttg
aacttgggcaccctgtactaccttctgagattagatattttattgatcgtctccccggag
tgcatgggtgcactagaggcagtgactgtgtctctgttgcccatcatcgtgtccccagca
gagggccccttttggatatgtgccacgttggtctttgccatagcaattagtgggaatctt
tccaacttcttgatccatctgggagagaagacgtaccattatgtgcccgaattccgaaaa
gaagagggaaagggaatcctccagggcgtgctgctactagaggaagaggagagagtgtcc
atagcagctaccatcatctatgcctatgcctggctggttcctcttgcactctggggtttc
ctcatgtggagaaacagcaaagttatgaacatcgtctcctattcatttctggagattgtg
tgtgtctatggatattccctcttcatttatatccccaccgcaatactgtggattatcccc
cagaaagctgttcgttggattctagtcatgattgccctgggcatctcaggatctctcttg
gcaatgacattttggccagctgttcgtgaggataaccgacgcgttgcattggccacaatt
gtgacaattgtgttgctccatatgctgctttctgtgggctgcttggcatacttttttgat
gcaccagagatggaccatctcccaacaactacagctactccaaaccaaacagttgctgca
gccaagtccagctaa
>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_4|230_aa
MFKFDQFKRLIEDFSSIADFLVIYIEEAHASDGWAFKNNMDIRNHQNLQDRLQAAHLLLA
RSPQCPVVVDTMQNQSSQLYAALPERLYIIQEGRILYKNQLWEEFQLLGRDEELEEEGLE
TGGGSHPLLAKRFKPVHIGGDDGNHTLRTAAIVGVVGALPRCLLPSHCTHEPAAELSYPE
IPVKLCPSPECGPQPVTDGFEIPKPFPLLQGGAVYFSGASHASELPMGSD
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_4|693_bp
atgttcaaatttgaccagttcaagaggcttattgaagactttagttccatagcagatttt
cttgtcatttacattgaagaagcacatgcatcagatggctgggcttttaagaacaacatg
gacatcagaaatcaccagaaccttcaggatcgcctgcaggcagcccatctactgctggcc
aggagcccccagtgccctgtggtggtggacaccatgcagaaccagagcagccagctctac
gcagcactgcctgagaggctctacataatccaggagggcaggatcctctacaagaaccaa
ttatgggaagaattccagttgttaggaagagatgaggagttggaagaggagggattagaa
acaggaggaggcagtcatcctctccttgccaaaagatttaaacctgtccacattggtggt
gatgatgggaaccacactttgagaaccgctgctatagtaggtgttgttggtgctctgccc
agatgccttttaccaagccattgtactcatgagccagctgctgagctatcttacccagaa
atacctgtgaaattatgtccttcacctgagtgtggcccacagccagtgactgatggattt
gaaattccaaagccttttcccttgctgcaaggtggggcagtctacttctccggtgccagt
catgcttcagagctccctatgggatcagactga
>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_5|502_aa
MLCARGAEPFPVSRAAGGGSRHRGAAARCPGQGHRLHPVNTRGIEQVVATKAMSYYLSSE
NHLDPGPIYMRENGQLHMVNLALDGVRSSLQKPRPFRLFPKGFSVELCMNREDDTARKEK
TDHFIFTYTREGNLRYSAKSLFSLVLGFISDNVDHIDSLIGFPEQIAEKLFSAAEARQKF
TEPGAGLRALQKFTEAYGSLVLCSLCLRNRYLVISEKLEEIKSFRELTCLDLSCCKLGDE
HELLEHLTNEALSSVTQLHLKDNCLSDAGVRKMTAPVRVMKRGLENLTLLDLSCNPEITD
AGIGYLFSFRKLNCLDISGTGLKDIKTVKHKLQTHIGLVHSKVPLKEFDHSNCKTEGWAD
QLQAIMQDGRTIHLLDCSVTSPQIVLQWERVTAEAVKPRETSEPRAAAQRFYGKRSRAEA
PLKCPLADTHMNSSEKLQFYKEKAPDCHGPVLKHEAISSQESKKSKKRPFEESETEQNNS
SQPSKQKYVCLAVEDWDLLNSY
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_5|1509_bp
atgctttgtgcccggggtgctgagcccttccccgtcagccgggccgcgggaggagggagc
cgtcaccgaggagctgccgctcgctgccccgggcaggggcacaggcttcatccagtgaat
actagagggatcgaacaggtggttgcaaccaaggcaatgtcttactacctcagctcagaa
aaccacctggacccagggcccatctacatgcgagaaaatgggcagctgcacatggtcaat
ctggctctggatggtgtcaggagtagcctgcagaagccaaggcctttcagactgttcccc
aaaggcttttctgtggagctttgcatgaacagggaagacgacactgcacggaaagagaag
actgatcatttcatcttcacatacacccgagaggggaatcttcggtactccgccaaatcc
ctcttcagccttgtcctgggtttcatctccgacaatgtggatcacattgattcccttatt
ggctttcctgagcagattgctgaaaagctgttctctgctgctgaagccagacagaaattc
actgagccaggtgcagggctgagggctttacagaaattcactgaggcctatggaagtttg
gtgctttgctccctgtgtttgcgaaacaggtatctcgtgatttcagaaaagcttgaggag
attaagtctttccgggagctgacctgcctggatctttcctgttgcaagcttggagatgag
catgaacttctagaacatctcaccaatgaagccctgtctagtgtaactcagctccacctg
aaggataattgtttatctgatgctggggtgcggaagatgacagcaccagttcgagtgatg
aaaagaggccttgagaatctaacattattagacttatcatgtaaccctgagatcacagat
gcaggcattggatacctcttttcttttaggaaactaaactgcttagatatctctgggaca
gggctcaaggacatcaaaaccgtcaagcacaagctccagacccacataggccttgttcac
tccaaagtgcctttgaaggaatttgatcatagtaactgcaagacagagggctgggctgac
cagctccaagccataatgcaggatggtagaactatccacctgctggattgtagcgtcacc
tctccacagatcgttctgcagtgggagcgtgtgactgcggaagctgtgaagccacgggag
acctcggagcctagagcagcagctcagcgcttctatgggaagcggtctcgagcagaagcc
ccactgaagtgtcccctggcagacacccacatgaactcttccgagaaactccagttctat
aaagagaaagccccagattgccatgggccagtgttgaaacacgaagctatctcaagccag
gagtcaaagaagagcaagaagagaccttttgaggagtcagagacagaacagaataactct
tcacaaccttcaaagcagaaatatgtatgtcttgctgtggaagactgggacttgttaaat
tcctattga
>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_6|188_aa
MEPGSGHRRCCQGEEGHDPKEREQLRKPFIGGLSFETTDDGLREHFEKWVTLTDCVRGCG
DGSGNFIVEKTLEVVEVILAVVETLVEEEAMVVEVVAAEIVMEEVMVDIMDLEVMVATMA
AVLVIVVEGAMVVVDQDVETKVVDMVAVVEDMMVTMKEEILAVVTMVVVGTIMILEIIMD
NSNQIKDS
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_6|567_bp
atggagcccggctccggccatcgccgttgctgccagggggaggagggccatgatccaaag
gaacgagagcagttgagaaaaccgtttattggtggtctgagctttgaaactacagatgat
ggtttaagagaacattttgagaaatgggtcacactcacagattgtgtgagaggttgtgga
gatggatctggcaattttattgtggaaaaaactttggaggtggtggaggtaattttggct
gtggtggaaactttggtggaggaggaggctatggtggtggaggtggtggcagcagagata
gttatggaggaggtgatggtggatataatggatttggaggtgatggtggcaactatggcg
gcggtcctggttatagtagtagagggggctatggtcgtggtggaccaggatgtggaaacc
aaggtggtggatatggtggcggtggtggaggatatgatggttacaatgaaggaggaaatt
ttggccgtggtaactatggtggtggtgggaactataatgattttggaaattataatggac
aacagcaatcaaattaaggactcatga
>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_7|116_aa
MVKERGNRTVQPYGNLERGESQEAKDCSAPPPGPEQVKETGRHQPFDLELRRWAEKNLPE
KKAILYQEARSTTHVHMLQGASQENESGLWPPSSRLHHSLPFGKAVAPKEQKETKR
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_7|351_bp
atggtcaaagaaagaggaaacaggacagtgcagccttatggaaacctagagaggggagag
tcccaagaagccaaggactgctcagcgccgcctcctggtccagagcaggtaaaggagaca
ggcagacaccagccatttgatttggagttgaggaggtgggcagagaagaacctcccagag
aagaaagccattttatatcaggaagcccgtagcacaacacacgttcatatgctgcaagga
gcaagccaggagaatgagagcggtctctggccaccaagctccaggctgcatcattccttg
cccttcgggaaagcggtggcccccaaggagcaaaaggaaaccaagcgataa
>gi568815597r:53760067_53983278|GENSCAN_predicted_peptide_8|67_aa
MAPAACGGSFQAAAQGRGAEAEPNRIPELKQQELPGQHTGEEERHRDISGDNAQRENFSG
VLLSMCV
>gi568815597r:53760067_53983278|GENSCAN_predicted_CDS_8|204_bp
atggctccagctgcctgcgggggaagtttccaggctgcagcacagggaaggggagcagag
gcggagcccaacagaattcctgaattgaagcagcaggagctcccaggacagcatactgga
gaggaggagcggcacagagacatctccggagataatgcgcagagagagaacttcagcgga
gtgctgctcagcatgtgcgtgtga