GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:38:32 Sequence gi568815582f:23736176_24314813 : 578638 bp : 43.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 18875 18941 67 1 1 93 68 87 0.991 8.47 1.02 Intr + 19486 19558 73 2 1 104 78 110 0.999 10.06 1.03 Intr + 19672 19752 81 1 0 74 68 88 0.947 4.15 1.04 Intr + 19888 20018 131 1 2 107 74 80 0.666 8.84 1.05 Intr + 20213 20274 62 0 2 103 52 71 0.974 3.45 1.06 Intr + 21026 21148 123 1 0 99 77 182 0.981 18.98 1.07 Term + 21355 21408 54 0 0 86 41 104 0.936 3.06 1.08 PlyA + 21497 21502 6 1.05 2.00 Prom + 30831 30870 40 -3.56 2.01 Init + 33461 33578 118 1 1 31 0 180 0.108 3.86 2.02 Intr + 50723 50823 101 0 2 62 84 57 0.134 2.53 2.03 Intr + 53449 53484 36 0 0 86 113 15 0.161 2.26 2.04 Intr + 70948 71017 70 0 1 114 81 55 0.192 6.15 2.05 Intr + 99942 100173 232 1 1 88 93 299 0.127 27.23 2.06 Intr + 101200 101231 32 1 2 129 102 36 0.978 6.87 2.07 Term + 121875 121951 77 1 2 -29 34 187 0.036 -0.40 2.08 PlyA + 122795 122800 6 1.05 3.00 Prom + 133524 133563 40 -2.36 3.01 Init + 146934 146972 39 2 0 78 115 46 0.527 6.48 3.02 Intr + 187135 187158 24 0 0 83 100 15 0.206 0.42 3.03 Intr + 189164 189282 119 1 2 117 58 88 0.066 7.96 3.04 Term + 206268 206337 70 0 1 92 43 91 0.099 2.41 3.05 PlyA + 206854 206859 6 1.05 4.00 Prom + 207415 207454 40 -6.96 4.01 Init + 210210 210282 73 2 1 17 110 56 0.273 1.93 4.02 Intr + 211813 211850 38 2 2 92 109 10 0.242 1.48 4.03 Intr + 214685 214730 46 1 1 92 62 31 0.222 -1.12 4.04 Intr + 216633 216742 110 1 2 70 64 93 0.156 5.10 4.05 Intr + 223872 223903 32 2 2 73 78 42 0.003 -1.37 4.06 Intr + 229615 229751 137 1 2 107 15 68 0.004 1.61 4.07 Intr + 231045 231068 24 1 0 103 73 29 0.003 0.90 4.08 Intr + 252333 252415 83 1 2 116 95 33 0.754 6.16 4.09 Intr + 257454 257569 116 2 2 97 100 73 0.988 8.65 4.10 Intr + 258233 258377 145 2 1 38 57 150 0.860 7.18 4.11 Intr + 261156 261243 88 2 1 -3 110 49 0.039 -2.46 4.12 Term + 279103 279215 113 2 2 122 39 60 0.263 3.22 4.13 PlyA + 280361 280366 6 1.05 5.00 Prom + 283426 283465 40 -3.36 5.01 Init + 295825 295872 48 0 0 82 97 18 0.496 3.05 5.02 Intr + 295961 296072 112 1 1 90 98 97 0.988 10.85 5.03 Intr + 299244 299372 129 1 0 111 86 230 0.999 25.77 5.04 Intr + 302882 302959 78 0 0 19 81 104 0.417 2.32 5.05 Term + 313015 313460 446 2 2 36 48 176 0.337 3.80 5.06 PlyA + 315548 315553 6 1.05 6.03 PlyA - 315948 315943 6 1.05 6.02 Term - 319474 319155 320 2 2 28 45 198 0.375 4.54 6.01 Init - 320618 320534 85 2 1 90 58 17 0.255 -0.12 6.00 Prom - 327461 327422 40 -3.16 7.00 Prom + 329737 329776 40 -5.36 7.01 Init + 348813 348831 19 2 1 46 116 3 0.277 -0.84 7.02 Intr + 356616 356772 157 1 1 93 87 176 0.795 17.07 7.03 Intr + 357988 358122 135 1 0 95 119 66 0.988 10.08 7.04 Intr + 376798 376894 97 1 1 118 78 125 0.995 14.51 7.05 Intr + 387660 387806 147 2 0 93 86 150 0.636 15.73 7.06 Intr + 418509 418682 174 2 0 128 48 220 0.834 22.14 7.07 Term + 420669 420728 60 2 0 94 48 40 0.701 -1.60 7.08 PlyA + 423814 423819 6 1.05 8.00 Prom + 427489 427528 40 -4.46 8.01 Init + 428311 428371 61 0 1 60 101 77 0.979 7.61 8.02 Intr + 430307 430331 25 0 1 103 111 17 0.831 2.58 8.03 Intr + 433220 433383 164 2 2 59 107 18 0.310 0.42 8.04 Intr + 436021 436186 166 2 1 23 75 185 0.418 9.72 8.05 Intr + 438343 438405 63 1 0 90 131 49 0.991 7.23 8.06 Intr + 444615 444753 139 0 1 77 78 150 0.984 13.37 8.07 Intr + 448936 449016 81 0 0 107 91 74 0.998 9.43 8.08 Intr + 449285 449392 108 1 0 83 98 163 0.999 17.28 8.09 Intr + 454915 455055 141 0 0 56 87 144 0.944 11.65 8.10 Term + 483786 483938 153 2 0 65 43 193 0.924 10.42 8.11 PlyA + 485753 485758 6 1.05 9.00 Prom + 485914 485953 40 -2.36 9.01 Init + 495916 495922 7 0 1 62 119 7 0.558 1.88 9.02 Term + 499758 499843 86 1 2 91 45 68 0.596 0.62 9.03 PlyA + 500823 500828 6 1.05 10.05 PlyA - 501100 501095 6 1.05 10.04 Term - 513870 513773 98 1 2 49 43 91 0.686 -1.17 10.03 Intr - 514244 514098 147 0 0 -20 14 500 0.229 32.01 10.02 Intr - 517619 517482 138 0 0 60 87 100 0.806 7.54 10.01 Init - 517685 517673 13 2 1 69 113 5 0.765 1.67 10.00 Prom - 533879 533840 40 -3.36 11.00 Prom + 538138 538177 40 0.14 11.01 Init + 549457 549459 3 0 0 108 81 0 0.197 1.30 11.02 Term + 557124 557258 135 2 0 84 41 99 0.402 2.82 11.03 PlyA + 559277 559282 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100173 173 1 2 92 93 292 0.866 29.01 S.002 Init - 228893 228819 75 2 0 65 107 67 0.823 7.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_1|196_aa MGSRSSHAAVIPDGDSIRRETGFSQASLLRLHHRFRALDRNKKGYLSRMDLQQIGALAVN PLGDRIIESFFPDGSQRVDFPGFVRVLAHFRPVEDEDTETQDPKKPEPLNSRRNKLHYAF QLYDLDRDGKISRHEMLQVLRLMVGVQVTEEQLENIADRTVQEADEDGDGAVSFVEFTKS LEKMDVEQKMSIRILK >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_1|591_bp atggggtcgcgcagctcccacgccgcggtcattcccgacggggacagtattcggcgagag accggcttctcccaagccagcctgctccgcctgcaccaccggttccgggcactggacagg aataagaagggctacctgagccgcatggatctccagcagataggggcgctcgccgtgaac cccctgggagaccgaattatagaaagcttcttccccgatgggagccagcgagtggatttc ccaggctttgtcagggtcttggctcattttcgccctgtagaagatgaggacacagaaacc caagaccccaagaaacctgaacctctcaacagcagaaggaacaaacttcactatgcattt cagctctatgacctggatcgcgatgggaagatctccaggcatgagatgctgcaggttctc cgtctgatggttggggtacaggtgacagaagagcagctggagaacatcgctgaccgcacg gtgcaggaggctgatgaagatggggatggggctgtgtccttcgtggagttcaccaagtcc ttagagaagatggacgttgagcaaaaaatgagcatccggatcctgaagtga >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_2|221_aa MQRPTGTYKDNVEPESVATSNLDDVSDWQEEPVPMTGTWCGYRNCTFFYPISSPPSLKFK VSKRTVFDDTGKRDHGGINLTGFKKMVKEVSRGTVALGQILANQVEQAARGPAAPGPAPL GLRLPARKMADPAAGPPPSEGEESTVRFARKGALRQKNVHEVKNHKFTARFFKQPTFCSH CTDFIWGFGKQGFQCQGLCNFRIDGDNDVDDEDDDDDDEGS >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_2|666_bp atgcaaagaccaacaggaacctacaaggacaacgtggaacctgagtctgttgccacctcc aatcttgatgatgtgagtgactggcaggaggagccagtgcccatgacaggcacctggtgt ggttatagaaactgcaccttcttctatccaatttcaagcccaccatctcttaaatttaag gtctccaaaagaacagtttttgatgacactgggaagagggatcatggtggtattaacctc acagggtttaagaagatggtaaaggaagtatctagaggaactgttgccttgggccaaatt ctggccaaccaggtggaacaagctgcccgcggtcccgcggccccggggccggcacctctc gggctccggctccccgcgcgcaagatggctgacccggctgcggggccgccgccgagcgag ggcgaggagagcaccgtgcgcttcgcccgcaaaggcgccctccggcagaagaacgtgcat gaggtcaagaaccacaaattcaccgcccgcttcttcaagcagcccaccttctgcagccac tgcaccgacttcatctggggcttcgggaagcagggattccagtgccaaggattgtgtaac ttcagaattgatggtgacaatgacgttgatgatgaggatgatgatgatgatgatgaaggc agctaa >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_3|83_aa MELIFSKKPDNTQMEETPLIKDLPIGALSDLTGHWVSVAMIITLFLDVTLLSLTRMPVFL RHVSLELGALEFEKWLDEGIEKH >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_3|252_bp atggagctgatattcagcaagaagccagacaatacacagatggaagagactcccctcatc aaggacttgcccattggggctctttcagacctgactggtcattgggtatctgtggccatg ataatcactctcttcctggacgtgaccctcctctcgttaaccaggatgcctgtgttctta agacatgtgagcctggagcttggagcattggagtttgagaaatggcttgatgagggcatt gagaagcactag >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_4|334_aa MTDAGTLLSSIISVVSYQVDLAVKARAQTCDLGSVNQGKVAVCEGVECLSIPGYCHGNAL YTNLDDGGKLAQELWECLDTMPPELAGLGACQAVEAQEESLTFVQLLLLFFTNCQLQLLC GTLGLEASFLVLWHHKALPQCHLSPKQMTLPPGICCFVVHKRCHEFVTFSCPGADKGPAS DTKPYYGNHSHLIGKLKCNGNNWVLEYQGPFGGTQLLKARVRPTMMGEAKRKPLELPQPR KTANQEQYYILRGTAEISAIINDLKDAEINEKLQQLNTGKTMNGPDPLRMKVWVTLPGEK IKAYITQVACSRLHDWKLPELVLNLHIPAIESDP >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_4|1005_bp atgacagatgccggtaccttgctgtcatccatcatatcagttgtcagttaccaggtagac ctagcggtaaaggctagagcacagacatgtgacctaggatctgtcaatcagggcaaggtt gctgtctgtgaaggagtggaatgcctgagcatccctggttattgccatggaaatgcactg tacacgaatctggatgatggagggaagctggctcaggagctatgggagtgtttggatacc atgccgcctgaattggcaggtttaggagcatgtcaggctgtggaagcacaggaggagagc ctgacttttgttcagctcctgcttctcttcttcaccaactgccagcttcagctgctttgt ggcaccctggggcttgaagcttcatttctggtgctgtggcaccacaaagctctgccccag tgtcacttgagtccaaagcagatgacgctgccaccaggcatttgctgctttgtggtgcac aagcggtgccatgaatttgtcacattctcctgccctggcgctgacaagggtccagcctcc gataccaaaccttactatgggaaccacagccacttaattggaaaacttaaatgcaatggg aataattgggtcctggagtatcaggggccatttggtggcactcaactgttaaaggcaaga gtgaggcccactatgatgggtgaagccaagaggaagccattagaactgccccagcctagg aaaacagccaaccaagagcaatactacatccttagagggacagcagagatcagtgccatc atcaatgacttgaaagatgcagagatcaatgaaaaactacaacaactcaatacaggcaaa actatgaatggcccagaccctttaagaatgaaggtttgggtcaccctgccaggtgagaaa atcaaggcttatataactcaagtagcttgctcgaggctgcacgactggaaattaccagag ctggtgttgaatttgcacattccagctatagaatctgacccctga >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_5|270_aa MVPTGTMTSGDQVQFLDPRSKHKFKIHTYSSPTFCDHCGSLLYGLIHQGMKCDTCMMNVH KRCVMNVPSLCGTDHTERRGRIYIQAHIDRDVLIVLEEEEREIILSPPKEGYSKKVAVCK PGRPGNGLCESGNPVPPWTCTLYHYPGTSGTCTLYHYPGTSGTCTLNHYPGTSGTCTLTT TLAHPGPAPLTTTLAHPGPATLTTNLAHPGPAPLTTNLAQPGHVPLTITLAQPGHAPLTT TLAHLGHARLTTTLASSGTCTLNYYPGLPP >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_5|813_bp atggtcccaacaggtacaatgacctctggggaccaagttcagttcttggacccccgcagc aaacacaagtttaagatccacacgtactccagccccacgttttgtgaccactgtgggtca ctgctgtatggactcatccaccaggggatgaaatgtgacacctgcatgatgaatgtgcac aagcgctgcgtgatgaatgttcccagcctgtgtggcacggaccacacggagcgccgcggc cgcatctacatccaggcccacatcgacagggacgtcctcattgtcctcgaggaagaagaa agagagatcattctgtctccaccaaaggaaggatacagcaagaaggtggccgtctgcaag ccaggaagaccagggaatgggttatgtgaatctggcaaccctgtcccaccctggacctgc accctttaccactaccctggcacatccgggacctgcaccctttaccactaccctggcaca tccgggacctgcacccttaaccactaccctggcacatccgggacctgcaccttaaccact accctggcacacccgggacctgcacccttaaccactaccctggcacatccaggacctgca accttaaccactaacctggcacatccaggacctgcaccgttaaccactaacctggcacaa ccaggacatgtgcccttaactattaccctggcacaaccgggacatgcacccttaaccact accctggcacatctgggacatgcacgcttaaccactactctagcatcatccgggacctgc acccttaactactaccctggtctgcctccttga >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_6|134_aa MVLASTSGEGLRKLPIMVQGERGARVSHACLGRDFSNNGLRQWIRLERRNEIKGALILDG GNRLSALIPSPCEATFSELKQASSKGQRSRRESPNPAIRGCRMQLVRGGGTDNSSQFQGH LPSFPSASQNPAGS >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_6|405_bp atggtgctggcatccacttctggtgagggactcaggaagcttccaatcatggtacaagga gaaaggggagcccgtgtatcacatgcctgcttaggtcgggatttctccaacaacgggcta agacagtggattcgtctggaaagaagaaatgaaatcaaaggagcattgatcctcgatggg gggaacaggctctcagctcttattccaagtccctgtgaagccacattctctgagctgaag caggcatccagcaagggccagagaagccgacgtgaaagcccaaacccagccatccgtggc tgccgcatgcagctcgtcaggggaggaggaacagacaatagcagtcagttccagggccat ctcccttccttccctagtgcttctcagaacccagcaggcagctga >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_7|262_aa MLKRQEVRDAKNLVPMDPNGLSDPYVKLKLIPDPKSESKQKTKTIKCSLNPEWNETFRFQ LKESDKDRRLSVEIWDWDLTSRNDFMGSLSFGISELQKASVDGWFKLLSQEEGEYFNVPV PPEGSEANEELRQKFERAKISQGTKVPEEKTTNTVSKFDNNGNRDRMKLTDFNFLMVLGK GSFGKVMLSERKGTDELYAVKILKKDVVIQDDDVECTMVEKRVLALPGKPPFLTQLHSCF QTMMDSLTSSGKGQSPSSKSTK >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_7|789_bp atgttgaagagacaggaagtaagagatgctaaaaaccttgtacctatggaccccaatggc ctgtcagatccctacgtaaaactgaaactgattcccgatcccaaaagtgagagcaaacag aagaccaaaaccatcaaatgctccctcaaccctgagtggaatgagacatttagatttcag ctgaaagaatcggacaaagacagaagactgtcagtagagatttgggattgggatttgacc agcaggaatgacttcatgggatctttgtcctttgggatttctgaacttcagaaagccagt gttgatggctggtttaagttactgagccaggaggaaggcgagtacttcaatgtgcctgtg ccaccagaaggaagtgaggccaatgaagaactgcggcagaaatttgagagggccaagatc agtcagggaaccaaggtcccggaagaaaagacgaccaacactgtctccaaatttgacaac aatggcaacagagaccggatgaaactgaccgattttaacttcctaatggtgctggggaaa ggcagctttggcaaggtcatgctttcagaacgaaaaggcacagatgagctctatgctgtg aagatcctgaagaaggacgttgtgatccaagatgatgacgtggagtgcactatggtggag aagcgggtgttggccctgcctgggaagccgcccttcctgacccagctccactcctgcttc cagaccatgatggattcattgacatcatcaggcaaaggacagtctccgtccagtaaatca actaagtaa >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_8|366_aa MTILKKPFKVKQTSQFEEKDAIIAIKSGTVRPRTFVLAYVTLHDASLSTTLGSSPTLSPF FFLLQAHEVCSSLRAFAQPVSPPALPPLSIWSPRSYGMLMLLFAVLFQDRLYFVMEYVNG GDLMYHIQQVGRFKEPHAVFYAAEIAIGLFFLQSKGIIYRDLKLDNVMLDSEGHIKIADF GMCKENIWDGVTTKTFCGTPDYIAPEIIAYQPYGKSVDWWAFGVLLYEMLAGQAPFEGED EDELFQSIMEHNVAYPKSMSKEAVAICKGLMTKHPGKRLGCGPEGERDIKEHAFFRYIDW EKLERKEIQPPYKPKARDKRDTSNFDKEFTRQPVELTPTDKLFIMNLDQNEFAGFSYTNP EFVINV >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_8|1101_bp atgacaattttgaagaagccctttaaagtcaagcagactagccagtttgaagagaaagat gcaataatagccatcaaatcaggaacagtgagaccccgaacctttgtgttagcctacgtg accttacatgatgcgtccctcagcaccaccctgggctcatctcctaccctgtcccccttc ttctttcttctccaagcacatgaggtgtgctctagcctcagggccttcgcacagcctgtt tctccgcctgctcttcccccactgtccatttggagccccagaagctacgggatgctaatg ttgctgtttgctgtcctcttccaggaccgcctgtactttgtgatggagtacgtgaatggg ggcgacctcatgtatcacatccagcaagtcggccggttcaaggagccccatgctgtattt tacgctgcagaaattgccatcggtctgttcttcttacagagtaagggcatcatttaccgt gacctaaaacttgacaacgtgatgctcgattctgagggacacatcaagattgccgatttt ggcatgtgtaaggaaaacatctgggatggggtgacaaccaagacattctgtggcactcca gactacatcgcccccgagataattgcttatcagccctatgggaagtccgtggattggtgg gcatttggagtcctgctgtatgaaatgttggctgggcaggcaccctttgaaggggaggat gaagatgaactcttccaatccatcatggaacacaacgtagcctatcccaagtctatgtcc aaggaagctgtggccatctgcaaagggctgatgaccaaacacccaggcaaacgtctgggt tgtggacctgaaggcgaacgtgatatcaaagagcatgcatttttccggtatattgattgg gagaaacttgaacgcaaagagatccagcccccttataagccaaaagctagagacaagaga gacacctccaacttcgacaaagagttcaccagacagcctgtggaactgacccccactgat aaactcttcatcatgaacttggaccaaaatgaatttgctggcttctcttatactaaccca gagtttgtcattaatgtgtag >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_9|30_aa MAVYAFKQLFWGESVGAHTGRSESLATCLL >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_9|93_bp atggctgtctatgccttcaagcagcttttctggggagagagtgttggcgcccacactggc agatctgagtccctggccacgtgcctgctgtga >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_10|131_aa MLMKDPQWNSSIFISTHLALQQKRRQAEYGAQEPSRRIVGGKGSGAQVDEEEEEEEEEEE KEEEKEEEGEEEEEEEEEEEEEEEEEEEEEEEEEEDVVVESSDDQAKTTTSSLGFISILY GCQTIVCIRVA >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_10|396_bp atgttgatgaaagacccacagtggaacagcagcatcttcatctccacccacttagctcta caacagaagcgcagacaagcagagtatggtgcccaggagcctagcagaaggatagttggg ggaaaaggatcaggagctcaggtagatgaagaagaagaggaggaggaggaggaggaggag aaggaggaggaaaaagaagaagaaggagaagaagaggaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagatgttgttgttgag tcttccgatgatcaggccaagactaccacatcctcgttgggcttcatctccatcctctat ggttgccaaaccatagtgtgcattagagtcgcttag >gi568815582f:23736176_24314813|GENSCAN_predicted_peptide_11|45_aa MKERMGLDDLNGPPTLMFILWFSLALPLYLTANEEHRLGSHTAYI >gi568815582f:23736176_24314813|GENSCAN_predicted_CDS_11|138_bp atgaaggagaggatgggtctagatgacctcaatggtcctccaactctgatgttcatcctc tggttctctttagccttgcccctctacttgacagccaatgaagagcacagacttgggagc cacactgcctacatttga