GENSCAN 1.0    Date run: 7-Nov-116    Time: 04:48:13

Sequence gi568815592f:153939359_154218718 : 279360 bp : 38.16% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4870   5031  162  0  0   96  105    93 0.956  11.05
 1.02 Intr +  15553  15908  356  0  2   20    8   235 0.266   1.86
 1.03 Intr +  16231  16381  151  1  1   63  107    83 0.623   6.94
 1.04 Term +  18214  18444  231  0  0   97   41   120 0.838   3.69
 1.05 PlyA +  21829  21834    6                                1.05

 2.02 PlyA -  22057  22052    6                                1.05
 2.01 Sngl -  23967  23734  234  0  0    3   32   726 0.976  53.35
 2.00 Prom -  46837  46798   40                               -0.75

 3.00 Prom +  53336  53375   40                               -5.15
 3.01 Sngl +  63602  63961  360  1  0   73   50   168 0.923   7.32
 3.02 PlyA +  64574  64579    6                                1.05

 4.04 PlyA -  65199  65194    6                                1.05
 4.03 Term -  87988  87815  174  1  0   40   51   102 0.463  -1.52
 4.02 Intr -  88903  88700  204  1  0   40   78   147 0.881   7.37
 4.01 Init -  90404  90399    6  2  0   84   78     0 0.743  -0.47
 4.00 Prom -  90460  90421   40                               -3.65

 5.00 Prom +  95668  95707   40                               -2.75
 5.01 Init +  99911  99947   37  1  1   66   64    67 0.387   0.32
 5.02 Intr + 100053 100476  424  1  1   95  110   426 0.590  37.50
 5.03 Intr + 150468 150820  353  0  2  126  113   302 0.988  30.54
 5.04 Intr + 151594 152114  521  2  2  112  101   441 0.803  39.54
 5.05 Intr + 167089 167122   34  0  1   76  107    -3 0.003  -2.72
 5.06 Intr + 179325 179540  216  1  0  136  106    25 0.090   6.65
 5.07 Intr + 196260 196333   74  1  2   82   77    40 0.278   0.51
 5.08 Term + 197234 197428  195  1  0   54   49   153 0.674   4.43
 5.09 PlyA + 198025 198030    6                                1.05

 6.03 PlyA - 198089 198084    6                                1.05
 6.02 Term - 220682 220470  213  2  0   80   53   293 0.997  21.25
 6.01 Init - 223356 223255  102  0  0   71   83    70 0.366   5.09
 6.00 Prom - 224893 224854   40                               -1.85

 7.03 PlyA - 227523 227518    6                                1.05
 7.02 Term - 228755 228349  407  0  2   78   47   272 0.669  16.46
 7.01 Init - 229109 228965  145  2  1   92   11    28 0.204  -4.17
 7.00 Prom - 229428 229389   40                               -8.75

 8.00 Prom + 231750 231789   40                               -6.55
 8.01 Init + 233847 233992  146  2  2   77   98   129 0.869  12.44
 8.02 Intr + 234354 234568  215  0  2  -16   53    82 0.110  -8.26
 8.03 Term + 236151 237193 1043  1  2   43   48   331 0.287  15.90
 8.04 PlyA + 237225 237230    6                                1.05

 9.07 PlyA - 238492 238487    6                                1.05
 9.06 Term - 241252 241116  137  2  2   21   42   116 0.163  -2.50
 9.05 Intr - 242383 242309   75  2  0   80   39    72 0.121   0.07
 9.04 Intr - 260567 260310  258  0  0   53   92   189 0.933  12.31
 9.03 Intr - 267568 267445  124  1  1   76   43    97 0.603   3.24
 9.02 Intr - 273497 273412   86  0  2   86   76   104 0.791   7.62
 9.01 Intr - 274918 274860   59  0  2   62   95    68 0.817   2.51

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 261760 261706   55  1  1   92   70    46 0.807   4.70

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_1|299_aa
YDGIALMMLSIDQEPGEPTICEHYLNSVIFNCMLFIPQGVVGNVWRHVGLSQHLPKSRPI
TSSEIEAVISCLTTKKSPGPDGFTAKFYRRYEEDLAQFLLKLFQTIEKEGLLPNSFHEAS
IILTQKPGRDTTEKENFRPISLMSINVKILNKILANRIQQQTESDCSLSTAINIGSSGHG
ILAKEKIKGIQIGREEVKLSLFADDMIVYLENHIASAQNLLKLSIHQVTHGKPLEGTETP
ENSVTSALDLLAGAHSHPVECAFIFNKPLLLLLHSSRALFVCFIQLFVQNAKNLDTLNW

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_1|900_bp
tatgatggcattgctttaatgatgctgagtatagatcaagaaccaggagagccaactatt
tgcgaacattaccttaattcagtgattttcaactgcatgctgtttattccccagggtgta
gttggcaatgtctggaggcatgttgggctgtcacaacatctgcccaagagtagaccaata
acaagctctgaaattgaggcagtaattagttgcctaacaaccaaaaaaagcccaggacca
gatggattcaccgccaaattctaccggaggtatgaagaggacctggcacaattcctcctg
aaattattccaaacaatagaaaaagagggactcctccctaactcatttcatgaagccagc
atcatcctgacacaaaaacctggcagagacacaacagaaaaagaaaatttcaggccaata
tccctgatgagcatcaatgtgaaaatcctcaataaaatactggcaaaccgaatccagcag
caaacagaatccgattgcagcttatccaccgcaatcaatattggaagttctggccacggc
attctggcaaaagaaaaaataaaaggtattcaaataggaagagaggaagtcaaattgtct
ctgtttgcagatgacatgattgtatatttagaaaaccacatcgcctcagcccaaaatctt
cttaagctgagcatacaccaagtaacccatgggaaacctctagagggaactgaaactcca
gaaaattctgtaaccagcgcccttgacctgcttgctggggcccactcccaccctgtggag
tgtgctttcattttcaataaacctctgcttttattgcttcattcttcccgtgctttattt
gtgtgttttatccaattatttgttcaaaatgccaagaacctggacaccctcaactggtaa

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_2|77_aa
MLKKKKKEKEKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKEEEEEEEEE
EEEEEEEDHLHLAVFSY

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_2|234_bp
atgctgaagaagaagaagaaggagaaggagaaggaagaagaagaagaagaagaagaagaa
gaggaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaggaagaggaagaggaagaggaagaggagaaggaagaagaagaagaagaagaagaagag
gaagaggaagaggaagaggaagaccatttgcacctagctgttttctcctattaa

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_3|119_aa
MEEVHADWSMDGRGQALKKHHPAGQKTSRKFSLLVADSTWNSHPGPQALDHPWLEGEVSL
GTHPFPTSNLSASYCHQHAICTTQTVHSEGHLQAHAEPHRQPPAKFVPELVGPQSFRVG

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_3|360_bp
atggaggaagtacatgctgactggtccatggatggccgtgggcaggccttaaaaaagcac
catccagctggccaaaagacatcaaggaagttctcactcctggtcgcagactccacttgg
aacagccatcccggcccccaggctttagaccatccctggcttgaaggtgaggtttcactg
gggactcatcccttcccaactagtaacctgtctgcctcctactgccatcaacatgccatc
tgcaccacccagactgttcattctgagggacacctgcaggcccatgccgagccccaccgt
cagccccctgccaaatttgttcctgagctagttggcccccaaagtttcagagtgggctga

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_4|127_aa
METHPGPEGNLLSSREEPSPGKIYHLLMKSPWALNNQQQYSGTMSRALSKTGNLMASSET
QQVPSHGDYGNRERGSICFGESNRKNKQTNKKHKSLYLVIQISFPDLVKDHQSGPSMNVQ
EPQHYWG

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_4|384_bp
atggagacacaccctgggccagaagggaacctgctgtcttcaagggaagaacccagtcct
ggaaaaatttaccacttgctaatgaagagcccttgggccctgaataaccagcagcaatac
tcaggtactatgtcgagggccttgagtaagactggaaacttgatggcttcaagtgagact
cagcaggttcccagccatggtgactatgggaacagagagagaggctccatttgttttgga
gaaagtaacaggaaaaacaaacaaacaaacaagaaacacaagagtctctacctggtaatt
cagataagttttccagatcttgtcaaagaccatcaaagtggtccctcaatgaatgtgcaa
gaaccacagcattactggggttga

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_5|617_aa
MAHAPLLQRCGAARTGFCKKQQELWQRRKEAAEALGTRKVSVLLATSHSGARPAVSTMDS
SAAPTNASNCTDALAYSSCSPAPSPGSWVNLSHLDGNLSDPCGPNRTDLGGRDSLCPPTG
SPSMITAITIMALYSIVCVVGLFGNFLVMYVIVRYTKMKTATNIYIFNLALADALATSTL
PFQSVNYLMGTWPFGTILCKIVISIDYYNMFTSIFTLCTMSVDRYIAVCHPVKALDFRTP
RNAKIINVCNWILSSAIGLPVMFMATTKYRQGSIDCTLTFSHPTWYWENLLKICVFIFAF
IMPVLIITVCYGLMILRLKSVRMLSGSKEKDRNLRRITRMVLVVVAVFIVCWTPIHIYVI
IKALVTIPETTFQTVSWHFCIALGYTNSCLNPVLYAFLDENFKRCFREFCIPTSSNIEQQ
NSTRIRQNTRDHPSTANTVDRTNHQTLPSSKAGKLYARKSGSRNCSVALTGSHAIPTFTK
LRSHHVCGSRLLQECVGGSNSLGKCLLLGHPTSFLSGHSALHIRGTAKMAGSLANLKRLY
KRLGPAMVSNFSKPLRLVPGPSSCTGAASPQTVPKTVSEIQVQQKYTSPLLQIHGESGEN
WQLLSMTCPYDVDGAGI

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_5|1854_bp
atggcccacgctcccctcctgcagcggtgcggggcagccaggactggtttctgtaagaaa
cagcaggagctgtggcagcggcgaaaggaagcggctgaggcgcttggaacccgaaaagtc
tcggtgctcctggctacctcgcacagcggtgcccgcccggccgtcagtaccatggacagc
agcgctgcccccacgaacgccagcaattgcactgatgccttggcgtactcaagttgctcc
ccagcacccagccccggttcctgggtcaacttgtcccacttagatggcaacctgtccgac
ccatgcggtccgaaccgcaccgacctgggcgggagagacagcctgtgccctccgaccggc
agtccctccatgatcacggccatcacgatcatggccctctactccatcgtgtgcgtggtg
gggctcttcggaaacttcctggtcatgtatgtgattgtcagatacaccaagatgaagact
gccaccaacatctacattttcaaccttgctctggcagatgccttagccaccagtaccctg
cccttccagagtgtgaattacctaatgggaacatggccatttggaaccatcctttgcaag
atagtgatctccatagattactataacatgttcaccagcatattcaccctctgcaccatg
agtgttgatcgatacattgcagtctgccaccctgtcaaggccttagatttccgtactccc
cgaaatgccaaaattatcaatgtctgcaactggatcctctcttcagccattggtcttcct
gtaatgttcatggctacaacaaaatacaggcaaggttccatagattgtacactaacattc
tctcatccaacctggtactgggaaaacctgctgaagatctgtgttttcatcttcgccttc
attatgccagtgctcatcattaccgtgtgctatggactgatgatcttgcgcctcaagagt
gtccgcatgctctctggctccaaagaaaaggacaggaatcttcgaaggatcaccaggatg
gtgctggtggtggtggctgtgttcatcgtctgctggactcccattcacatttacgtcatc
attaaagccttggttacaatcccagaaactacgttccagactgtttcttggcacttctgc
attgctctaggttacacaaacagctgcctcaacccagtcctttatgcatttctggatgaa
aacttcaaacgatgcttcagagagttctgtatcccaacctcttccaacattgagcaacaa
aactccactcgaattcgtcagaacactagagaccacccctccacggccaatacagtggat
agaactaatcatcagaccttacctagcagtaaagcaggcaagttgtacgctagaaaatct
ggaagcagaaactgctccgttgccctaacagggtctcatgccattccgaccttcaccaag
cttagaagccaccatgtatgtggaagcaggttgcttcaagaatgtgtaggaggctctaat
tctctaggaaagtgcctgcttttaggtcatccaacctctttcctctctggccactctgct
ctgcacattagagggacagccaaaatggcaggctccctggccaacttgaaaaggctatat
aaaaggcttggtccagccatggtgtccaacttctccaagcccctccgcttggttcctggg
ccctcttcctgcactggcgcagcatccccacagacagtgcccaagacagtgtctgaaatt
caggtacaacaaaaatatacatctcctttacttcaaatacatggggagagtggtgaaaac
tggcagctgctttccatgacctgcccatatgatgtggatggagctgggatttga

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_6|104_aa
MNIDRAENRIQAGALEHSSEKRWRDKEESATKVGCKEHDLAMINQLLDDPKLTARKYREW
KVMNTLLIQDIYQQQRASPAPDDTDDTPQELKKSPSSPSVENSI

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_6|315_bp
atgaacatagatagagcagagaacaggatacaagctggggctttagaacactccagtgag
aagaggtggcgtgacaaggaggaatcagcaactaaggtgggatgtaaagaacatgatctg
gccatgattaaccagttgctggatgacccgaagctgacagccaggaaatacagagagtgg
aaagtcatgaacaccctgctgatccaggacatctatcagcagcagcgggcttcgcctgcc
cctgatgacactgatgacaccccccaggaactcaagaaatcaccttcttctccctctgtt
gaaaattccatttga

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_7|183_aa
MTDTLINRGSLDTETCMQGTPHVNIGVILPQTKKPLEARREAWNNSFPEETKVSEDDEME
KLYKSLEQASLSPLGDRRPSTKKELRKSFVKRCKNPSINEKLHKIRTLNSTLKVTKFQIV
DELYGCGARNGWFCREILANHSMFVWLQGRAVKVNITFIGQDDLDVLKALGHSHSSAQGS
FSQ

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_7|552_bp
atgactgataccctcataaatagggggagtttggacacagagacatgcatgcagggaaca
ccccatgtgaatattggagttatactgccacaaaccaaaaaaccacttgaagccaggaga
gaggcttggaataactccttccctgaagagacaaaagtgtctgaagatgatgaaatggag
aagctgtacaaatcattagagcaagctagtctatctcctcttggggaccgacgaccttcg
actaaaaaggagttgagaaaatcctttgttaagcggtgtaaaaatccatctataaacgag
aaactccacaaaatccgaacattgaatagcacattaaaggtaacaaaatttcaaattgtg
gatgaactctatggatgtggggcaaggaacggctggttctgcagagagattcttgctaat
cacagcatgtttgtgtggttgcagggaagggctgtaaaagtaaatataacctttataggg
caagatgatcttgacgttcttaaagcactaggacattcacacagtagcgcccaaggcagt
ttttcacaatag

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_8|467_aa
MGRNQSGKAENSKNQNTSSPPKEHNSSPAREQNWMENEFDELTEVGFRSDGENGTKLENT
LQDIIQENFPNLARQVNIQIQEIQRTPQRYSLRRATPRHITVRFIKVEMKEKMLRAVREK
EIQTTIREYYKHLYTNKLENLEETEKFMDTYTLPKLNQEEVESLNRLKIASEIEAIINSL
PTKKNPGPDGFAAEFYQRYKEELVPFLLKLFQSIEKQGILPNSFYEASIILIPKPGRDTT
KKGNFRPISLMNIDVKILNKILANQIQQQIKKLIHHNQVGFIPGMQDCFNVPKSIDIIYH
INRTNDKNHMIMSIDAEKAFNKNQQPFVLKTLNKLGIDGMYLKIIRAIYDKPTANIILNG
QKLEAFPLKTGIRQGCPLSPLPFNIVMEVLAKAIRQEKEIKCIQLGKEEVKLSLFVDHMI
VYLENPIISAQNLLKLISNFSKVSGYKINVQKSQAFQYTNNRQRAKS

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_8|1404_bp
atggggagaaaccagagtggaaaggctgaaaattccaaaaaccagaacacctcttctcct
ccaaaggaacacaactcctcaccagcaagggaacaaaactggatggagaatgaatttgat
gagttgacagaagtaggcttcagaagtgacggggagaatgggaccaaattggaaaacact
cttcaggatattatccaggagaacttccccaacctagcaaggcaggtcaacattcaaatt
caggaaatacagagaacgccacaaagatactccttgagaagagcaaccccaagacacata
actgtgagattcatcaaggttgaaatgaaggaaaaaatgttaagggcagtcagagagaaa
gaaatacaaactaccatcagagaatactataaacacctctacacaaataaactagaaaat
ctagaagaaactgagaaattcatggacacatacaccctcccaaaactaaaccaggaagaa
gttgaatctctgaatagactaaaaatagcttctgaaattgaggcaataattaatagccta
ccaaccaaaaaaaatccaggaccagatggattcgcagccgaattctaccagaggtacaaa
gaggagctggtaccattccttctgaaactattccaatcaatagaaaaacagggaatcctc
ccaaactcattttatgaggccagcatcatcctgataccaaagcctggcagagacacaaca
aaaaaagggaattttaggccaatatccctgatgaacattgatgtaaaaatcctcaataaa
atactggcaaaccaaatccagcagcaaatcaaaaagcttatccaccacaatcaagtcggc
ttcatccctgggatgcaagactgcttcaacgtacccaaatcaatagacataatctatcac
ataaacagaaccaatgataaaaaccacatgattatgtcaatagatgcagaaaaggccttc
aacaaaaatcaacagcctttcgtgctaaaaactctcaataaactaggtattgatggaatg
tatctcaaaataataagagctatttatgacaaacccacagccaatatcatactgaatggg
caaaaactggaagcattccctttgaaaactggcataagacaaggatgccctctctcacca
ctcccattcaacatagtaatggaagttctggccaaagcaatcagacaagagaaagaaata
aagtgtattcaattaggaaaagaggaggtcaaattgtctttgtttgtggatcacatgatt
gtatatttagaaaaccccatcatctcagcccaaaatctccttaagctgataagcaacttc
agcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattccaatacaccaat
aacagacagagagccaaatcatga

>gi568815592f:153939359_154218718|GENSCAN_predicted_peptide_9|246_aa
XWLNKLGSAVIHQESTTKDEECYSESEQEDPEIAAETPPPPHASQTQSLALGLSSGCHCS
LAANFALIVSPTNGLLFPKSVPDIPALVPVERQSLPDTVNSLSAAEDEGQPITFAVQVHS
PVPSEAGIHKALENSFVTSESGFLNSLSSDDTSSLSSNHDHLTVPDKPAGSKIMDKGLVP
CGPVETEFEVQNLGTSAAEAPAKFWAYWGMPVLSEQTIDWGMEEEEKGAAHSFGRVLERS
ELITVG

>gi568815592f:153939359_154218718|GENSCAN_predicted_CDS_9|741_bp
nngtggttaaataaacttggatcggctgtaatccatcaggaatccactacaaaggatgaa
gaatgttacagtgaaagtgaacaggaagatccagaaatagctgcggagacaccaccccct
cctcacgcttcccagactcagtctttggcccttggcctgtcttcaggctgccattgcagt
cttgctgccaattttgcccttatcgtcagcccaacaaacggcctgctgttccccaaaagc
gttccagacattcctgccctcgtacctgtagagagacaatccttgcctgacacagttaac
agtttgtctgctgctgaagatgagggacaaccaataacgtttgctgtgcaagttcattca
cctgtaccctcagaggcaggcatccacaaggccctggaaaacagttttgtcacatcagaa
agtggatttttgaactctttatctagtgatgatacttcttcattgagtagcaatcatgac
catcttactgtcccagataagcctgctggatcaaagatcatggacaaaggtctcgtgccc
tgcggccctgtggagacagaatttgaggtacaaaacctgggaacctcagcagcagaggct
ccggccaaattctgggcctactggggcatgcctgtgctgtcagagcagacaatagactgg
ggtatggaagaggaggagaagggagcagcacattcatttggcagggtcttggaaaggagt
gaacttataactgttggctaa