GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:26:07 Sequence gi568815593r:17253876_17454423 : 200548 bp : 41.45% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1653 1753 101 2 2 65 53 81 0.185 2.08 1.02 Intr + 15618 15726 109 0 1 100 75 33 0.168 2.57 1.03 Term + 21333 22025 693 2 0 88 48 746 0.493 63.18 1.04 PlyA + 22935 22940 6 1.05 2.03 PlyA - 23507 23502 6 1.05 2.02 Term - 25395 25061 335 1 2 47 41 239 0.685 8.89 2.01 Init - 33969 33801 169 0 1 72 75 76 0.228 4.45 2.00 Prom - 34603 34564 40 -7.85 3.00 Prom + 34731 34770 40 -8.65 3.01 Init + 35965 36025 61 0 1 82 19 85 0.485 2.49 3.02 Intr + 38558 38787 230 0 2 88 77 205 0.706 16.07 3.03 Intr + 46665 46751 87 2 0 108 91 95 0.705 11.05 3.04 Term + 52505 52948 444 1 0 72 54 147 0.149 3.95 3.05 PlyA + 53165 53170 6 1.05 4.02 PlyA - 53463 53458 6 1.05 4.01 Sngl - 58013 57780 234 2 0 97 36 248 0.976 13.44 4.00 Prom - 71384 71345 40 -6.05 5.04 PlyA - 72223 72218 6 1.05 5.03 Term - 74717 74585 133 1 1 89 52 130 0.794 6.08 5.02 Intr - 81095 81009 87 1 0 49 99 48 0.063 0.17 5.01 Init - 88832 88705 128 2 2 81 76 64 0.321 4.18 5.00 Prom - 99796 99757 40 -3.65 6.02 PlyA - 99825 99820 6 -0.45 6.01 Sngl - 100759 99998 762 1 0 78 46 798 0.999 70.46 6.00 Prom - 104862 104823 40 -5.75 7.09 PlyA - 105135 105130 6 1.05 7.08 Term - 106224 105968 257 1 2 20 36 311 0.979 13.86 7.07 Intr - 111692 111518 175 2 1 8 33 89 0.036 -6.01 7.06 Intr - 112198 112085 114 1 0 80 77 158 0.680 13.62 7.05 Intr - 114263 114146 118 1 1 44 82 47 0.619 -0.75 7.04 Intr - 115514 115421 94 0 1 77 55 115 0.523 5.20 7.03 Intr - 124952 124859 94 2 1 19 117 80 0.255 2.62 7.02 Intr - 126635 126600 36 2 0 120 95 30 0.445 4.44 7.01 Init - 131114 130974 141 2 0 81 75 91 0.749 7.28 7.00 Prom - 141710 141671 40 -6.35 8.00 Prom + 146446 146485 40 -4.95 8.01 Init + 158962 159150 189 0 0 85 98 62 0.824 6.15 8.02 Term + 159279 159434 156 2 0 105 44 64 0.603 0.65 8.03 PlyA + 160496 160501 6 1.05 9.00 Prom + 160871 160910 40 -5.85 9.01 Sngl + 161143 161790 648 0 0 31 42 392 0.887 24.82 9.02 PlyA + 161925 161930 6 1.05 10.03 PlyA - 164033 164028 6 1.05 10.02 Term - 166221 166124 98 1 2 114 34 93 0.777 3.65 10.01 Init - 166850 166646 205 2 1 92 2 158 0.694 6.66 10.00 Prom - 171685 171646 40 -4.75 11.03 PlyA - 172147 172142 6 1.05 11.02 Term - 176690 176153 538 1 1 58 45 190 0.223 4.43 11.01 Init - 180427 180342 86 1 2 74 83 43 0.539 2.74 11.00 Prom - 180941 180902 40 -6.85 12.05 PlyA - 181225 181220 6 1.05 12.04 Term - 182327 182182 146 0 2 71 49 176 0.968 9.09 12.03 Intr - 191791 191585 207 2 0 83 75 65 0.050 2.83 12.02 Intr - 196622 196448 175 2 1 50 43 103 0.080 0.59 12.01 Intr - 199646 199518 129 2 0 72 94 51 0.229 4.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 129972 129853 120 0 0 66 42 125 0.807 3.19 S.002 Init + 190311 190417 107 2 2 70 68 104 0.864 6.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_1|300_aa MYFETSCPTLDAWCRSGSVATCNTSLTTLDGVTGPGCSDQLTVLEWHTLFSLHRKAELYH ISSRIAVSSQNSKMGGKLSKKKKGYNVNDEKAKEKDKKAEGAATEEEGTPKESEPQAAAE PAEAKEGKEKPDQDAEGKAEEKEGEKDAAAAKEEAPKAEPEKTEGAAEAKAEPPKAPEQE QAAPGPAAGGEAPKAAEAAAAPAESAAPAAGEEPSKEEGEPKKTEAPAAPAAQETKSDGA PASDSKPGSSEAAPSSKETPAATEAPSSTPKAQGPAASAEEPKPVEAPAANSDQTVTVKE >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_1|903_bp atgtattttgaaacctcttgccccacgttggatgcctggtgtcgatcaggaagtgttgca acgtgtaacacatcactcaccactcttgacggtgttacaggcccaggttgctctgatcaa ctcacagtgctcgaatggcacactctcttctctcttcaccgaaaagcagaactttatcac atttcttccagaatagctgtttctagccagaactccaagatgggaggcaagctcagcaag aagaagaagggctacaatgtgaacgacgagaaagccaaggagaaagacaagaaggccgag ggcgcggcgacggaagaggaggggaccccgaaggagagtgagccccaggcggccgcagag cccgccgaggccaaggagggcaaggagaagcccgaccaggacgccgagggcaaggccgag gagaaggagggcgagaaggacgcggcggctgccaaggaggaggccccgaaggcggagccc gagaagacggagggcgcggcagaggccaaggctgagcccccgaaggcgcccgagcaggag caggcggcccccggccccgctgcgggcggcgaggcccccaaagctgctgaggccgccgcg gccccggccgagagcgcggcccctgccgccggggaggagcccagcaaggaggaaggggaa cccaaaaagactgaggcgcccgcagctcctgccgcccaggagaccaaaagtgacggggcc ccagcttcagactcaaaacccggcagctcggaggctgccccctcttccaaggagaccccc gcagccacggaagcgcctagttccacacccaaggcccagggccccgcagcctctgcagaa gagcccaagccggtggaggccccggcagctaattccgaccaaaccgtaaccgtgaaagag tga >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_2|167_aa MEILQGGPASWVCHLDSHKGPHTQSPVLDRMLCSLEILKSLSKAQPFLQSVLPVGPNLRL DSREKEDSPRLGNDIPGVPVPPLPPYCQEQTEDGLLSGLRPGRAILGCMAHASSWGPESI PIMQRRHQLRKRIFKVEGQAWHPCTSSSGFLIVSSLQYAQRHPVNER >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_2|504_bp atggagatcttgcagggaggaccagcttcatgggtgtgccacctggacagtcacaaagga ccccacactcagagtcctgtgctggatcgaatgctttgctctcttgaaattcttaagagt ttaagcaaggcccagccctttttgcagtcagtcctgcctgtaggacctaacctacgtctt gattccagagagaaagaagattctccaagactggggaatgacattccgggagtgccagta ccacctctgcctccatattgccaggaacagactgaagatggccttctctcagggttgaga ccaggaagggccatcctgggatgcatggcacatgccagctcttggggaccagagtccatt cccatcatgcagaggcggcatcagctacggaaaagaatcttcaaagtggagggacaagcc tggcatccctgtacaagttcctctggattcctcatagtttcctccttgcagtacgcacaa aggcacccagtaaatgaacgttag >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_3|273_aa MKIRFPQSRGGWRALGLMAKDNDSVLFLEMVSGLREELEESLVWCLRTQEDQGSGAIVSR HDRCLKTSRARKEHTPGEPATQMQKLVTHVSTPSADKVSFIASSDSPSHSLTRGGGVETS GACRSQAILLVRLTSQLFRAVSWAGCSLPWPVSGVLSKHACRPHYSTQPTGPGRAASLPS KGAQLELEEMLVPRKMSISPLESWLTACCLLPRLDAQTPGTAAPAQFYECLPSQMGEGAK QEDEKAWDTTQMQSKNVLKTRRQKMNHHKHRKL >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_3|822_bp atgaagattcgatttccgcagtcccgtgggggatggcgagccctgggcctaatggctaaa gataacgactccgtgttgttccttgagatggtttcgggtttacgggaagagttagaagag agtcttgtgtggtgtttgaggactcaggaagatcagggttctggggcgatagtttctaga cacgatagatgcctgaaaaccagcagggcccgaaaggagcatacccctggggaaccagca actcaaatgcagaaactggtaacccacgtgagcaccccatcagcagacaaggtctccttc atcgctagctctgattcgccctcacactccctcactaggggtgggggtgtggagacttca ggtgcctgccggagccaggccatactcctggtgcgcctgacttctcagctgttcagggct gtttcctgggcaggctgcagtctgccctggcctgtttctggagtgctgagcaagcatgcc tgcaggccccattacagcacacagccaactggcccaggtagggctgcctcactccccagc aagggggcccagctggagctcgaggagatgctggtccccaggaagatgtccatcagcccc ctggagagctggcttacagcctgctgcctcctgcccagactggatgcccagaccccaggg actgcggctccagcccaattctatgagtgtctccctagccagatgggggaaggggccaag caggaggatgagaaggcctgggatacaactcagatgcagagcaaaaacgtgctgaagacc cgccggcagaagatgaaccaccacaagcaccggaagctgtga >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_4|77_aa MPEPSPASVGSCAARASPTSATPCSTAPSPIDHPRAEECERMARDWQAAPPAAPVQDPLG EASWAPEFGGALENLYV >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_4|234_bp atgcctgagccttcccccgcctccgtgggctcctgtgcagcccgagcctccccgacgagc gccaccccctgctccacggcgcccagtcccatcgaccacccaagggctgaggagtgtgag cgcatggcgcgggactggcaggcagctccacctgcagccccggtgcaggatccactgggt gaagccagctgggctcctgagtttggcggggctttggagaacctttatgtctag >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_5|115_aa MMILLKRWDFGELIIGYKGVAVMNAISALIQETPESCLTSSIGSMSMSHGLKMDSAKNEQ SIKAGFTLHDTSKTYLGGSQIVVLNQQQQHHLGLSNAKSQPLTQDRLTQKLRGWG >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_5|348_bp atgatgatacttcttaagaggtgggactttggggagctgattataggttataagggtgta gcagtcatgaatgcgattagtgccctgatacaagaaaccccagagagctgcctcacctct tccatcgggtcaatgagtatgtctcatgggcttaaaatggacagtgcaaagaatgagcag tctataaaagcaggttttacactgcatgataccagcaaaacctacctcggtggttctcaa atagtggtcctcaaccagcagcagcagcatcacctgggcctgagcaatgcaaagtctcaa cctctcacccaagaccgactgactcagaaactcaggggttggggctga >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_6|253_aa MDVLPRESSGFPASTVLGQNLALVPHTGRLPIASPPSPPHRALGLPQGPRRRSSTAQSPP RTQPPLLSRRHDDRVHLTGAPGLPPGLKAAINRQINLELYASCVYLSMSYYFDRDDVALK NFAKYFLHQSHEEREHAEKLMKLQNQGGGRIFLQDIKKPDCDDWESGLNAMECALHLEKN VNQSLLELHKLATDKNDPHLCDFIETHYLNEQVKAIKELGDHMTNLCKMGAPESSLAEYL FDQHTLGDSDNES >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_6|762_bp atggacgttcttccacgagagtcgtcggggtttcctgcttcaacagtgcttggacagaac ctggcgctcgtcccccacaccggccggctgcccatagccagccctccgtcacctcctcac cgcgccctcggactgccccaaggcccccgccgccgctccagcactgcgcagtcaccaccg cgaacgcagccgcctctccttagtcgccgccatgacgaccgcgtccacctcacaggtgcg ccaggactaccaccaggactaaaggccgccatcaaccgccagatcaacctggagctctac gcctcctgcgtttacctgtccatgtcttactactttgaccgcgatgatgtggctttgaag aactttgccaaatactttcttcaccaatctcatgaggagagggaacatgctgagaaactg atgaagctgcagaaccaaggaggtggccgaatcttccttcaggatatcaagaaaccggac tgtgatgactgggagagcgggctgaatgcgatggagtgtgcattacatttggaaaaaaat gtgaatcagtcactactggaactgcacaaactggccactgacaaaaatgacccccatttg tgtgacttcattgagacacattacctgaatgagcaggtcaaagccatcaaagaattgggt gaccacatgaccaacttgtgcaagatgggagcacccgaatctagcttggcggaatatctc tttgaccagcacaccctgggagacagtgataatgaaagctaa >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_7|342_aa MGIAEGQFRSCLPRTKEYTKHAQDGSSWQRRNSSSLITELERHTWQKVRIHRNQGCKMKR ICTKTCDLMNFHRFSPVDVFGVCYLYCGYTGHLAFPHINLVLTPHSGDHQLRGKKRERRT MGTSLPQQAVHLWIPLPDANFAHVMSIKIKKMEPITEISTWLGILALFSTCRSELSLRSQ LRDAFLDHPIMDREPPAQQSSSKYKQLLEIQTGGPAPNKVAGKLSCSVKSSQRCSHENNS FCKAIGLLPPWALIALPESSSNLAWEKEEKSADQKNNWPEKSDVRKQHAGGAAPAEALEN KGRFKITGLDNEERDYDLQKSSVSRKERLRFARCKLPRNHPL >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_7|1029_bp atggggattgctgaaggccaattcagaagctgcctgcctagaaccaaggagtatacgaag catgcacaagatggttcatcctggcagaggaggaacagttcttccctcataaccgagctg gagaggcacacgtggcagaaggtgcgaattcacagaaatcaaggatgtaagatgaagaga atctgcacaaagacttgtgaccttatgaattttcacaggttttccccagtggatgtcttt ggcgtctgctatctctactgcggatacaccggccacctggctttcccacacatcaacctc gtgctcacaccccactctggagaccaccagcttcgaggcaaaaagagagaacgacgtaca atggggacttcgcttccccagcaggcagttcacctctggatacccctccctgatgcaaac tttgctcatgtcatgtctatcaagatcaagaaaatggagcccataactgagatcagcaca tggcttggcatcctcgccctcttctccacatgccggtctgagctgtcactcagaagtcag ctcagagatgcctttctggaccatccaatcatggacagagagccaccagcacagcagagc agcagcaaatataaacaacttctagaaatccaaacaggagggccagcaccaaacaaggtt gctggcaagctttcttgctcagtaaagtcctctcaacgctgttcccatgaaaacaattca ttctgcaaagcaatcggcctcctgcctccttgggcactaattgctctgcctgaaagttcg agcaacctagcctgggagaaagaggagaaatccgcagaccagaagaacaactggccagag aaaagtgatgtcagaaagcagcatgctggaggagcagctccagcagaagccctggagaac aaggggagattcaaaatcactggacttgacaatgaggagagagattatgaccttcaaaaa agcagcgtcagcagaaaggagcggttacgctttgctcgatgcaagttacccagaaatcat cccctctaa >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_8|114_aa MRRLAQLSLQENLLQGGRLGNTVHLLCLQMHWGIHAEAMFSRGCSQPVTEQGRSASIMPF LQKVSELLSVQGSSPTYSSSSPLYPSQVPHPKLSNLLPSSNLLLGDYEQKRICI >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_8|345_bp atgaggcgccttgctcagctctcccttcaagagaacttgcttcaaggtgggaggctgggg aacaccgtccacctgctctgtcttcaaatgcactggggcattcatgcagaggctatgttt tcccggggctgctcccagcctgtgactgagcaaggcagaagtgctagcatcatgccattc ctgcagaaggtgtcagaactgctctcagtccaaggatcgtcccccacatattcctcctcc tctcccctttatccttcccaggtcccccaccccaaactctcaaatctacttccatcttca aatctgcttcttggagattatgaacaaaagaggatttgcatttga >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_9|215_aa MRDKHKKATQEYGVNRAGSLFKFSECNYLIAICSSYKIPFPPRICCCQYLKTGLIFSAFN PFVNHQQPVGKKRKKEKKKKKTESFRFGLHVILSPIALRGDRWVTSCRDTGPGSRGPRTQ SSERCWVLKPPLPLNTKQRVLLGLQAASASEQPRWERGSLRPASLTLTSEALNVLSQARN VKGSGSLQPVRNPFTIVCFHAYALFSSHSVKRKPQ >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_9|648_bp atgcgggacaagcataagaaagcgactcaggagtacggagtaaacagagcaggctcattg ttcaagttcagtgagtgtaattatctgatagccatttgtagttcctataaaatacccttt cccccccgtatttgttgctgccagtatttaaagacaggtctaatcttttctgcttttaat ccctttgtcaaccatcagcagccagtgggaaaaaaaaggaaaaaagaaaaaaagaaaaag aaaacagaatcatttaggtttggccttcatgttatcctgtccccaattgcactgagaggg gacagatgggtgaccagctgtcgtgacacaggaccagggtcacggggtccccgcacacaa agcagcgaacgctgctgggtcctcaagccgcctctgcctctgaacacaaagcagcgagtg ctgctgggtcttcaagctgcctctgcctctgaacagcctcgttgggagcgaggctctctc aggccggcatcactcacgctcaccagtgaggctctgaatgttttgtctcaagccaggaat gttaaaggctctggttctctgcagccagttaggaaccctttcaccattgtttgcttccat gcatatgccctgttcagcagccactctgtgaagaggaaacctcagtaa >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_10|100_aa MNEEEMNPGRTCQTGKGRKLSNASSWRTAAVTENRVGDSGSEHPCSLQRLPLSQNRAVAA TTKPTGFLGKKQNSRDQLAFWASSGDELLREAGTDGPRPG >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_10|303_bp atgaatgaggaagaaatgaacccaggtaggacttgccagactgggaagggaaggaagttg tccaatgcaagttcttggagaaccgcagctgtgacagagaacagagttggggactctgga tctgagcacccttgctctctgcagaggctccctttatcccaaaaccgtgctgttgcagcc acgactaaaccaacaggatttcttggaaaaaaacagaattcccgtgatcaattggctttc tgggcatcgagtggagatgagctgctgagagaggctggcactgatggacctaggcctggt taa >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_11|207_aa MAVRSPVAYLEAISSFIHDFKDKWTFLKRVGLTFQRTPDLVPARSSWGPGADLQFQKSSL VSGLILLTGTGLLSWDGVRLGCQNGADILILNDSHSLGHRMSSGLWAPNSHQPALGFELH VRFPASSVVPLAATMVLSLLCSTRGGRQGLCCGPPAAETQGHPPSLCRLHHSARRWLHPP PPRAFRIVVLQHQQALPIHVHSFLQLM >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_11|624_bp atggctgtcagaagtccagttgcatatctggaagccatctcttcctttattcatgacttt aaggacaagtggacatttttgaaaagggtaggcttgacattccaaagaactccagatctc gtgcctgcaaggtcatcctggggccctggggctgacctccagttccagaaaagcagccta gtgagtggattgattctgctgactggcacaggattgctgtcatgggatggtgtgcgattg ggctgccagaatggtgctgacatcctgattttgaatgattctcactcacttggtcacagg atgagttcaggactctgggctcccaacagccatcagccagctcttgggttcgagctgcat gtgaggttccctgcctcctcagttgtccctcttgctgctactatggtgctgtctttgctc tgcagcacccgcggtggcaggcaaggtttatgctgtggaccccctgctgctgaaacccag ggccatcccccatccctctgcaggctgcaccatagtgctcgaaggtggcttcacccacct cctccaagggctttccggattgttgttctgcaacaccagcaggctcttccaatccatgtt cattctttcctccagctgatgtga >gi568815593r:17253876_17454423|GENSCAN_predicted_peptide_12|218_aa DIQLQGNPKVHSGLWVIIMCQYRLMDCNKGTTLARGVDNGGGQRETLALCSKTVHNTNVP EKIVQNEGQLVAHSQEIQKYKKSVENWSPTITIQSLEKVLQALSIESLHPILETVWKKKQ RREERGKVQKYFKLCQKKSISTNIHREMNDRSHRSKVFPHDSFNWPGPNQGTYRKDPLQE KPGHKACSLLSCNHAMDLEITQRENQSLTMASRVSTIC >gi568815593r:17253876_17454423|GENSCAN_predicted_CDS_12|657_bp gatatacaactccaagggaaccctaaagtacactctgggctttgggtgataatcatgtgt cagtataggctcatggattgtaacaagggtaccactctggccagaggtgttgataatgga ggaggccagagagagacactggctctatgttccaagactgttcacaacacaaatgtccct gaaaagatagttcagaatgaaggacagttagtcgctcactcacaagagatacagaaatac aaaaagtctgtggagaattggtctccaacaataacaattcaatcattagagaaagtactt caggcattgtcaattgaatctcttcatccaatcttggaaacagtctggaaaaagaagcag aggagagaagaaaggggaaaagttcaaaagtactttaaactctgccaaaaaaaatcaatt agtaccaatatacacagagaaatgaatgacagaagtcatcgcagcaaggtatttcctcat gactccttcaactggccagggccaaaccaaggcacctaccggaaagatcctcttcaagag aaaccaggtcataaggcttgttcacttctcagctgcaaccatgcaatggatcttgaaatc acacagagggagaaccagagcctcacaatggcctccagggtgtccaccatctgctga