GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:45:49 Sequence gi568815595f:45603438_45843666 : 240229 bp : 43.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1335 1400 66 2 0 98 80 16 0.140 1.64 1.02 Intr + 14067 14216 150 2 0 80 107 55 0.350 6.96 1.03 Term + 22003 22068 66 0 0 68 54 64 0.107 -1.06 1.04 PlyA + 22387 22392 6 1.05 2.06 PlyA - 25294 25289 6 1.05 2.05 Term - 28157 28066 92 0 2 123 47 56 0.063 2.88 2.04 Intr - 39570 39512 59 2 2 101 35 63 0.566 0.83 2.03 Intr - 41102 41051 52 0 1 80 115 43 0.710 4.17 2.02 Intr - 44165 43913 253 2 1 20 95 93 0.308 0.21 2.01 Init - 44540 44466 75 2 0 86 87 28 0.406 3.60 2.00 Prom - 47384 47345 40 -2.06 3.00 Prom + 47679 47718 40 -6.46 3.01 Init + 50709 50721 13 2 1 119 75 -8 0.230 1.40 3.02 Intr + 62213 62280 68 0 2 96 105 91 0.987 10.22 3.03 Intr + 64857 64919 63 2 0 115 101 27 0.947 5.61 3.04 Intr + 69253 69383 131 0 2 91 71 186 0.992 16.69 3.05 Intr + 70017 70068 52 0 1 110 94 -8 0.991 0.91 3.06 Intr + 70906 70974 69 0 0 109 59 96 0.600 8.18 3.07 Term + 73485 73622 138 2 0 103 42 135 0.204 8.36 3.08 PlyA + 77974 77979 6 1.05 4.00 Prom + 81816 81855 40 -3.96 4.01 Init + 86029 86060 32 0 2 84 68 86 0.035 5.42 4.02 Intr + 100001 100098 98 2 2 61 68 94 0.543 4.35 4.03 Intr + 101698 101772 75 2 0 81 110 28 0.667 3.79 4.04 Intr + 103343 103470 128 0 2 105 110 -14 0.782 2.90 4.05 Intr + 106061 106210 150 1 0 92 98 17 0.662 3.46 4.06 Intr + 120038 120106 69 1 0 111 84 -10 0.175 0.28 4.07 Intr + 127864 127943 80 0 2 54 67 44 0.217 -2.75 4.08 Intr + 128616 128714 99 0 0 75 121 43 0.496 5.53 4.09 Intr + 135141 135234 94 0 1 70 108 21 0.612 2.27 4.10 Intr + 135344 135436 93 1 0 121 50 5 0.380 0.16 4.11 Intr + 136150 136207 58 0 1 107 119 -27 0.612 0.76 4.12 Term + 140096 140232 137 0 2 60 38 85 0.513 -1.12 4.13 PlyA + 140925 140930 6 1.05 5.00 Prom + 142198 142237 40 -4.96 5.01 Init + 143560 143640 81 0 0 103 75 87 0.524 8.05 5.02 Intr + 144271 144442 172 0 1 -4 77 151 0.658 4.32 5.03 Intr + 144594 144743 150 1 0 83 70 70 0.875 4.83 5.04 Intr + 148341 148390 50 1 2 110 96 -18 0.245 -0.40 5.05 Intr + 150385 150521 137 0 2 98 103 -14 0.135 0.47 5.06 Term + 150950 150992 43 2 1 130 49 41 0.172 1.03 5.07 PlyA + 153161 153166 6 1.05 6.14 PlyA - 153328 153323 6 1.05 6.13 Term - 155690 155541 150 2 0 114 50 197 0.996 16.41 6.12 Intr - 156400 156320 81 1 0 77 69 47 0.619 1.53 6.11 Intr - 156585 156420 166 2 1 86 70 191 0.948 17.06 6.10 Intr - 159635 159476 160 0 1 61 78 385 0.958 33.85 6.09 Intr - 162304 162100 205 1 1 89 76 311 0.999 28.77 6.08 Intr - 166934 166772 163 1 1 71 107 225 0.955 22.68 6.07 Intr - 168021 167780 242 0 2 88 74 502 0.675 45.15 6.06 Intr - 169178 169068 111 2 0 104 102 142 0.998 17.78 6.05 Intr - 172551 172324 228 0 0 101 81 243 0.933 22.97 6.04 Intr - 176663 176572 92 0 2 104 99 140 0.421 16.41 6.03 Intr - 178786 178646 141 2 0 68 96 243 0.990 23.42 6.02 Intr - 190155 190105 51 1 0 102 81 13 0.215 0.88 6.01 Init - 192982 192862 121 1 1 106 105 247 0.998 28.55 6.00 Prom - 193351 193312 40 -7.06 7.09 PlyA - 195247 195242 6 1.05 7.08 Term - 203939 203759 181 1 1 48 49 101 0.038 -0.72 7.07 Intr - 224022 223919 104 0 2 26 72 121 0.504 3.27 7.06 Intr - 225178 225002 177 1 0 63 77 104 0.974 6.92 7.05 Intr - 229684 229613 72 1 0 93 85 32 0.887 3.00 7.04 Intr - 230861 230801 61 1 1 88 88 54 0.999 4.14 7.03 Intr - 232347 232153 195 2 0 79 47 260 0.983 19.53 7.02 Intr - 234614 234490 125 2 2 125 69 42 0.839 5.38 7.01 Init - 238554 238480 75 0 0 114 51 88 0.538 6.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:45603438_45843666|GENSCAN_predicted_peptide_1|93_aa MDSHSVCLSRHAFSLESSLLGMAHDCMSSTVVPGVYLLHFSSSQQPCKVPANFLSCFEDA KTKAESHKVTHLNTFLLTLVFECVQCLFGIYCV >gi568815595f:45603438_45843666|GENSCAN_predicted_CDS_1|282_bp atggattcccacagcgtctgcctgtcccgtcatgctttctctctggagtcctccctcctt ggtatggctcatgactgcatgtcaagcacagttgtgccaggtgtctacctgctccatttc tcctcctcccagcagccctgcaaagtgcccgccaattttctctcctgttttgaagatgcg aaaactaaagcagagagccacaaggtcactcacctgaacaccttcctgctgactctggtt tttgaatgtgttcagtgcttatttggcatctactgtgtgtga >gi568815595f:45603438_45843666|GENSCAN_predicted_peptide_2|176_aa MEMAMEDTEGRVIFTWVVMEAKGDKGIWDPAGIQMPPMVTIAGLLRMWRGSCHPAPEHRE GPASSGQALDLPDPQADPCPAMHLVLENLATGSTLSQPQYSWEESGSSHGFSAPPWVEEG YAVSGPRVCQQLPESVQCQWGGMAAWAESFGLLTSFLSTVTSDHLAQIQNKEATCV >gi568815595f:45603438_45843666|GENSCAN_predicted_CDS_2|531_bp atggagatggcaatggaagacactgaagggagagtgatattcacctgggtggtgatggag gctaagggtgacaagggcatatgggaccctgctggcatccaaatgccaccaatggtgacc attgcagggctgctgcggatgtggagaggcagctgtcaccctgcccctgaacacagggaa gggcctgcatcctctggccaggcattggatttgccggacccccaggctgacccgtgccct gctatgcatctggtccttgaaaacctggccacaggatccacactcagccagcctcagtac tcctgggaagagtctgggagctcccatggatttagtgcgccgccatgggtggaggagggt tatgcggtgtcaggcccacgggtgtgccaacagcttcctgagagtgtccagtgccagtgg gggggcatggccgcctgggctgagtccttcgggctgctgacatcctttctctccaccgtg accagtgaccacctagcacagatacaaaacaaggaagccacctgtgtttga >gi568815595f:45603438_45843666|GENSCAN_predicted_peptide_3|177_aa MDSTGRKLRGKAFYFVNGKVFCEEDFLYSGFQQSADRCFLCGHLIMDMILQALGKSYHPG CFRCVICNECLDGVPFTVDSENKIYCVRDYHKVLAPKCAACGLPILPPEGSDETIRVVSM DRDYHVECYHCEDCGLELNDEDGHRCYPLEDHLFCHSCHVKRLEKRPSSTALHQHHF >gi568815595f:45603438_45843666|GENSCAN_predicted_CDS_3|534_bp atggacagtacaggccggaagctgagaggaaaagccttttattttgtcaacggcaaagtg ttttgtgaagaagacttcctgtactctggtttccagcagtcggctgacaggtgttttctt tgtggacatctgatcatggacatgatcctgcaagccctggggaagtcctaccaccccggc tgtttccgctgtgtcatctgtaatgagtgtttggatggggtgcccttcaccgtggactca gagaacaagatctactgtgtccgagattaccacaaggtgctggcccccaagtgtgcagcc tgtgggcttcccatccttccacctgagggctcagatgagaccatccgtgtcgtgtccatg gacagagactaccacgtggagtgttaccactgcgaggactgtggtctggagctcaatgat gaagatggccaccgctgttatccgctggaggaccacctgttctgtcactcctgccacgtg aagaggctggagaagagaccctcatctacagcccttcaccagcaccacttctag >gi568815595f:45603438_45843666|GENSCAN_predicted_peptide_4|370_aa MATAAYEQLKLHITPEKFYVEACDDGADDVLTIDRVSTEVTLAVKKDVPPSAVTRPIFGI LGTIHLVAGNYLIVITKKIKVGEFFSHVVWKATDFDVLSYKKTMLHLTDIQLQDNKTFLA MLNHVLNVDGFYFSTTYDLTHTLQRLSNTSPEFQEMSLLERMDGFQRHFDSQVIIYGKQV IINLINQKGSEKPLEQTFATMVSSLGSGMMRYIAFDFHKECKNMRWDRLSILLDQVAEMQ DELRTGKRTHLGLIMDGWNSMIRYYKNNFSDGFRQDSIDLFLGNYSVDELESHSPLSVPR DWKFLALPIIMVVAFSMCIICLLMAGDTWTETLAYVLFWGVASIGTFFIILYNGKDFVDA PRLVQKEKID >gi568815595f:45603438_45843666|GENSCAN_predicted_CDS_4|1113_bp atggcgacggcggcctacgagcagctgaagctgcatatcacacctgaaaaattttatgtg gaagcttgtgatgatggagcagatgacgtacttaccattgaccgtgtgtccacagaggtt acccttgcagtcaagaaagatgttcctccttcagctgtcacaagaccaatatttggtata ctgggcacaatccatctggtggcaggtaattatcttatagtcattaccaaaaagataaaa gtaggtgaatttttcagtcatgtagtctggaaagcaacagattttgatgtcctttcttat aagaagacaatgttgcacttaactgatattcagttacaagataataaaaccttcctagcg atgctaaaccatgtcttgaatgtggatggattttacttttcaacaacatatgatttgacc catactttgcagcggctatccaacactagtcctgaattccaagaaatgagtctcttggaa aggatggacggtttccaaaggcattttgattcccaagtaattatttatggaaaacaagtt ataatcaatctgattaaccagaagggctcggagaagccacttgagcagacatttgcaaca atggtgtcttccttgggaagtggaatgatgagatacattgcctttgacttccataaggaa tgtaaaaatatgagatgggatcgactaagtattttattggatcaggtagcagaaatgcaa gatgaattaagaactggaaagagaactcatttgggacttataatggatggctggaactca atgatacgatattataagaacaacttttccgatggatttagacaagattccatagactta tttcttggaaactattcagtggatgaattagaatctcatagtcctttaagtgttccaagg gactggaaattcctggctttgcctattatcatggttgttgccttttcaatgtgcattatc tgtttgcttatggctggtgacacttggacagaaacactggcctatgtgctcttctgggga gttgcaagcattggaacattttttatcattctttacaatggcaaagattttgtcgatgct cccagactggtccagaaagaaaagatagactga >gi568815595f:45603438_45843666|GENSCAN_predicted_peptide_5|210_aa MGPARNLNPYPRLRGGMARAACTSMEQELNTYRNTLAAERRYSLQVSSELFYHSIKLLFI LLILHLSTYFILPGHRTRTQDPPNVPSISKLLGTTVFRTGSRGSCLHACLVQLQPCREPA PMPALGAAYPTAAAGPLYLWVLGLSILIYLPVTGYFHLIHRHLKLHMVKPLLIFTLQASS VPHFPHLRKWHQHPPSRAFAVAVTSPGQAF >gi568815595f:45603438_45843666|GENSCAN_predicted_CDS_5|633_bp atgggcccagcaaggaacctgaacccctaccccagactgcgaggaggcatggccagagcc gcctgcacgtccatggagcaggagctgaacacttatcggaacaccctggctgcggaaagg aggtactccctgcaggtctcctctgagctgttctatcactcgataaagctcctcttcatc ttgctcatcctccacttgtccacatacttcattcttcctggccacaggacaagaactcag gacccgccaaatgttcctagcatctctaagcttctgggcactaccgtcttccgcactggc agccgtggaagctgcttgcatgcatgcctggtccagctgcagccttgcagagagcccgca cccatgccagcacttggagctgcctaccccactgcagcagctggtccgctctacttgtgg gtgttggggctttctattctcatctatttgccggttactggatatttccacctaattcac agacatctcaaacttcacatggtcaaaccactcttgatttttactctgcaggccagctct gttccccactttcctcatctccggaaatggcaccagcatccccccagcagggcctttgca gttgctgtcacctctcctggacaggccttctag >gi568815595f:45603438_45843666|GENSCAN_predicted_peptide_6|636_aa MEKARPLWANSLQFVFACISYAVGLGNVWRFPYLCQMYGGGQARPGMRNVNSEDYIPGSF LVPYIIMLIVEGMPLLYLELAVGQRMRQGSIGAWRTISPYLSGVGVASVVVSFFLSMYYN VINAWAFWYLFHSFQDPLPWSVCPLNGNHTGYDEECEKASSTQYFWYRKTLNISPSLQEN GGVQWEPALCLLLAWLVVYLCILRGTESTGKVVYFTASLPYCVLIIYLIRGLTLHGATNG LMYMFTPKIEQLANPKAWINAATQIFFSLGLGFGSLIAFASYNEPSNNCQKHAIIVSLIN SFTSIFASIVTFSIYGFKATFNYENCLKKVSLLLTNTFDLEDGFLTASNLEQVKGYLASA YPSKYSEMFPQIKNCSLESELDTAVQGTGLAFIVYTEAIKNMEVSQLWSVLYFFMLLMLG IGSMLGNTAAILTPLTDSKIISSHLPKEAISGLVCLVNCAIGMVFTMEAGNYWFDIFNDY AATLSLLLIVLVETIAVCYVYGLRRFESDLKAMTGRAVSWYWKVMWAGVSPLLIVSLFVF YLSDYILTGTLKYQAWDASQGWRWAPMATEKHGPFIFRNQGTISMVWGQLVTKDYPAYAL AVIGLLVASSTMCIPLAALGTFVQRRLKRGDADPVA >gi568815595f:45603438_45843666|GENSCAN_predicted_CDS_6|1911_bp atggagaaagcgcggccgctgtgggccaactcgctacagttcgtgttcgcctgcatctcg tacgccgtgggcctgggcaacgtgtggcgattcccgtacctgtgccagatgtacggcgga ggccaggcccggcccggcatgaggaatgttaattctgaagattacattccaggtagtttc ctggtcccctacatcatcatgcttatcgtggagggaatgccgctcttgtacctggaactg gctgtggggcagcgcatgcggcagggcagcatcggcgcctggaggaccatcagcccgtac ctcagtggtgtcggggtcgccagcgtggtggtctctttcttcctctccatgtactacaac gtcatcaacgcctgggccttctggtacctcttccactccttccaggatcccctgccgtgg tctgtctgcccactgaatggtaaccacacgggctacgatgaggagtgtgagaaggcgtcc tccacacagtacttctggtacaggaaaaccctcaatatctcgccgtccctccaggagaac gggggtgtgcagtgggagccggcgctgtgcctcctcctggcctggctggtggtgtacctg tgcatcctgcgtggcaccgagtccactggcaaggtggtgtatttcacggcgtcactgccc tattgcgtgctcatcatctacctcatcaggggcctcacgctccacggagccaccaatggc ctcatgtacatgttcactcccaagatagagcagctggccaaccccaaggcctggatcaat gcagccacccagatcttcttctcacttggcctgggcttcggcagcctgatcgccttcgcc agctacaatgagccatccaacaactgccagaagcacgccatcatcgtgtccctcatcaac agcttcacctccatatttgccagcattgtcaccttctccatctatggcttcaaggccacc ttcaattatgaaaactgcttgaagaaggtgagtctgctgctgaccaacacttttgacctt gaagatggctttttgacagccagcaacctggagcaggtgaagggctacctcgcatctgcc tacccaagcaaatacagcgagatgttcccgcaaatcaaaaactgcagcttggaatcggag ctagacacggccgtccagggcactggcctggcattcatcgtctacacagaggccattaaa aacatggaggtgtcccagctgtggtcggtgctctacttcttcatgctgctgatgctgggc attgggagcatgctggggaacacagcggccatcctcacccctctgacagacagcaagatc atctccagccacctgcccaaggaggccatctcaggtctggtgtgccttgtcaactgtgcc attggcatggtgttcacgatggaggctgggaactactggtttgacatattcaacgactac gcggccacactgtccctgctgctcatcgtgctggtggagacgattgccgtgtgctacgtg tacgggctgaggagatttgaaagtgaccttaaggccatgaccggccgagctgtgagctgg tactggaaggtgatgtgggctggcgtaagcccactgctgattgtcagcctctttgtcttc tacctgagcgactacatcctcacggggaccctgaagtatcaagcctgggacgcctcccag ggctggcgctgggcccccatggccactgaaaagcatgggccattcattttccgaaaccag gggactatttccatggtgtggggccagctcgtgaccaaagattacccggcctatgcactg gctgtcatcgggctgcttgtggcctcctccaccatgtgcatccccctggcggccctgggg acttttgttcagcgtcgcctcaagaggggagacgcagaccccgtggcctga >gi568815595f:45603438_45843666|GENSCAN_predicted_peptide_7|329_aa MVSNRPSGAAPRTGESAAGAGDTERAELGLNEHHQNEVINYMRFARSKRGLRLKTVDSCF QDLKESRLVEDTFTIDEVSEVLNGLQAVVHSEVESELINTAYTNVLLLRQLFAQAEKWYL KLQTDISELENRELLEQVAEFEKAEITSSNKKPILDVTKPKLAPLNEGGTAELLNKDFIK AQDLSNLENTVAALKSEFQKTLNDKTENQKSLEENLATAKHDLLRVQEQLHMAEKELEKK FQQTAAYRNMKEILTKKNDQIKDLRKRLAQSVARNRATATQSCSLQLPTQCQFCAAMLPS ALASSSSDPGGSLRLVIFAKYSKLLGIAQ >gi568815595f:45603438_45843666|GENSCAN_predicted_CDS_7|990_bp atggtgagtaaccgaccctcaggcgccgcaccgcgcacaggagagagcgcggccggggct ggagacactgagcgggcagagttgggcctaaatgagcaccatcaaaatgaagttattaat tatatgcgttttgctcgttcaaagagaggcttgagactcaaaactgtagattcctgcttc caagacctcaaggagagcaggctggtggaggacaccttcaccatagatgaagtctctgaa gtcctcaatggattacaagctgtggttcatagtgaggtggaatctgagctcatcaacact gcctataccaatgtgttacttctgcgacagctgtttgcacaagctgagaagtggtatctt aagctacagacagacatctctgaacttgaaaaccgagaattattagaacaagttgcagaa tttgaaaaagcagagattacatcttcaaacaaaaagcccatcttagatgtcacaaagcca aaacttgctccacttaatgaaggtggaacagcagaactcctaaacaaggattttataaag gcccaagacttaagtaacttagaaaacactgtcgctgccttaaagagtgagtttcagaag acacttaatgacaagacagaaaaccagaagtcactggaggagaatctggcgacagccaag cacgatctactcagggttcaggagcagctgcacatggctgaaaaggaattagaaaagaaa tttcagcaaacagcagcttatcgaaacatgaaagagattcttaccaagaagaatgaccaa atcaaagatctgaggaaaagactggcacaaagtgtagccagaaacagagccacagcaaca cagagctgtagcctgcagctgccaacccagtgccagttctgtgcagccatgttgccttca gctctggccagcagttcctccgatcctggaggctctctgcgtcttgttatatttgccaaa tattctaaattgcttggaatagctcagtga