GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:40:54 Sequence gi568815581r:19638323_19845129 : 206807 bp : 45.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 7823 7862 40 -1.06 1.01 Init + 10223 10384 162 1 0 85 18 200 0.797 10.44 1.02 Intr + 10641 10802 162 2 0 56 107 352 0.822 34.07 1.03 Intr + 13225 13456 232 0 1 47 41 183 0.224 6.85 1.04 Intr + 14225 14310 86 0 2 64 110 30 0.164 2.34 1.05 Intr + 18044 18252 209 1 2 -7 109 145 0.838 4.98 1.06 Intr + 19423 19540 118 1 1 73 76 28 0.962 0.57 1.07 Intr + 22805 22946 142 1 1 86 63 122 0.998 9.43 1.08 Intr + 25011 25177 167 1 2 92 121 85 0.993 11.88 1.09 Intr + 26626 26725 100 0 1 91 92 138 0.986 14.18 1.10 Term + 27564 27718 155 1 2 24 44 70 0.389 -5.72 1.11 PlyA + 28458 28463 6 1.05 2.48 PlyA - 29937 29932 6 1.05 2.47 Term - 33251 33162 90 2 0 67 43 63 0.126 -2.58 2.46 Intr - 40584 40391 194 1 2 122 49 130 0.395 11.71 2.45 Intr - 41717 41630 88 2 1 108 90 36 0.999 5.34 2.44 Intr - 43139 43045 95 0 2 90 107 108 0.999 12.58 2.43 Intr - 43348 43216 133 1 1 110 71 129 0.976 13.62 2.42 Intr - 64352 64283 70 1 1 113 75 60 0.356 6.38 2.41 Intr - 64845 64770 76 1 1 98 30 39 0.646 -2.33 2.40 Intr - 65856 65748 109 0 1 80 82 95 0.976 7.96 2.39 Intr - 66372 66331 42 0 0 113 75 24 0.735 2.04 2.38 Intr - 67181 67114 68 0 2 86 91 107 0.999 9.42 2.37 Intr - 68439 68326 114 1 0 84 79 114 0.997 10.52 2.36 Intr - 69521 69424 98 1 2 86 75 177 0.982 15.85 2.35 Intr - 70077 69980 98 0 2 36 99 80 0.947 2.71 2.34 Intr - 71910 71740 171 0 0 82 81 27 0.285 1.54 2.33 Intr - 74423 74381 43 1 1 122 94 1 0.972 2.34 2.32 Intr - 75651 75503 149 0 2 66 85 226 0.357 19.13 2.31 Intr - 77014 76794 221 2 2 86 78 241 0.890 20.92 2.30 Intr - 82315 82268 48 2 0 107 66 25 0.012 0.85 2.29 Intr - 85832 85709 124 2 1 18 87 105 0.010 3.56 2.28 Intr - 89998 89846 153 1 0 122 55 10 0.019 1.37 2.27 Intr - 93608 93447 162 2 0 90 31 74 0.016 2.07 2.26 Intr - 100131 100001 131 1 2 132 100 84 0.902 14.41 2.25 Intr - 100806 100674 133 0 1 22 92 183 0.574 12.32 2.24 Intr - 101352 101186 167 1 2 77 98 405 0.999 40.08 2.23 Intr - 102155 102014 142 2 1 106 82 270 0.999 28.13 2.22 Intr - 102888 102771 118 2 1 27 87 197 0.641 13.97 2.21 Intr - 103890 103682 209 0 2 41 97 328 0.992 26.88 2.20 Intr - 104308 104223 86 2 2 66 113 171 0.997 16.94 2.19 Intr - 105141 104910 232 0 1 83 57 545 0.951 48.25 2.18 Intr - 106812 106646 167 1 2 105 42 345 0.899 31.28 2.17 Intr - 117406 117268 139 1 1 -11 74 173 0.801 6.04 2.16 Intr - 117837 117694 144 0 0 74 80 126 0.954 10.88 2.15 Intr - 119981 119815 167 0 2 63 85 31 0.604 -0.02 2.14 Intr - 123666 123601 66 1 0 63 81 138 0.896 9.48 2.13 Intr - 139394 139259 136 2 1 37 101 134 0.938 9.74 2.12 Intr - 142307 142150 158 0 2 68 46 154 0.961 8.93 2.11 Intr - 142782 142664 119 2 2 88 86 119 0.999 11.81 2.10 Intr - 143745 143567 179 0 2 35 47 178 0.952 7.12 2.09 Intr - 145583 145375 209 0 2 104 60 160 0.929 13.60 2.08 Intr - 147722 147615 108 0 0 72 110 34 0.628 4.26 2.07 Intr - 157403 157300 104 1 2 66 82 83 0.976 5.32 2.06 Intr - 157960 157773 188 1 2 53 98 100 0.933 5.99 2.05 Intr - 161253 161173 81 0 0 97 91 38 0.844 4.83 2.04 Intr - 163600 163455 146 2 2 20 106 68 0.404 1.80 2.03 Intr - 170944 170882 63 2 0 88 103 3 0.200 0.49 2.02 Intr - 172116 172056 61 0 1 121 75 17 0.287 2.01 2.01 Init - 181182 181171 12 0 0 80 114 12 0.403 2.80 2.00 Prom - 193972 193933 40 -1.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:19638323_19845129|GENSCAN_predicted_peptide_1|510_aa MANTFPPSLHPAAPWPVAALGSRTAHSTPYIPARCQSRGEGGGRVGEAVNSGCHDQAMEL EVRRVRQAFLSGRSRPLRFRLQQLEALRRMVQEREKDILTAIAADLCKSEFNVYSQEVIT VLGEIDFMLENLPEWVTAKPVKKNVLTMLDEAYIQPQPLGVVLIIGAWNYPFVLTIQPLI GAIAAGNAVIIKPSELSENTAKILAKLLPQYLDQDLYIVINGGVEETTELLKQRFDHIFY TGNTAVGKIVMEAAAKHLTPVTLELGGKSPCYIDKDCDLDIVCRRITWGKYMNCGQTCIA PDYILCEASLQNQIVWKIKETVKEFYGENIKESPDYERIINLRHFKRILSLLEGQKIAFG GETDEATRYIAPTVLTDVDPKTKVMQEEIFGPILPIVPVKNVDEAINFINEREKPLALYV FSHNHKLIKRMIDETSSGGVTGNDVIMHFTLNSFPFGGVVSCSMGCDGPVWQDHLFLVKE TLSYDQGALTYIATGPCLGGKHIASTQHSD >gi568815581r:19638323_19845129|GENSCAN_predicted_CDS_1|1533_bp atggccaacaccttccctccatccctacaccccgccgccccctggcccgtggccgcgctc ggctcccgcactgctcactccaccccctacatcccagcccgctgccagagccggggagag ggcgggggccgcgtgggcgaggccgtgaacagcggctgtcacgaccaggccatggagctc gaagtccggcgggtccgacaggcgttcctgtccggccggtcgcgacctctgcggtttcgg ctgcagcagctggaggccctgcggaggatggtgcaggagcgcgagaaggatatcctgacg gccatcgccgccgacctgtgcaagagtgaattcaatgtgtacagtcaggaagtcattact gtccttggggaaattgattttatgcttgagaatcttcctgaatgggttactgctaaacca gttaagaagaacgtgctcaccatgctggatgaggcctatattcagccacagcctctggga gtggtgctgataatcggagcttggaattaccccttcgttctcaccattcagccactgata ggagccatcgctgcaggaaatgctgtgattataaagccttctgaactgagtgaaaataca gccaagatcttggcaaagcttctccctcagtatttagaccaggatctctatattgttatt aatggtggtgttgaggaaaccacggagctcctgaagcagcgatttgaccacattttctat acgggaaacactgcggttggcaaaattgtcatggaagctgctgccaagcatctgacccct gtgactcttgaactgggagggaaaagtccatgttatattgataaagattgtgacctggac attgtttgcagacgcataacctggggaaaatacatgaattgtggccaaacctgcattgca cccgactatattctctgtgaagcatccctccaaaatcaaattgtatggaagattaaggaa acagtgaaggaattttatggagaaaatataaaagagtctcctgattatgaaaggatcatc aatcttcgtcattttaagaggatactaagtttgcttgaaggacaaaagatagcttttggt ggggagactgatgaggccacacgctacatagccccaacagtacttaccgatgttgatcct aaaaccaaggtgatgcaagaagaaatttttggaccaattcttccaatagtgcctgtgaaa aatgtagatgaggccataaatttcataaatgaacgtgaaaagcctctggctctttatgta ttttcgcataaccataagctcatcaaacggatgattgatgagacatccagtggaggtgtc acaggcaatgacgtcattatgcacttcacgctcaactctttcccatttggaggagtggtc agttgctccatgggatgcgatggcccagtttggcaggaccatctgttcctggttaaggaa accttgtcctatgaccaaggagccttgacctatatagccacaggtccttgtctaggaggt aaacatattgccagcacccagcacagtgactga >gi568815581r:19638323_19845129|GENSCAN_predicted_peptide_2|1936_aa MVTRVICQWGLLADVLQMNSWCVEGPDSNHILLSVSMNLTTLDTSYLQWYEGPTPAPWAS SGRDHAPQYQQTQHRQLDEGSPLGLLGLTHLPLWLVPFLSNSVSAAVGILRAMTPGVETP QTTAPFKIPKTQASSNLLALVTRHGPAEEQSKDGNEPRECAHCLLVQGSERQRAEQQSKA VFGRSVSTGKLSDQQGKTPICRHQGSTDSLNTERPMDIGTAASSKAVLFTVGSPPHSAAA PTCTHMFLRTRTTSVGPSNSGGSLCAMSGRVCVGSPPGPGFGSSPPGAEAAPSLRYVPYG ASPPSLEGLITFEAPELPEETLMEREHTDTLRHLNVMLMFTECVLDLTAMRGGNPELCTS AVSLYQIQESVVVDQISQLSKDWGRVEQLVLYMKAAQLLAASLHLAKAQIKSGKLSPSTA VKQVVKNLNERYKFCITMCKKLTEKLNRFFSDKQRFIDEINSVTAEKLIYNCAVEMVQSA ALDEMFQQTEDIVYRYHKAALLLEGLSRILQDPADIENVHKLEPGLTEAEEEEEEEEEEK EEQGVPRPEMPSSWALYILRVLWAFALWRPLPRMREQSRPFHIFQHLCHVNRKLDIGELV ISKESPGPAQDTPQVAFSHEAVKAHGQLVVIQRIAHGVVAVLQQEGQLDQRVPPEAHVIP ELLEAIQDGLSQVTPPATRPLAGCGQVVNGIFKGAMSKISEAVKRARAAFSSGRTRPLQF RIQQLEALQRLIQEQEQELVGALAADLHKNEWNAYYEEVVYVLEEIEYMIQKLPEWAADE PVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNLTIQPMVGAIAAGNSVVLKPSELSEN MASLLATIIPQYLDKDLYPVINGGVPETTELLKERFDHILYTGSTGVGKIIMTAAAKHLT PVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNSGQTCVAPDYILCDPSIQNQIVEKLK KSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEGQKVAYGGTGDAATRYIAPTILTDVD PQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREKPLALYMFSSNDKETEASGWVLVQVI KKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFETFSHRRSCLVRPLM NDEGLKVRYPPSPAKSRREDCAHLWVPPAGSRGGDCARGAHLDGSVFGRTCRVMRQEQVN LRMAAATEEEILKTPTGLGLSSFIPKGNRNCLGRRGSHTQSSCPGTGPGLAGQAAAGNRE PQGECVQTCGSQRQLHIREGGLEECLANDSLVPKVKAVATLGFCSFGSSFPKEVLDMRVG DLSGHWCGALKAGEACLAGLGCVGTCGPESDLAAVLQFLFQVLTFMIYIVSTVFCGHLGK VELASVTLAVASFGSPNKKHVGVILQRGALVLLLCCLPCWALFLNTQHILLLFRQDPDVS RLTQDYVMIFIPGLPPSASLFDLQLSDACYDWQFCVLDSPTRLSSLSDSVASMAPGGMFG ARELGREEVRVKKITWPQVLSGVVGNCVNGVANYALVSVLNLGVRGSAYANIISQFAQTV FLLLYIVLKKLHLETWAGWSSQCLQDWGPFFSLAVPSMLMICVEWWAYEIGSFLMGLLSV VDLSAQAVIYEVATVTYMRHSHRLAYAAHVTRIPLGLSIGVCVRVGMALGAADTVQAKRS AVSGVLSIVGISLVLGTLISILKNQLGHIFTNDEDVIALVSQVLPVYSVFHVFEAICCVY GGVLRGTGKQAFGAAVNAITYYIIGLPLGILLTFVVRMRIMGLWLGMLACVFLATAAFVA YTARLDWKLAAEEAKKHSGRQQQQRAESTATRPGPEKAVLSSVATGSSPGITLTTYSRSE CHVDFFRTPEEAHALSAPTSRLSVKQLVIRRGAALGAASATLMVGLTEVLTNFPGKVNFP VKQDAFHEQDKDLHLF >gi568815581r:19638323_19845129|GENSCAN_predicted_CDS_2|5811_bp atggtcactcgggtgatatgccagtggggactgctggcagacgtgcttcaaatgaattct tggtgtgtggagggccctgatagtaaccacattctgctctctgtttctatgaatttgact actttagatacctcatatctgcagtggtacgaaggtccaacaccagccccatgggcttcc tccggccgggatcatgctccccagtaccagcagacacagcacagacagttggacgaaggc tctccactgggtcttctaggccttactcaccttcccctttggttggtaccattcctgagc aattcagtcagtgctgctgtgggcatcctcagggccatgactccaggagtagaaactcct cagaccacagctcctttcaaaatccctaaaactcaagcatcttccaacctgttagccttg gttactcgtcatgggcctgctgaagaacagtcgaaagatgggaatgagccacgggaatgt gcccattgcctcttagtgcaaggaagtgagaggcagcgggccgagcagcagagcaaggca gtgtttggcagatctgtcagtaccgggaagttatcagatcaacaaggaaagactcctata tgtcgacatcagggcagcacagacagtttaaatacagaacgaccaatggatataggtaca gcagcaagttccaaggctgtcctcttcactgtagggtctcctccacacagtgcggcagcc cccacttgtacccacatgttccttcgaacaagaacaacctcagtggggcccagcaactcc gggggctctctttgtgccatgagtggccgcgtgtgcgtggggtccccgcctggcccaggc ttcggctcttcccctccaggagcagaggcagctcccagcctgagatacgtgccttacggt gcttcaccccccagcctagaggggctcatcacctttgaagcccctgaactgccggaggag acgctgatggagcgggaacacacagacaccttacgccatctgaatgtgatgctgatgttc actgagtgtgtgctggacctgacagccatgaggggaggaaaccctgagctgtgcacatct gctgtgtccttgtaccagatccaggagagtgtggtggtggaccagatcagtcagctgagc aaagactgggggcgggtggagcagctggtgttgtacatgaaagcagcacagctgcttgcg gcttctctgcatcttgccaaagcccagatcaagtccgggaaactgagcccatccacagct gtgaaacaagttgtcaagaatctgaacgaacgatataaattctgcatcaccatgtgcaag aaacttacagaaaagctgaatcgattcttctctgacaaacagaggtttattgatgaaatc aacagtgtgactgcagagaaactcatctataattgtgctgtagaaatggttcagtctgca gccctggatgagatgtttcagcagaccgaagatattgtttatcgctatcataaggcagcc cttcttttggaaggcctaagtaggattctacaggaccctgcagatattgaaaatgtgcat aaattggagcccgggctcactgaggccgaggaggaggaggaggaggaggaggaggagaag gaggagcaaggggtcccaaggcctgaaatgccatcatcttgggctctgtacatccttcgt gtgctctgggcctttgccctgtggagacctctcccaaggatgagggagcagagccggccc ttccatatcttccagcatctctgccatgtaaacagaaaactagacattggagagctggta atcagcaaagagagccctggcccagcacaggacacgccccaggtggcgttttcacatgag gcggtcaaagcacatggccagctcgttgtcatccagcgcattgcacatggtgtggttgcg gtgctccagcaggagggccagctggaccagagggtccctcccgaagcccatgttatccct gagttgctggaggccatccaggacggcttgtctcaggtgacaccccctgctaccaggccc ctggctggctgtggccaggtggtcaatggcatcttcaaaggcgccatgagcaagatcagc gaggccgtgaagcgcgcccgcgccgccttcagctcgggcaggacccgtccgctgcagttc cggatccagcagctggaggcgctgcagcgcctgatccaggagcaggagcaggagctggtg ggcgcgctggccgcagacctgcacaagaatgaatggaacgcctactatgaggaggtggtg tacgtcctagaggagatcgagtacatgatccagaagctccctgagtgggccgcggatgag cccgtggagaagacgccccagactcagcaggacgagctctacatccactcggagccactg ggcgtggtcctcgtcattggcacctggaactaccccttcaacctcaccatccagcccatg gtgggcgccatcgctgcagggaactcagtggtcctcaagccctcggagctgagtgagaac atggcgagcctgctggctaccatcatcccccagtacctggacaaggatctgtacccagta atcaatgggggtgtccctgagaccacggagctgctcaaggagaggttcgaccatatcctg tacacgggcagcacgggggtggggaagatcatcatgacggctgctgccaagcacctgacc cctgtcacgctggagctgggagggaagagtccctgctacgtggacaagaactgtgacctg gacgtggcctgccgacgcatcgcctgggggaaattcatgaacagtggccagacctgcgtg gcccctgactacatcctctgtgacccctcgatccagaaccaaattgtggagaagctcaag aagtcactgaaagagttctacggggaagatgctaagaaatcccgggactatggaagaatc attagtgcccggcacttccagagggtgatgggcctgattgagggccagaaggtggcttat gggggcaccggggatgccgccactcgctacatagcccccaccatcctcacggacgtggac ccccagtccccggtgatgcaagaggagatcttcgggcctgtgctgcccatcgtgtgcgtg cgcagcctggaggaggccatccagttcatcaaccagcgtgagaagcccctggccctctac atgttctccagcaacgacaaggagactgaggccagtggctgggtcctggtgcaggtgatt aagaagatgattgcagagacatccagtggtggggtggcggccaacgatgtcatcgtccac atcaccttgcactctctgcccttcgggggcgtggggaacagcggcatgggatcctaccat ggcaagaagagcttcgagactttctctcaccgccgctcttgcctggtgaggcctctgatg aatgatgaaggcctgaaggtcagataccccccgagcccggccaagagcagaagggaagat tgcgcccacctatgggttcctccagcaggcagtcgtggcggggactgcgctcgaggagct caccttgatgggagcgtcttcggcaggacctgcagggttatgcggcaggaacaagttaac ctcagaatggcggcggctacagaggaggaaattttgaaaacaccaactgggcttggcctg agctccttcattcctaaaggcaacagaaattgtttgggcaggagaggcagccacacgcaa agcagctgccctggcacagggccaggccttgcaggacaggcagctgctggcaatcgtgag ccccagggtgaatgtgtccagacctgtggctctcagcgccagctgcacatcagagaggga ggtttagaagaatgcctggcaaatgattctctggtgcccaaagtcaaagcagtggctacc cttggattttgtagctttgggtcctccttccccaaagaggtcctcgatatgcgcgtaggt gacctctcggggcactggtgcggggccctcaaagctggagaggcctgtcttgcagggctc ggctgtgtgggcacctgtggccccgagtctgacctggccgctgtcctccagttcctgttc caggtgctgacttttatgatctacatcgtgagcactgtgttctgcgggcacctgggcaag gtggagctggcatcggtgaccctcgcggtggccagcttcggcagccccaacaagaagcac gtgggcgtgatcctgcagcggggcgcgctggtcctgctcctctgctgcctcccttgctgg gcgctcttcctcaacacccagcacatcctgctgctcttccggcaggacccggacgtgtcc aggttgacccaggactatgtaatgattttcattccaggacttccgcccagtgcttcactg tttgacttgcagcttagtgatgcctgttacgactggcagttctgtgtgctagacagccct accaggctcagctcactttctgacagtgtagcctccatggcaccagggggaatgtttggg gcaagggagctgggcagggaagaggtcagggtgaagaagatcacctggccccaagtcctc agtggtgtggtgggcaactgtgtcaacggtgtggccaactatgccctggtttctgtgctg aacctgggggtcaggggctccgcctatgccaacatcatctcccagtttgcacagaccgtc ttcctccttctctacattgtgctgaagaagctgcacctggagacgtgggcaggttggtcc agccagtgcctgcaggactggggccccttcttctccctggctgtccccagcatgctcatg atctgtgttgagtggtgggcctatgagatcgggagcttcctcatggggctgctcagtgtg gtggatctctctgcccaggctgtcatctacgaggtggccactgtgacctacatgagacac agccatcgtcttgcttatgcagcccatgtcactcggattcccttggggctcagcatcggg gtctgtgtccgagtggggatggctctgggggctgcggatactgtgcaggccaagcgctcg gccgtctcgggcgtgctcagcatagttggcatttccctggtcctgggcaccctgataagc atcctgaaaaatcagctggggcatatttttaccaatgatgaagatgtcattgccctggtg agccaggtcttgccggtttatagtgtctttcacgtgtttgaggccatctgttgtgtctat ggcggagttctgagaggaactgggaagcaggcctttggtgccgctgtgaatgccatcaca tattacatcatcggcctaccactgggcatccttctgacctttgtggtcagaatgagaatc atgggcctctggctgggcatgctggcctgtgtcttcctggcaactgctgcctttgttgct tatactgcccggctggactggaagcttgctgcagaggaggctaagaaacattcaggccgg cagcagcagcagagagcagagagcactgcaaccagacctgggcctgagaaagcagtccta tcttcagtggctacaggcagttcccctggcattaccttgacaacgtattcaaggtctgag tgccacgtggacttcttcaggactccagaggaggcccacgccctttcagctcctaccagc agactatcagtgaaacagctggtcatccgccgtggggctgctctgggggcggcgtcagcc acactgatggtggggctcacggaagtccttaccaacttccctggaaaagtcaatttccct gtcaaacaggatgctttccatgagcaagacaaggacttgcacctcttctag