GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:43:50 Sequence gi568815575f:11012061_11221807 : 209747 bp : 39.74% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 575 570 6 1.05 1.01 Sngl - 8171 7266 906 2 0 42 42 349 0.930 21.68 1.00 Prom - 10441 10402 40 -6.15 2.02 PlyA - 10601 10596 6 1.05 2.01 Sngl - 11862 11044 819 0 0 68 43 442 0.519 33.19 2.00 Prom - 14440 14401 40 -3.85 3.00 Prom + 42470 42509 40 -3.65 3.01 Init + 46004 46162 159 1 0 84 50 87 0.567 4.37 3.02 Intr + 66820 66903 84 0 0 57 37 99 0.049 0.80 3.03 Intr + 69857 69937 81 1 0 79 42 66 0.195 0.02 3.04 Intr + 76184 76579 396 1 0 56 58 164 0.142 4.05 3.05 Intr + 86306 86442 137 1 2 41 97 103 0.309 4.95 3.06 Intr + 94689 94805 117 0 0 42 36 115 0.005 0.36 3.07 Intr + 99192 99363 172 0 1 51 28 100 0.001 -0.68 3.08 Intr + 102606 102926 321 2 0 64 49 159 0.113 4.73 3.09 Intr + 105207 105355 149 2 2 50 96 110 0.976 6.11 3.10 Intr + 106441 106560 120 1 0 73 85 77 0.946 4.59 3.11 Intr + 109552 109746 195 1 0 92 15 141 0.182 4.81 3.12 Term + 120070 120379 310 1 1 80 38 170 0.510 4.75 3.13 PlyA + 120732 120737 6 1.05 4.20 PlyA - 121901 121896 6 1.05 4.19 Term - 127470 126803 668 1 2 75 52 507 0.963 38.80 4.18 Intr - 128123 127971 153 0 0 64 58 94 0.323 3.12 4.17 Intr - 130253 130173 81 0 0 79 72 62 0.628 2.39 4.16 Intr - 132188 131920 269 1 2 46 88 178 0.542 9.85 4.15 Intr - 144566 144469 98 2 2 79 78 82 0.132 4.19 4.14 Intr - 157624 157445 180 1 0 112 78 170 0.897 17.54 4.13 Intr - 166188 166040 149 1 2 67 94 165 0.997 14.03 4.12 Intr - 167392 167242 151 1 1 90 27 209 0.999 13.81 4.11 Intr - 170058 170003 56 1 2 88 108 25 0.976 2.28 4.10 Intr - 174371 174176 196 2 1 84 115 178 0.996 18.17 4.09 Intr - 176924 176668 257 0 2 77 72 146 0.941 8.14 4.08 Intr - 184936 184865 72 2 0 82 98 20 0.025 0.86 4.07 Intr - 201063 200782 282 1 0 88 29 228 0.047 13.37 4.06 Intr - 202778 202655 124 2 1 70 40 137 0.051 6.34 4.05 Intr - 202950 202897 54 0 0 97 76 117 0.683 9.66 4.04 Intr - 203353 203186 168 1 0 61 70 112 0.338 5.92 4.03 Intr - 203734 203627 108 1 0 87 68 39 0.160 1.36 4.02 Intr - 204137 203887 251 0 2 67 64 120 0.217 3.73 4.01 Intr - 208385 208210 176 1 2 79 39 99 0.040 2.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:11012061_11221807|GENSCAN_predicted_peptide_1|301_aa MIISIDAEKAFDKIQQHFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGFPLSPLLFNIVLEVLAKAIRQEKEIKGIQLGKEEVKLSLFADDMILYLENPIVS AQSLLNDFSKVSGYKINVRKSQAFFTPITDKQIQIMSELLFTIASKRIKYLGIQLARDVK DLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMTILPKVIYRFSAILGCKLPMTFF TELEKITLKFIWNQKRAHIPKTILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRAID Q >gi568815575f:11012061_11221807|GENSCAN_predicted_CDS_1|906_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacacttcatgcta aaaactctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggattccctctctcaccactcctattcaacatagtgttggaagtt ctggccaaggcaatcaggcaggagaaagaaataaagggtattcagttaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattctatatttagaaaaccccatcgtctca gcccaaagtctccttaatgacttcagcaaagtctcaggatacaaaatcaatgtgcgaaaa tcacaagcattttttacaccaataacagacaaacagatccaaatcatgagtgaactccta ttcacaattgcttcaaagagaataaaatacctaggaatccaacttgcaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggacacaaacaaa tggaagaacattccatgctcatggataggaagaatcaacattgtgaaaatgaccatactg cccaaggtaatttatagattcagtgccatccttggatgcaagctaccaatgactttcttc acagaattggaaaaaattactttaaagttcatatggaaccaaaaaagagctcacattccc aagacaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaagcta tactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagctatagac caatag >gi568815575f:11012061_11221807|GENSCAN_predicted_peptide_2|272_aa MRKSQRQKAENSKNQNASSPPKDHNSSPAREQNWMENEFDKLTEVCFRRCIITNSSQLKE HVLTQCKEAKNLEKRLEELLTRVTSLEKNINDLMELKNIARELCEAYTNINSRTNQVEER ISEIEDQLNEIKDEDKIKKNEKEKQSLQEIWDYVKRPNLHLIGVPEIDGENGTKLGNTLQ DILQENFLNLARQANIQIQEIQRIPQRYSLRRATPRYIIVRFAKVEMKEKNVKGSQRERS GYPQREAHQTNSGSLGINPTSQKRVGANIQHS >gi568815575f:11012061_11221807|GENSCAN_predicted_CDS_2|819_bp atgaggaaaagccagcgccaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactcctcaccagcaagggaacaaaactggatggagaatgagtttgat aaattgacagaagtatgcttcagaaggtgcataataacaaactcctcccagctaaaggag catgttctaacccaatgcaaggaagctaagaaccttgagaaaaggttagaggaattgcta actagagtaaccagtttagagaagaacataaatgacctgatggagctgaaaaacatagca cgagaactttgtgaagcatacacaaatatcaatagtcgaaccaatcaagtggaagaaagg atatcagagattgaagatcaacttaatgaaataaaggatgaagacaagattaaaaagaat gaaaaggaaaaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacat ctgattggtgtacctgaaattgatggggagaatggaaccaagttgggaaacactcttcag gatattctccaggagaacttcctcaacctagcaagacaggccaacattcaaattcaggaa atacagagaataccacaaagatactccttgaggagagcaaccccaaggtacataattgtc agatttgccaaggttgaaatgaaggaaaaaaatgttaagggcagccagagagaaaggtca ggttacccacaaagggaagcccatcagactaacagtggatctcttggcataaaccctaca agccagaagagagtgggggccaatattcaacattcttaa >gi568815575f:11012061_11221807|GENSCAN_predicted_peptide_3|746_aa MVPENSEPLIGEKTALIVLNLSKMMGGEEIHHGREQGETRQNFNKSLMVTCDLESLVVLR VLQADSSQLAATWQQKAATAKLLCAARGHEPEVSVEEGTGLSHTHAVRKSITERIKHLRI QLTREVKGVYNENYKTLLKEIRDDTSKWKNIPCSWIGTINVIKMAILPKAVYRFNAILVK LPMTFFTELEKTILKFIWNRKRARISKAIISKKNKAGGITLPDFKLYYRVTITKTACMVL LTVREPRVGALRLQRVPFPGHLISGPRAFLLGSEWCVDMAASLWPRKSSVYFQEQEVTLA FDVDTVPLSTHVMNAGRLHGTQTYCPPPACPEVPPSKKEVGVRENAGGDRSHSGDRVRGR RRRRRRRREVTAALGSGWRLKADFGMLASSVLDRFKQKKPGFSYGGNHWSEIQLFSAPIE RVRCPAFMVTPFLYLIISGCPVNTEPSGPTCEKKTYSVPAHQERAYEYVECPIRGTAAEN KENLDPSNLMPPPNQTPAPDQPFALSTVREESSIPRADSEKKWVYPSEQMFWNAMLKKGW KWKDEDISQKDMYNIIRIHNQNNEQAWKEILKWEALHAAYELPFDRHDWIINRCGTEVRY VIDYYDGGEVNKDYQFTILDVRPALDSLSAVWDRMKVAWWRWTSKERDQICLHHLQVPVE NENVGLLVQKYENSKMALTEYSNRCWALPSMGSSVTAKETNPNCDHTIPDFCQVLSKVFA PLPETVFPVWGLTSKSKQHFVLWSFL >gi568815575f:11012061_11221807|GENSCAN_predicted_CDS_3|2241_bp atggttccggaaaacagtgagcctctcataggtgagaagacggccctaattgttctcaat ctgagcaaaatgatgggtggagaagaaatccaccatggaagagaacaaggagaaactagg caaaattttaacaaaagtttaatggtcacatgtgatctggaaagccttgttgtgctgaga gtgctgcaggcagacagctctcagctggcagcaacttggcagcagaaggctgccacggcc aagttactctgtgcagcaagaggacatgagccagaggtaagtgtggaggagggtaccgga ctgagccacactcacgctgttagaaaaagtatcacagaaagaataaaacacctaagaata cagctaaccagggaggtgaaaggtgtctacaatgagaactataaaacactactcaaagaa atcagagatgacacaagcaaatggaaaaacatcccatgctcatggataggaacaatcaat gtcattaaaatggccatactgcccaaagcagtttacagattcaatgctattcttgtcaag ctgccaatgacattcttcacagaactagaaaaaactattttaaaattcatatggaaccga aaaagagcccgaatatccaaggcaatcataagcaaaaagaacaaagctggaggcatcacg ttacctgacttcaaactatactacagggtcacaataaccaaaacagcatgcatggtactg ctgacagtcagagagcccagagttggagctctacgactccaaagggtgccctttccaggg cacctcatctctgggcccagggcttttctcctaggctcggaatggtgtgtggatatggct gccagcctgtggcctaggaaaagtagtgtttatttccaagagcaagaggtaaccctggcc tttgatgtagacacggtccctctgagtacacatgtgatgaatgctggcagactccatgga acacagacctattgcccgcctcccgcgtgcccggaagtcccgccttctaaaaaggaagta ggggtgcgtgagaacgccggcggtgaccgcagccacagcggtgaccgagtgagaggaagg cggcggcggcggcggcggcggcgtgaagtcactgctgctctgggttcgggttggcgactg aaggcggattttggaatgttagctagttcagttcttgatagatttaaacagaagaaacct ggcttttcctatggtggtaatcattggtcagaaattcagctctttagtgctccaattgag agagttagatgtcctgccttcatggtgacaccatttttatacttgattatttcaggctgt ccagtgaatacagagccatctggcccaacctgtgagaagaaaacatactctgtgcctgcc caccaggaacgcgcctatgagtacgtggagtgtcccattaggggcactgcggctgagaat aaggagaacctagatccttcaaatctgatgccaccaccaaatcaaacaccagctccagat cagccatttgcattgtctactgtcagagaagagtcatccattccgagagcagattcagag aaaaagtgggtttacccttctgagcagatgttctggaatgcaatgttaaagaaagggtgg aagtggaaggatgaggatatcagtcagaaggatatgtataatatcattagaattcacaat cagaataacgagcaggcttggaaggagattttgaagtgggaagcccttcatgctgcgtat gagttgccttttgataggcacgattggatcataaaccgttgcgggacagaagttagatat gtgattgattattatgatggtggtgaagtcaacaaggactaccagttcaccatcctggac gtccgtcctgccttagattcactttcggcagtatgggacagaatgaaagtcgcttggtgg cgttggacctccaaagaacgtgaccagatctgcctacatcatttgcaagttccagtggaa aatgaaaatgtggggctgcttgttcaaaagtatgaaaattccaagatggcactgacagag tattcaaataggtgctgggctcttccaagcatggggtcctctgtgactgcaaaggaaacc aaccctaattgtgaccacaccatccctgatttctgccaagttctttctaaggtgtttgct ccacttccagagactgtctttcctgtttggggtttaacttcaaaatccaagcagcatttt gtgctatggtcatttctctga >gi568815575f:11012061_11221807|GENSCAN_predicted_peptide_4|1164_aa XVSAERSAVSLMGFPLRVTRPFFLAAFNIFSFISTLVNLTIMCLGVALLEEYLCGVKQHE EDAVVDIFLWRNVLFVDPFGSLPIKALKQRYTWNHWDPDHFTEAFLQALCGLSPSKQGTT CQIPWTSGPHPSRRAALRVSVTLRADKPSPWACTFFRLPWTVFNSMLHFLVGRNLSLHWS SQLPRTPELRGLWGILAEPLRQSSALLLGPPATPCGRKSCGMTEDLPSYLSGSHPGLALI QAPDNDTGAISSQWTCRRWHHKAPLRAWVLGSLESSQPACPHSSLIGRLSAHVGGQDSEH SPFPAPCFLCQDEELEYCGNRGLREPPLPPRSSATMGMPAGGVAPHGPTTCMHTEPTNQR GARAGGLAALARTRDSGPQRSLVVSASTDGQKRKKSLRKKLDSLGKEKNKDKEFIPQAFG MPLSQVIANDRAYKLKQDLQRDEQKDASDFVASLLPFGNKRQNKELSSSNSSLSSTSETP NESTSPNTPEPAPRARRRGAMSVDSITDLDDNQSRLLEALQLSLPAEAQSKKEKARDKKL SLNPIYRQVPRLVDSCCQHLEKHGLQTVGIFRVGSSKKRVRQLREEFDRGIDVSLEEEHS VHDVAALLKEFLRDMPDPLLTRELYTAFINTLLLEPEEQLGTLQLLIYLLPPCNCDTLHR LLQFLSIVARHADDNISKDGQEVTGNKMTSLNLATIFGPNLLHKQKSSDKEFSVQSSARA EESTAIIAVVQKMIENYEALFMVPPDLQNEVLISLLETDPDVVDYLLRRKASQSSSPDML QSEVSFSVGGRHSSTDSNKASSGDISPYDNNSPVLSERSLLAMQEDAAPGGSEKLYRVPG QFMLVGHLSSSKSRESSPGPRLGKDLSEEPFDIWGTWHSTLKSGSKDPGMTGLEMAYITS VHICCPDLISVALLTAREAGNCHVGSKGERLGDHQVSLSCIDGSSGDIFESSSLRAGPCS LSQGNLSPNWPRWQGSPAELDSDTQGARRTQAAAPATEGRAHPAVSRACSTPHVQVAGKA ERPTARSEQYLTLSGAHDLSESELDVAGLQSRATPQCQRPHGSGRDDKRPPPPYPGPGKP AAAAAWIQGPPEGVETPTDQGGQAAEREQQVTQKKLSSANSLPAGEQDSPRLGDAGWLDW QRERWQIWELLSTDNPDALPETLV >gi568815575f:11012061_11221807|GENSCAN_predicted_CDS_4|3495_bp nnggtttctgccgagagatccgctgttagtctgatgggcttccctttgagggtaacccga cctttctttctggctgcctttaacattttttccttcatttcaactttggtgaatctgaca attatgtgtcttggagttgctcttctcgaggagtatctttgtggtgtaaaacagcatgag gaagacgctgttgtggacatttttttgtggagaaatgtactatttgtggacccttttggc tctctgcccattaaagctttaaagcagcgctatacatggaatcactgggaccctgaccat tttacagaggcatttttgcaagctctttgcggtttaagtcccagtaaacaaggcactacg tgtcagatcccttggacgtcaggaccgcatccatcgagaagagctgctctaagagtatct gtaaccctgagagcagacaaaccgtccccttgggcgtgcaccttcttccggctcccctgg actgtttttaactccatgctccatttcctggttggccgtaacctgagcctccactggagc tcgcagctgccgcgcacacctgaactccgcggcctctgggggatcctcgcggagccattg agacagagctcggccctgctcctgggtcccccagcgaccccgtgtggccggaagagctgt gggatgacagaggatctccccagctatctcagtgggagtcacccgggccttgccctgatt caggccccagacaacgacactggggccatttcctcacagtggacctgcagacgctggcac cacaaggctcctctgagggcctgggtcctcggttctcttgagagcagccagcctgcctgc ccgcactcctcgctgattggtcgactcagtgcacatgtgggtggacaggattccgagcac tccccattcccggctccttgcttcctttgccaggatgaggagctggaatattgtgggaac agaggcctcagagagccacccttgccccctaggtcctcagctaccatgggcatgcctgca gggggcgtggccccacacggccccactacctgcatgcacactgagccgaccaatcagcga ggagcacgggcaggcgggctggctgctcttgcaagaacccgggactcaggccctcagagg agcctcgtggtgtcggcatctacagatggacaaaagagaaagaaatctttaagaaagaaa ctggattcactaggaaaggagaaaaacaaagacaaagaattcatcccacaggcatttgga atgcccttatcccaagtcattgcgaatgacagggcctataaactcaagcaggacttgcag agggacgagcagaaagatgcatctgactttgtggcttccctcctcccatttggaaataaa agacaaaacaaagaactctcaagcagtaactcatctctcagctcaacctcagaaacaccg aatgagtcaacgtccccaaacaccccggaaccggctcctcgggctaggaggaggggtgcc atgtcagtggattctatcaccgatcttgatgacaatcagtctcgactactagaagcttta caactttccttgcctgctgaggctcaaagtaaaaaggaaaaagccagagataagaaactc agtctgaatcctatttacagacaggtccctaggctggtggacagctgctgtcagcaccta gaaaaacatggcctccagacagtggggatattccgagttggaagctcaaaaaagagagtg agacaattacgtgaggaatttgaccgtgggattgatgtctctctggaggaggagcacagt gttcatgatgtggcagccttgctgaaagagttcctgagggacatgccagacccccttctc accagggagctgtacacagctttcatcaacactctcttgttggagccggaggaacagctg ggcaccttgcagctcctcatataccttctacctccctgcaactgcgacaccctccaccgc ctgctacagttcctctccatcgtggccaggcatgccgatgacaacatcagcaaagatggg caagaggtcactgggaataaaatgacatctctaaacttagccaccatatttggacccaac ctgctgcacaagcagaagtcatcagacaaagaattctcagttcagagttcagcccgggct gaggagagcacggccatcatcgctgttgtgcaaaagatgattgaaaattatgaagccctg ttcatggttcccccagatctccagaacgaagtgctgatcagcctgttagagaccgatcct gatgtcgtggactatttactcagaagaaaggcttcccaatcatcaagccctgacatgctg cagtcggaagtttccttttccgtgggagggaggcattcatctacagactccaacaaggcc tccagcggagacatctccccttatgacaacaactccccagtgctgtctgagcgctccctg ctggctatgcaagaggacgcggccccggggggctcggagaagctttacagagtgccaggg cagtttatgctggtgggccacttgtcgtcgtcaaagtcaagggaaagttctcctggacca aggcttgggaaagatctgtcagaggagcctttcgatatctggggaacttggcattcaaca ttaaaaagcggatccaaagacccaggaatgacaggcctagaaatggcttacatcacatct gtccacatttgttgcccagacctcatttctgtggccctcctaactgccagagaggctgga aactgccatgtaggaagtaaaggggaacgattgggtgatcaccaagtcagcctttcctgc attgatggttcctctggagacatttttgaaagcagctccctaagagcggggccctgctcc ctttctcaagggaacctgtccccaaattggcctcggtggcaggggagccccgcagagctg gacagcgacacgcagggggctcggaggactcaggccgcagcccccgcgacggagggcagg gcccaccctgcggtgtcgcgcgcctgcagcacgccccacgtccaggtggcagggaaagcc gagcggcccacggccaggtcggagcagtacttgaccctgagcggcgcccacgacctcagc gagagtgagctggatgtggccgggctgcagagccgggccacacctcagtgccaaagaccc catgggagtgggagggatgacaagcggcccccgcctccatacccgggcccagggaagccc gcggcagcggcagcctggatccaggggcccccggaaggcgtggagacacccacggaccag ggaggccaagcagccgagcgagagcagcaggtcacgcagaaaaaactgagcagcgccaac tccctgccagcgggcgagcaggacagtccgcgcctgggggacgctggctggctcgactgg cagagagagcgctggcagatctgggagctcctgtcgaccgacaaccccgatgccctgccc gagacgctggtctga