GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:08:05 Sequence gi568815589r:121241205_121470187 : 228983 bp : 43.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 3399 3438 40 -1.76 1.01 Init + 15273 15315 43 2 1 80 86 22 0.452 1.78 1.02 Intr + 23950 24027 78 2 0 102 58 29 0.622 0.82 1.03 Intr + 24457 24550 94 2 1 96 78 77 0.966 6.52 1.04 Intr + 26694 27015 322 0 1 82 91 86 0.646 4.16 1.05 Term + 28031 28168 138 1 0 108 45 30 0.567 -1.34 1.06 PlyA + 29866 29871 6 1.05 2.04 PlyA - 30219 30214 6 1.05 2.03 Term - 40165 40071 95 2 2 105 42 50 0.443 0.09 2.02 Intr - 41410 41265 146 0 2 47 96 102 0.546 6.83 2.01 Init - 47061 47018 44 0 2 56 94 30 0.296 0.30 2.00 Prom - 48892 48853 40 -5.86 3.00 Prom + 49882 49921 40 -7.46 3.01 Init + 58658 58801 144 1 0 121 100 259 0.998 28.52 3.02 Intr + 60759 60963 205 2 1 86 68 365 0.990 33.07 3.03 Intr + 61707 61861 155 1 2 72 98 303 0.951 29.49 3.04 Intr + 69480 69641 162 2 0 116 92 230 0.999 26.37 3.05 Intr + 71135 71284 150 1 0 104 51 240 0.956 22.26 3.06 Intr + 72730 72819 90 0 0 132 105 185 0.999 24.79 3.07 Intr + 75882 76014 133 2 1 117 79 222 0.999 24.42 3.08 Intr + 77202 77290 89 1 2 98 109 107 0.885 13.49 3.09 Intr + 77461 77676 216 0 0 127 98 326 0.984 36.20 3.10 Intr + 80064 80197 134 2 2 67 81 199 0.989 16.54 3.11 Intr + 83350 83440 91 1 1 114 99 94 0.996 13.10 3.12 Intr + 85308 85478 171 2 0 100 76 229 0.970 23.04 3.13 Intr + 86104 86278 175 0 1 67 91 165 0.987 14.21 3.14 Intr + 87687 87811 125 1 2 72 77 162 0.998 13.80 3.15 Intr + 88034 88111 78 1 0 117 94 136 0.775 16.85 3.16 Intr + 90184 90244 61 0 1 111 81 44 0.921 4.31 3.17 Term + 91230 91399 170 1 2 75 55 252 0.983 18.64 3.18 PlyA + 91617 91622 6 1.05 4.00 Prom + 92509 92548 40 -3.46 4.01 Init + 93973 94057 85 0 1 45 80 18 0.661 -2.32 4.02 Term + 94266 94369 104 1 2 87 39 178 0.993 11.24 4.03 PlyA + 96285 96290 6 1.05 5.09 PlyA - 97897 97892 6 1.05 5.08 Term - 100204 99998 207 1 0 110 36 215 0.999 15.84 5.07 Intr - 106945 106811 135 1 0 83 88 190 0.999 19.26 5.06 Intr - 108119 107916 204 2 0 84 75 188 0.996 16.50 5.05 Intr - 112098 112016 83 1 2 94 94 37 0.924 4.26 5.04 Intr - 113469 113397 73 0 1 94 65 74 0.538 4.68 5.03 Intr - 124752 124691 62 1 2 104 86 -28 0.042 -2.95 5.02 Intr - 129047 128923 125 1 2 89 96 152 0.394 16.33 5.01 Init - 142860 142826 35 0 2 69 115 -11 0.363 -0.66 5.00 Prom - 153462 153423 40 -1.46 6.04 PlyA - 154777 154772 6 1.05 6.03 Term - 176178 175105 1074 0 0 -14 42 409 0.187 18.22 6.02 Intr - 176713 176466 248 2 2 46 41 216 0.096 9.88 6.01 Init - 178740 178341 400 0 1 55 -52 306 0.032 10.13 6.00 Prom - 193766 193727 40 -3.86 7.09 PlyA - 194310 194305 6 1.05 7.08 Term - 197889 197773 117 0 0 90 39 45 0.271 -1.66 7.07 Intr - 206393 206123 271 1 1 119 16 262 0.203 19.64 7.06 Intr - 211136 210940 197 2 2 71 71 76 0.193 2.41 7.05 Intr - 220348 220286 63 1 0 143 62 0 0.187 1.81 7.04 Intr - 222124 222089 36 1 0 89 106 -3 0.219 0.06 7.03 Intr - 224563 224407 157 0 1 83 82 52 0.763 4.11 7.02 Intr - 226727 226639 89 2 2 117 91 13 0.987 3.27 7.01 Init - 227194 227048 147 1 0 44 99 96 0.737 6.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:121241205_121470187|GENSCAN_predicted_peptide_1|224_aa MGLEDVESRIVAGEGTNYKAFDAQQILVLITSLSEASSRRASLWEGGGLLAKEDIKVLPS EEILISKTFTLRPLQISPPCSGPQFPPGESRGLGLVPSCEVKRGVPLDEGELGATRPAAD GRAWRSSPTRLPGRPAPRAAPARALPTPAARTTTPPPRRPEPAERSWTQQPLSPVPQQQD HISDQGTRTSHRSAVYSDHSLLESYPISISSISQLRKVKDPESL >gi568815589r:121241205_121470187|GENSCAN_predicted_CDS_1|675_bp atgggtctcgaggatgtagaaagtagaattgtggctggggaggggacaaactacaaggca ttcgatgctcaacaaatacttgtgctgataacttctctttctgaagccagctccaggaga gcctcactctgggaaggaggtggtcttctggcaaaagaagatatcaaagttcttccatca gaggagatccttatcagcaagacattcactctgaggcctcttcagatctcccctccgtgc tccgggcctcagtttcctccaggggaaagcagagggcttggtttggtgccttcctgcgag gtgaagcgaggggtccccctggacgaaggcgagcttggagccacccgcccagcagccgac gggagggcctggcgctcctcccccacccgcctgcccggccgccccgccccgagggcggct cccgcccgcgccctgcccaccccggccgcgcgcaccacaacgcccccgccccgccgcccg gaaccagctgagcgcagctggacccagcagccgctgtctccagtgccgcagcagcaggac cacatttctgaccagggcaccaggacctctcaccgctctgccgtttacagcgaccatagc ttgctagaatcttaccccatctccatttcctccatttcacagttgagaaaagtcaaggac ccagagagtttatga >gi568815589r:121241205_121470187|GENSCAN_predicted_peptide_2|94_aa MNTALDLRRDRKQARPFGNPWLLGHNILGTTKKPFHLSFGKEEQLRLWVAHVTIWKPAGP QQTRVTQSVLAHIQQHRTHYLPRLQDLLLEGLAA >gi568815589r:121241205_121470187|GENSCAN_predicted_CDS_2|285_bp atgaacacagctctggatcttcggagagacagaaagcaggccagaccttttgggaatccc tggcttttggggcacaacatcctggggaccaccaaaaagccctttcacctgagctttggg aaagaggaacagctccggctgtgggtggctcatgtcaccatctggaagccagctgggcca cagcagacgcgggtcacacaatcagtacttgcacacatccagcaacaccgcactcactac ctgccaaggctgcaagacctgctgctggagggtttggctgcgtaa >gi568815589r:121241205_121470187|GENSCAN_predicted_peptide_3|782_aa MAPHRPAPALLCALSLALCALSLPVRAATASRGASQAGAPQGRVPEARPNSMVVEHPEFL KAGKEPGLQIWRVEKFDLVPVPTNLYGDFFTGDAYVILKTVQLRNGNLQYDLHYWLGNEC SQDESGAAAIFTVQLDDYLNGRAVQHREVQGFESATFLGYFKSGLKYKKGGVASGFKHVV PNEVVVQRLFQVKGRRVVRATEVPVSWESFNNGDCFILDLGNNIHQWCGSNSNRYERLKA TQVSKGIRDNERSGRARVHVSEEGTEPEAMLQVLGPKPALPAGTEDTAKEDAANRKLAKL YKVSNGAGTMSVSLVADENPFAQGALKSEDCFILDHGKDGKIFVWKGKQANTEERKAALK TASDFITKMDYPKQTQVSVLPEGGETPLFKQFFKNWRDPDQTDGLGLSYLSSHIANVERV PFDAATLHTSTAMAAQHGMDDDGTGQKQIWRIEGSNKVPVDPATYGQFYGGDSYIILYNY RHGGRQGQIIYNWQGAQSTQDEVAASAILTAQLDEELGGTPVQSRVVQGKEPAHLMSLFG GKPMIIYKGGTSREGGQTAPASTRLFQVRANSAGATRAVEVLPKAGALNSNDAFVLKTPS AAYLWVGTGASEAEKTGAQELLRVLRAQPVQVAEGSEPDGFWEALGGKAAYRTSPRLKDK KMDAHPPRLFACSNKIGRFVIEEVPGELMQEDLATDDVMLLDTWDQVFVWVGKDSQEEEK TEALTSAKRYIETDPANRDRRTPITVVKQGFEPPSFVGWFLGWDDDYWSVDPLDRAMAEL AA >gi568815589r:121241205_121470187|GENSCAN_predicted_CDS_3|2349_bp atggctccgcaccgccccgcgcccgcgctgctttgcgcgctgtccctggcgctgtgcgcg ctgtcgctgcccgtccgcgcggccactgcgtcgcggggggcgtcccaggcgggggcgccc caggggcgggtgcccgaggcgcggcccaacagcatggtggtggaacaccccgagttcctc aaggcagggaaggagcctggcctgcagatctggcgtgtggagaagttcgatctggtgccc gtgcccaccaacctttatggagacttcttcacgggcgacgcctacgtcatcctgaagaca gtgcagctgaggaacggaaatctgcagtatgacctccactactggctgggcaatgagtgc agccaggatgagagcggggcggccgccatctttaccgtgcagctggatgactacctgaac ggccgggccgtgcagcaccgtgaggtccagggcttcgagtcggccaccttcctaggctac ttcaagtctggcctgaagtacaagaaaggaggtgtggcatcaggattcaagcacgtggta cccaacgaggtggtggtgcagagactcttccaggtcaaagggcggcgtgtggtccgtgcc accgaggtacctgtgtcctgggagagcttcaacaatggcgactgcttcatcctggacctg ggcaacaacatccaccagtggtgtggttccaacagcaatcggtatgaaagactgaaggcc acacaggtgtccaagggcatccgggacaacgagcggagtggccgggcccgagtgcacgtg tctgaggagggcactgagcccgaggcgatgctccaggtgctgggccccaagccggctctg cctgcaggtaccgaggacaccgccaaggaggatgcggccaaccgcaagctggccaagctc tacaaggtctccaatggtgcagggaccatgtccgtctccctcgtggctgatgagaacccc ttcgcccagggggccctgaagtcagaggactgcttcatcctggaccacggcaaagatggg aaaatctttgtctggaaaggcaagcaggcaaacacggaggagaggaaggctgccctcaaa acagcctctgacttcatcaccaagatggactaccccaagcagactcaggtctcggtcctt cctgagggcggtgagaccccactgttcaagcagttcttcaagaactggcgggacccagac cagacagatggcctgggcttgtcctacctttccagccatatcgccaacgtggagcgggtg cccttcgacgccgccaccctgcacacctccactgccatggccgcccagcacggcatggat gacgatggcacaggccagaaacagatctggagaatcgaaggttccaacaaggtgcccgtg gaccctgccacatatggacagttctatggaggcgacagctacatcattctgtacaactac cgccatggtggccgccaggggcagataatctataactggcagggtgcccagtctacccag gatgaggtcgctgcatctgccatcctgactgctcagctggatgaggagctgggaggtacc cctgtccagagccgtgtggtccaaggcaaggagcccgcccacctcatgagcctgtttggt gggaagcccatgatcatctacaagggcggcacctcccgcgagggcgggcagacagcccct gccagcacccgcctcttccaggtccgcgccaacagcgctggagccacccgggctgttgag gtattgcctaaggctggtgcactgaactccaacgatgcctttgttctgaaaaccccctca gccgcctacctgtgggtgggtacaggagccagcgaggcagagaagacgggggcccaggag ctgctcagggtgctgcgggcccaacctgtgcaggtggcagaaggcagcgagccagatggc ttctgggaggccctgggcgggaaggctgcctaccgcacatccccacggctgaaggacaag aagatggatgcccatcctcctcgcctctttgcctgctccaacaagattggacgttttgtg atcgaagaggttcctggtgagctcatgcaggaagacctggcaacggatgacgtcatgctt ctggacacctgggaccaggtctttgtctgggttggaaaggattctcaagaagaagaaaag acagaagccttgacttctgctaagcggtacatcgagacggacccagccaatcgggatcgg cggacgcccatcaccgtggtgaagcaaggctttgagcctccctcctttgtgggctggttc cttggctgggatgatgattactggtctgtggaccccttggacagggccatggctgagctg gctgcctga >gi568815589r:121241205_121470187|GENSCAN_predicted_peptide_4|62_aa MVKAINLTKSWYRSGIYHPKLLGLSREAGRKANQEHYHFNRTFYLSKVADVIVSYNGKII VK >gi568815589r:121241205_121470187|GENSCAN_predicted_CDS_4|189_bp atggtcaaagcaatcaacttgacaaagtcctggtacagaagtggcatttaccacccaaag cttctagggctttcacgagaggcaggcagaaaagccaatcaggagcactatcacttcaac cgcacattttatctttctaaagttgcagacgtcatcgtctcctacaatggaaaaatcatc gtgaaataa >gi568815589r:121241205_121470187|GENSCAN_predicted_peptide_5|307_aa MPRYISQCELLRAFPAAPGLATRTGECDCVSGSMAEKRHTRDSEAQRLPDSFKEIESPSR LLVDIFSIVPRSQKIIKEYERAIIFRLGRILQGGAKGPGLFFILPCTDSFIKVDMRTISF DIPPQEILTKDSVTISVDGVVYYRVQNATLAVANITNADSATRLLAQTTLRNVLGTKNLS QILSDREEIAHNMQSTLDDATDAWGIKVERVEIKDVKLPVQLQRAMAAEAEASREARAKV IAAEGEMNASRALKEASMVITESPAALQLRYLQTLTTIAAEKNSTIVFPLPIDMLQGIIG AKHSHLG >gi568815589r:121241205_121470187|GENSCAN_predicted_CDS_5|924_bp atgcccagatatatctcccaatgtgaacttctaagggcattcccggcggctccgggtttg gcaacgaggacgggggagtgcgactgcgtctcgggcagcatggccgagaagcggcacaca cgggactccgaagcccagcggctccccgactccttcaaggagatagagtccccaagcaga ttgttagttgatattttctccatagtgcccagatcccaaaagattataaaagagtatgaa agagccatcatctttagattgggtcgcattttacaaggaggagccaaaggacctggtttg ttttttattctgccatgcactgacagcttcatcaaagtggacatgagaactatttcattt gatattcctcctcaggagatcctcacaaaggattcagtgacaattagcgtggatggtgtg gtctattaccgcgttcagaatgcaaccctggctgtggcaaatatcaccaacgctgactca gcaacccgtcttttggcacaaactactctgaggaatgttctgggcaccaagaatctttct cagatcctctctgacagagaagaaattgcacacaacatgcagtctactctggatgatgcc actgatgcctggggaataaaggtggagcgtgtggaaattaaggatgtgaaactacctgtg cagctccagagagctatggctgcagaagcagaagcgtcccgcgaggcccgcgccaaggtt attgcagccgaaggagaaatgaatgcatccagggctctgaaagaagcctccatggtcatc actgaatctcctgcagcccttcagctccgatacctgcagacactgaccaccattgctgct gagaaaaactcaacaattgtcttccctctgcccatagatatgctgcaaggaatcataggg gcaaaacacagccatctaggctag >gi568815589r:121241205_121470187|GENSCAN_predicted_peptide_6|573_aa MALKAKARELREECRSLRSQCDQLEERVSVMEDEMNEMKQEGKFREKRIKRNEQSLQEIW DYVKSLNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQRNTENATKILLE KSNSKTHNCQIHQKIQTTIKDYYKHLYTNKLENLEEMDKFLDTYTLPRLNQEEVESLNRL ITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELHINRTKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NVVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLIRNFSKV SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL LNEIKEDTKKWKNIPCSWVGRINIVKMAILPKVIYRFDAIPIKLPMTFFTELEKTTLKFI WNQKRARITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIT PHIYNYLTSLTNLRKTSNGERIPYLINGAGKTG >gi568815589r:121241205_121470187|GENSCAN_predicted_CDS_6|1722_bp atggcgctgaaagccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccaa tgcgatcaactggaagaaagggtatcagtgatggaagatgaaatgaatgaaatgaagcaa gaagggaagtttagagaaaaaagaataaaaagaaacgaacaaagcctccaagaaatatgg gactatgtgaaaagtctaaatctacgtctgattggtgtacctgaaagtgacggggagaat ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaatctagca aggcaggccaacattcagattcagagaaatacagagaacgccacaaagatactcctcgag aagagcaactccaagacacataattgtcagattcaccaaaaaatacaaactaccatcaaa gactactacaaacacctctacacaaataaactagaaaatctagaagaaatggataaattc ctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctgaatagacta ataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtccagga ccagatggattcacagccgaattctaccagaggtacaaggaggaactgcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacgtagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatcta gaaaaccccatcgtctcagcccaaaatctccttaagctgatacgcaacttcagcaaagtc tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ttaggaatccaacttacaagggacgtgaaggacctcttcaaggagaactacaaaccactg ctcaatgaaataaaagaggatacaaagaaatggaagaacattccatgctcatgggtagga agaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcgatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata tggaaccaaaaaagagcccgcatcaccaagtcaatcctaagccaaaagaacaaagctgga ggcatcacactacccgacttcaaactatactacaaggctacagtaaccaaaacagcatgg tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacg ccgcatatttacaactatctgacatctttgacaaacctgagaaaaacaagcaatggggaa aggattccctatttaataaatggtgctgggaaaactggctag >gi568815589r:121241205_121470187|GENSCAN_predicted_peptide_7|358_aa MWHIYTIEYYAAIKKDEFMSFAGTWMKLETIILSKLTQEQKTKHRMFSLEKIMNVKGKVI LSMLVVSTVIIVFWEFINSSGHYNKVPQGGLSNRNLILSVLEAGESKIPSEGSSWLADGH LLPVSTRDGEQHRRLFLVDISLKGPQDELRAGGYPREERLRESQSLPHNGKAALTQGTLR FITRKHPEVVTVTRWKAPVVWEGTFNKAILGNYYAKQKITVRLMVFAIGRYNDHYLEEFI TSANRYFMVGHKVIFYIMVDDVSKLPFIELGPLHSFKMFEVKPEKRWQDISMMRMKITGE HILAHIQHEVDFLFCMDVDQVNGKQLDTHMCGTLLAGVNASFSEIGDMKCWLSNLAVP >gi568815589r:121241205_121470187|GENSCAN_predicted_CDS_7|1077_bp atgtggcacatatacaccatagaatactatgcagccataaaaaaggatgagttcatgtcc tttgctgggacatggatgaagctggaaaccatcattctcagcaaactaacacaggaacag aaaaccaaacaccgcatgttctcactcgagaaaataatgaatgtcaaaggaaaagtaatt ctgtcaatgctggttgtctcaactgtgatcattgtgttttgggaatttatcaacagttca ggccactataacaaagtaccacaaggtggtttaagcaacaggaatttaattctttcagtt ctagaggctggggagtccaagatccccagtgagggctcttcctggctggcagatggccac ctcctccctgtatccacacgtgatggagagcaacacagaaggctctttcttgtggatata tcactcaaaggcccccaggatgagctgagggctgggggctacccaagggaggagagattg agggagtctcagtctctgccccacaatggcaaagctgccctgactcagggaactctacgt tttatcaccaggaaacacccagaggttgtgacagtgaccagatggaaggcgccggttgtg tgggaaggcactttcaacaaagccatcctaggaaattattatgccaaacagaaaattacc gtgcggttgatggtttttgctattggaagatataatgatcattacttggaggagttcata acatctgctaataggtacttcatggttggccacaaagtcatattttacatcatggtggat gatgtctccaagctgccgtttatagagctgggtcctctgcattccttcaaaatgtttgag gtcaagccagagaagaggtggcaagacatcagcatgatgcgtatgaagatcactggggag cacatcttggcccacatccaacacgaggtcgacttcctcttctgcatggatgtggaccag gtaaatggcaagcagctggacacccacatgtgtgggacattgcttgcaggagttaatgcc tctttctcagagataggggacatgaaatgctggttgtcaaacttggctgtaccctag