GENSCAN 1.0    Date run: 3-Nov-116    Time: 00:52:41

Sequence gi568815578r:4756975_5032562 : 275588 bp : 45.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Intr +   2337   2394   58  0  1   51   92    58 0.329   1.39
1.02  Intr +   4348   4450  103  0  1   67   63    60 0.569   1.15
1.03  Term +   6839   6921   83  0  2  100   49    70 0.752   2.16
1.04  PlyA +   7687   7692    6                               1.05

2.16  PlyA -  11254  11249    6                               1.05
2.15  Term -  15396  15288  109  2  1   73   38    66 0.102  -1.92
2.14  Intr -  19908  19778  131  0  2   53   63    87 0.170   2.19
2.13  Intr -  25556  25484   73  1  1   77   78    65 0.295   3.81
2.12  Intr -  29354  29257   98  2  2  126  108    86 0.992  13.21
2.11  Intr -  30780  30659  122  1  2   77   86   216 0.938  20.51
2.10  Intr -  31294  31243   52  1  1   59  103    83 0.795   5.38
2.09  Intr -  32723  32652   72  2  0   77   34   140 0.784   7.10
2.08  Intr -  33637  33477  161  2  2   80  116   215 0.999  23.21
2.07  Intr -  35653  35565   89  0  2   98   88    92 0.619   9.81
2.06  Intr -  38992  38841  152  1  2   82   84   200 0.999  17.96
2.05  Intr -  41111  41036   76  1  1   93   94    90 0.934   9.62
2.04  Intr -  44402  44230  173  2  2   67   52     2 0.190  -6.76
2.03  Intr -  45373  45252  122  2  2   43   40   109 0.304   1.81
2.02  Intr -  49005  48813  193  0  1   50   49   176 0.526   8.97
2.01  Init -  49773  49723   51  0  0   71   78    44 0.403   0.86
2.00  Prom -  50001  49962   40                              -5.96

3.00  Prom +  52439  52478   40                              -1.26
3.01  Init +  56558  56764  207  1  0   76   33   108 0.053   2.96
3.02  Intr +  64073  64195  123  1  0   32   94   105 0.087   6.28
3.03  Intr +  68990  69041   52  1  1   79   99    46 0.034   3.28
3.04  Intr +  78379  78502  124  2  1   11   78    83 0.049  -0.76
3.05  Intr +  82646  82769  124  2  1   74   93    29 0.461   2.59
3.06  Term +  83857  83877   21  0  0   94   55    32 0.423  -1.19
3.07  PlyA +  86261  86266    6                               1.05

4.13  PlyA -  86619  86614    6                               1.05
4.12  Term - 100230  99998  233  1  2   19   38   213 0.884   6.04
4.11  Intr - 102410 102315   96  0  0  118   98    85 0.995  12.48
4.10  Intr - 105111 104974  138  1  0   86   94    69 0.937   7.74
4.09  Intr - 105933 105804  130  0  1  111   91   139 0.996  16.77
4.08  Intr - 110901 110796  106  2  1   80   81    53 0.978   4.02
4.07  Intr - 113079 112932  148  1  1   88   98   256 0.997  25.89
4.06  Intr - 117118 116962  157  1  1   66  103   142 0.998  13.08
4.05  Intr - 117722 117602  121  1  1  138   76    -5 0.988   3.80
4.04  Intr - 126849 126668  182  0  2  118   87   138 0.717  15.37
4.03  Intr - 127849 127779   71  2  2   49   83    82 0.897   2.70
4.02  Intr - 128935 128847   89  0  2   92   91    51 0.939   5.41
4.01  Init - 129211 129150   62  1  2   70   35    82 0.611   0.14
4.00  Prom - 132858 132819   40                              -2.66

5.06  PlyA - 133347 133342    6                               1.05
5.05  Term - 140152 139973  180  1  0   67   55   178 0.972  10.01
5.04  Intr - 142222 142153   70  0  1  101   84     1 0.923   0.18
5.03  Intr - 142738 142581  158  1  2  114   94   140 0.998  16.01
5.02  Intr - 145584 145468  117  0  0   93  110    54 0.994   8.76
5.01  Init - 156049 155906  144  1  0   53   75   196 0.585  14.92
5.00  Prom - 157071 157032   40                              -5.96

6.00  Prom + 157198 157237   40                              -3.46
6.01  Init + 172339 172417   79  0  1   68   81   124 0.952   8.92
6.02  Term + 176857 176975  119  2  2   73   41    85 0.808   1.00
6.03  PlyA + 177015 177020    6                               1.05

7.00  Prom + 180201 180240   40                              -2.46
7.01  Init + 181641 181690   50  2  2  118  115    -4 0.904   5.72
7.02  Term + 192589 192982  394  1  1   55   53   134 0.458   1.11
7.03  PlyA + 194214 194219    6                               1.05

8.06  PlyA - 197265 197260    6                               1.05
8.05  Term - 212826 212726  101  1  2   37   47    86 0.026  -2.31
8.04  Intr - 213945 213819  127  0  1   90   82    47 0.031   4.55
8.03  Intr - 244524 244367  158  1  2   82   32    90 0.037   2.53
8.02  Intr - 256763 256593  171  0  0   30   48   117 0.009   1.81
8.01  Init - 272820 272763   58  0  1   86   58    98 0.911   6.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  96026  95956   71  0  2   83   48    99 0.840   3.50
S.002 Init + 216727 216789   63  0  0   82   71    74 0.840   6.27
S.003 Term - 271284 271190   95  1  2   85   49    62 0.879  -0.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_1|81_aa
XRWRFPESISCSSVERYRCWERAMPEPTEVGSGAALEVRACNAHVNAKDGSFSASTLELT
AMCSEENEANQPKANPKHPMA

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_1|246_bp
nngaggtggcgctttccagaaagcatcagctgtagtagcgtggagaggtaccggtgttgg
gagagggcaatgcctgagcccacggaagttgggtcaggggcagcattggaagtcagagcc
tgcaatgctcacgtaaatgcaaaagatggttccttcagtgccagtaccttggagttgaca
gccatgtgctcagaggagaatgaagcaaaccaaccaaaagccaaccccaaacacccaatg
gcctga

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_2|557_aa
MDRLQWLTVIPALWEAEACCTSGYLHWKFPWVCLKLSPGPHFSNCEGGTSAYVAVLASSL
CVTRTLPVTPRFIRQHTLSPKVLVIITGVVLLASSEQNPEMLLNTCNAQDSPTTKNSPRP
GARGGLISHPWNQCGCQCALWLGHLCLDCPRVVVFPLDVCSQRLWQAEAAASSVAPHSLG
NELLLHLKTYNLYYEGQNLQLRHREEEDEFIVEGLLNISWGLRRPIRLQMQDDNERIRPP
PSSSSWHSGCNLGAQGTTLKPLTVPKVQISEVDAPPEGDQMPSSTDSRGLKPLQEDTPQL
MRTRSDVGVRRRGNVRTPSDQRRIRRHRFSINGHFYNHKTSVFTPAYGSVTNVRINSTMT
TPQIENSAEEFALYVVHTSGEKQKLKATDYPLIARILQGPCEQISKVFLMEKDQVEEVTY
DVAQYIKFEMPVLKSFIQKLQEEEDREVKKLMRKDSSTSPVGHGVEAQKGFCKLHRTGSL
SPSKLPLMPPTLTCRAIYGVNSTESGVSYNILPWDNPGVLRRKPITETMSLAKEEGFHQV
LQLRRRKISLKSISLSN

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_2|1674_bp
atggaccggctgcagtggctcactgtaatcccagcactttgggaggctgaggcctgctgt
acttctggttatctgcactggaagtttccatgggtctgcctgaaactcagtcctggtcct
cacttcagcaactgtgaaggtggcacctctgcctacgttgcggtcctcgccagcagcctg
tgcgtcaccaggactctgcctgtcaccccacgtttcatccggcagcacaccctttcccca
aaggttttggtcatcatcactggggtagtgctgctggcatctagtgagcagaatccagag
atgctgctaaacacctgcaatgcacaagacagtcccacgacaaagaacagtccaaggcca
ggggcgagaggaggactcatcagtcatccttggaaccaatgtggctgtcagtgtgctctg
tggctggggcatttgtgcttggattgtccaagggttgtcgtctttcctttggatgtttgt
tctcagaggctgtggcaagccgaggcagcagccagcagtgttgcccctcacagccttgga
aatgaacttctcttgcatctgaagacctacaacttgtactatgaaggccagaatttacag
ctccggcaccgggaggaagaagacgagttcattgtggaggggctcctgaacatctcctgg
ggcctgcgccggcccattcgcctgcagatgcaggatgacaacgaacgcattcgaccccct
ccatcctcctcctcctggcactctggctgtaacctgggggctcagggaaccactctgaag
cccctgactgtgcccaaagttcagatctcagaggtggatgccccgccggagggtgaccag
atgccaagctccacagactccaggggcctgaagcccctgcaggaggacaccccacagctg
atgcgcacacgcagtgatgttggggtgcgtcgccgtggcaatgtgaggacgcctagtgac
cagcggcgaatcagacgccaccgcttctccatcaacggccatttctacaaccataagaca
tccgtgttcacaccagcctatggctctgtcaccaacgtccgcatcaacagcaccatgacc
accccacagattgagaattcagcagaggagtttgccttgtacgtggtccatacgagtggt
gagaaacagaagctgaaggccaccgattacccgctgattgcccgaatcctccagggccca
tgtgagcagatctccaaagtgttcctaatggagaaggaccaggtggaggaagtcacctac
gacgtggcccagtatataaagttcgagatgccggtacttaaaagcttcattcagaagctc
caggaggaagaagatcgggaagtaaagaagctgatgcgcaaggacagcagcaccagccct
gtggggcatggagtggaagcccagaagggcttctgcaagctgcacagaactgggtcacta
agcccctctaagctgcccttgatgccacctaccctcacctgcagagccatctatggggtc
aacagcaccgagagtggggtatcttacaacatccttccctgggacaacccaggggttctg
agaagaaaaccaatcactgagacaatgagccttgccaaggaagaaggctttcatcaggtg
ctgcagctaagaagacggaagatcagtctcaaatccatctccctgagcaactaa

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_3|216_aa
MASQTGGAATCLQGSPTGIHKCLTFRMDVAEGNSERLATNQRGRALERDEKGSIKEAQTP
EKSSCVCVGLTRSAIPKQLTYGTQTRSLAPGHQEIDQEVKAKPIATGMSQASTTRGQTKR
LMAQEATGLKGEMGKDILGRGSSKFKAMRSLARPENSVWSSVGGTEGGSCRGMQVGQGDR
EQESECTVRQGRGTPPTPRHHTVAPCSMGLDIPAVS

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_3|651_bp
atggcctcgcagaccggaggagcagccacatgtttacagggaagccccactggcatccac
aagtgcctgacctttaggatggatgttgcagaaggaaattcagagcgcttagccactaat
cagagaggcagggcgctggagcgggacgaaaaaggaagcataaaggaagcacagactcca
gaaaaaagcagctgtgtgtgcgtgggtctgactcgctcagccattcctaagcagctcacg
tatggaacgcagacacgaagcctggctccggggcaccaggaaatagaccaagaagtcaaa
gcaaaaccaatagcaactgggatgtctcaggcttctaccaccagaggccagaccaagagg
ctcatggcccaggaagcaacaggtcttaagggtgagatggggaaggacattctaggcaga
ggcagcagcaagttcaaagccatgaggagcctggctcgaccagagaacagcgtgtggtcc
agcgtgggtggaacagaaggtgggagctgcaggggaatgcaagtgggccagggggataga
gaacaggaatctgaatgcacagtcagacagggccgtggaacacccccaacccctaggcac
catactgtggccccatgctctatggggctggatatacctgctgtcagctga

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_4|510_aa
MAGAGVGVLALTTGPTLELPGLPLFQASAFAFLAPARAILSLDKWKCNTTDVSVANGTAE
LLHTEHIWYPRIREIQGAIIMSSLIEVVIGLLGLPGALLKYIGPLTITPTVALIGLSGFQ
AAGERAGKHWGIAMLTIFLVLLFSQYARNVKFPLPIYKSKKGWTAYKLQLFKMFPIILAI
LVSWLLCFIFTVTDVFPPDSTKYGFYARTDARQGVLLVAPWFKVPYPFQWGLPTVSAAGV
IGMLSAVVASIIESIGDYYACARLSCAPPPPIHAINRGIFVEGLSCVLDGIFGTGNGSTS
SSPNIGVLGITKVGSRRVIQCGAALMLALGMIGKFSALFASLPDPVLGALFCTLFGMITA
VGLSNLQFIDLNSSRNLFVLGFSIFFGLVLPSYLRQNPLVTGITGIDQVLNVLLTTAMFV
GGCVAFILDNTIPGTPEERGIRKWKKGVGKGNKSLDGMESYNLPFGMNIIKKYRCFSYLP
ISPTFVGYTWKGLRKSDNSRSSDEDSQATG

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_4|1533_bp
atggcaggggccggtgttggggtgttggcactgaccacggggcccaccctggagctgcct
gggttacccctgtttcaggccagtgcttttgcatttttggcccctgctcgagccatcctg
tctttagataaatggaaatgtaacaccacagatgtttcagttgccaatggaacagcagag
ctgttgcacacagaacacatctggtatccccggatccgagagatccagggggccatcatc
atgtcctcactgatagaagtagtcatcggcctcctcggcctgcctggggctctactgaag
tacatcggtcccttgaccattacacccacggtggccctaattggcctctctggtttccag
gcagcgggggagagagccgggaagcactggggcattgccatgctgacaatattcctagta
ttactgttttctcaatacgccagaaatgttaaatttcctctcccgatttataaatccaag
aaaggatggactgcgtacaagttacagctgttcaaaatgttccctatcatcctggccatc
ctggtatcctggctgctctgcttcatcttcacggtgacagatgtcttccctcccgacagc
acaaagtatggcttctatgctcgcacagatgccaggcaaggcgtgcttctggtagccccg
tggtttaaggttccatacccatttcagtggggactgcccaccgtgtctgcggccggtgtc
atcggcatgctcagtgccgtggtcgccagcatcatcgagtctattggtgactactacgcc
tgtgcacggctgtcctgtgccccacccccccccatccacgcaataaacaggggaattttc
gtggaaggcctctcctgtgttcttgatggcatatttggtactgggaatggctctacttca
tccagtcccaacattggagttttgggaattacaaaggtcggcagccgccgcgtgatacag
tgcggagcagccctcatgctcgctctgggcatgatcgggaagttcagcgccctctttgcg
tcccttccggatcctgtgctgggagccctgttctgcacgctctttggaatgatcacagct
gttggcctctctaacctgcagttcattgatttaaattcttcccggaacctctttgtgctt
ggattttcgatcttctttgggctcgtccttccaagttacctcagacagaaccctctggtc
acagggataacaggaatcgatcaagtgttgaacgtccttctcacaactgctatgtttgta
gggggctgtgtggcttttatcctggataacaccatcccaggcactccagaggaaagagga
atccggaaatggaagaagggtgtgggcaaagggaacaaatcactcgacggcatggagtcg
tacaatttgccatttggcatgaacattataaaaaaatacagatgcttcagctacttaccc
atcagcccaacctttgtgggctacacatggaaaggcctcaggaagagcgacaacagccgg
agttcagatgaagactcccaggccacgggatag

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_5|222_aa
MWFTLKQPLSGLSLQVVINGGATSSGEQDNEDTELMAIYTTENGIAEKSSLAETLDSTGS
LDPQRSDMIYTIEDVPPWYLCIFLGLQHYLTCFSGTIAVPFLLADAMCVGYDQWATSQLI
GTIFFCVGITTLLQTTFGCRALLLDSLCAPHPVPSFYFHTLLRVRGLAIVGRNGDIEAHG
LQHEGQRHQGDTVHPPGDLVGALEEVVFETGPAGSKGSAQEG

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_5|669_bp
atgtggttcacgctgaaacagcctctaagtggattgtctctgcaggtggtgataaatgga
ggcgccacctccagcggtgagcaggacaatgaggacactgagctcatggcgatctacact
acggaaaacggcattgcagaaaagagctctctcgctgagaccctggatagcactggcagt
ctggacccccagcgatcagacatgatttataccatagaagatgttcctccctggtacctg
tgtatatttctggggctacagcactacctgacatgcttcagcggcacgatcgcagtgccc
ttcctgttggccgatgccatgtgtgtggggtacgaccagtgggccaccagccagctcatt
gggaccattttcttctgtgtgggaatcactactttgctacagacaacgtttggatgcagg
gcactgctcttggactcactttgtgccccacacccagtgccatctttctacttccacacc
ctcctcagggtgcggggcctggccattgtggggcgcaatggtgacatcgaagcacatggc
ttgcagcacgaagggcagcgccatcagggagacacagtgcatcctccaggggacttggtg
ggagccctggaggaggtggtgtttgagacaggccctgcaggctccaagggctctgcccag
gagggctga

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_6|65_aa
MGLTLGPAGLSLEALSQAASAPAGEQGIKNPTKLPCKCEDRTAISLEIQVQNLLFEELFM
GHFIQ

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_6|198_bp
atggggctgacactgggacctgctgggctgagcctggaagccctaagccaggcagcctcg
gctccagcaggggaacaaggaataaagaatccaaccaagctgccctgcaaatgtgaggac
aggactgcaataagtttggaaatacaagtacagaacttgctgtttgaggaactcttcatg
ggacacttcattcaataa

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_7|147_aa
MPGPSFHLDGSKHLSKCTKKPFGDFLVMQLCKTENFSPTYTGTVTHTQARTHTETDIYVE
NQKHHQRYKHSDTHMETDTRRPKCTQTQTDSNQSVYPHNSFPKAEPPNGLTFKVKLHPST
EMSSNQCVTHHFCAGVCATLRLSWYSA

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_7|444_bp
atgccgggcccctcattccatcttgatggctctaaacacttgtccaaatgcactaagaaa
ccctttggagattttctagtaatgcagctgtgcaaaactgagaacttctcacccacctac
acaggcacagtcacacacacacaggcgcgcacacacacagaaacagacatatatgtagaa
aatcagaaacatcaccagagatacaaacactcagatactcacatggaaacagatacacgc
agacccaaatgcacacagacacaaacagatagtaatcaatctgtatatcctcacaactca
ttccccaaagcagaaccgccaaatggccttaccttcaaagtgaaactccacccttccaca
gaaatgtcctctaatcagtgcgtcacccaccacttctgtgccggtgtctgtgcaaccctg
cgattgtcctggtactcagcatga

>gi568815578r:4756975_5032562|GENSCAN_predicted_peptide_8|204_aa
MGFLHAGLAGLAGLELLTSGVASTCRIMLVSLWTYMKCSTSNCIISIKDYTSIRINVAEV
DKVTSRFNGQFKTYAFGGCRGDRRRRLQRALRRRGAAAGREPRELQPGESPAAAPPARPA
GRTCRRGRELGLDSTNGDFSYGFELGPWCISKCNSANCNGKHMDRSTPHHLEKPQVFHIC
RYILSDTLLLDVVVHAYSPSYSGD

>gi568815578r:4756975_5032562|GENSCAN_predicted_CDS_8|615_bp
atggggtttctccatgctggtctggctggtctggctggtctcgaactcctgacctcaggt
gtggcctcaacatgcagaataatgctagtgagtttgtggacgtatatgaaatgctccacc
agcaactgcatcatcagcatcaaagactacacctccatccggataaatgtggctgaggtt
gataaagtcacgagcaggtttaatggccagtttaaaacctatgctttcggcggctgtcgc
ggggaccggaggcggaggctgcagcgggcactgcggcgccgaggcgcggcggcaggacgg
gaaccacgcgagctgcagccaggtgagagcccggccgccgcgccccctgcccggcctgcg
ggccgcacctgccggcggggccgcgagctaggtcttgactccactaacggggacttctca
tacggctttgaacttggaccttggtgcataagcaaatgtaactcagcgaattgtaatgga
aagcatatggacaggtctacgccacatcacttagaaaagccgcaagtgtttcacatctgc
cgatacattctctcagatactctgcttctggatgtggtggtgcacgcctatagtcccagc
tactctggagactga