GENSCAN 1.0	Date run: 1-Aug-119	Time: 17:04:23

Sequence gi568815576f:38884071_39091515 : 207445 bp : 49.39% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -    162     53  110  1  2   46  100    76 0.087   4.60
 1.01 Init -  23089  23080   10  1  1  107   81     2 0.451   2.17
 1.00 Prom -  25302  25263   40                              -2.06

 2.00 Prom +  32405  32444   40                              -0.66
 2.01 Init +  34029  34082   54  2  0   82  113    58 0.783   6.99
 2.02 Intr +  38527  38651  125  0  2   51  100    23 0.207  -0.82
 2.03 Intr +  52667  52771  105  2  0  103   75    36 0.000   3.13
 2.04 Term +  62778  63198  421  0  1   31   47   201 0.000   5.06
 2.05 PlyA +  65107  65112    6                               1.05

 3.00 Prom +  67171  67210   40                              -3.76
 3.01 Init +  68739  68780   42  2  0   83  110    51 0.820   7.22
 3.02 Intr +  73063  73227  165  0  0   65   37    90 0.461   1.76
 3.03 Intr +  73293  73354   62  2  2   98   29    55 0.450  -1.87
 3.04 Intr +  75526  75675  150  1  0  -25   43   255 0.000   8.98
 3.05 Intr +  76159  76183   25  1  1   65   95   -29 0.286  -6.47
 3.06 Intr +  77317  77611  295  0  1  123   95   247 0.597  25.48
 3.07 Intr +  78028  78143  116  2  2  127   94    63 0.997  10.97
 3.08 Intr +  78358  78583  226  0  1  119   15    55 0.544  -1.14
 3.09 Intr +  82197  82249   53  1  2   94  111    33 0.716   4.83
 3.10 Intr +  87440  87535   96  1  0   47  115    46 0.360   3.41
 3.11 Intr +  89747  89850  104  1  2  119   53    57 0.755   4.27
 3.12 Intr + 100005 100161  157  0  1   97   81   160 0.971  16.21
 3.13 Intr + 101742 102021  280  2  1  109  100   207 0.669  21.05
 3.14 Intr + 102228 102342  115  1  1   92   94    79 0.999   8.41
 3.15 Intr + 105387 105540  154  0  1  101   45   187 0.995  15.77
 3.16 Intr + 107262 107556  295  2  1  123   95   294 0.603  30.18
 3.17 Intr + 107964 108079  116  1  2  120   94    80 0.997  11.97
 3.18 Intr + 113153 113294  142  1  1   86   70    51 0.058   3.03
 3.19 Intr + 127008 127101   94  1  1   68   53    73 0.066   0.82
 3.20 Intr + 131525 131681  157  2  1   92   69   146 0.977  13.11
 3.21 Intr + 133696 133975  280  0  1  116  100   237 0.998  24.75
 3.22 Intr + 134199 134313  115  1  1  120   53    29 0.557   2.11
 3.23 Intr + 138752 138944  193  2  1   97   99   169 0.892  18.39
 3.24 Intr + 141000 141279  280  2  1  115  100   238 0.986  24.75
 3.25 Intr + 141487 141601  115  2  1   80   94    83 0.992   7.61
 3.26 Intr + 145293 145449  157  0  1   91   69   125 0.993  10.91
 3.27 Intr + 147624 147903  280  2  1  148  100   212 0.999  25.45
 3.28 Intr + 148128 148242  115  1  1  131   53    31 0.533   3.41
 3.29 Intr + 158867 159020  154  2  1   90   81   132 0.998  12.77
 3.30 Intr + 160871 161150  280  1  1  109  100   220 0.658  22.35
 3.31 Intr + 161358 161472  115  1  1  108   94    83 0.979  10.41
 3.32 Intr + 165355 165511  157  1  1   78   69   102 0.926   7.31
 3.33 Intr + 168004 168283  280  0  1  144  100   242 0.999  28.05
 3.34 Term + 168507 168625  119  1  2  125   55    46 0.872   3.70
 3.35 PlyA + 169547 169552    6                               1.05

 4.00 Prom + 170742 170781   40                              -4.06
 4.01 Init + 182539 182601   63  0  0   89   72    62 0.594   5.85
 4.02 Intr + 186323 186409   87  1  0  137   41    46 0.091   4.97
 4.03 Intr + 194916 195015  100  2  1   28   81   119 0.006   4.88
 4.04 Intr + 196834 197157  324  2  0  120   93   245 0.025  23.95
 4.05 Intr + 197401 197515  115  2  1  102   94     7 0.993   2.21
 4.06 Intr + 199661 199814  154  2  1   80   60    96 0.899   6.07
 4.07 Intr + 202209 202497  289  2  1   94   69   209 0.989  16.42
 4.08 Intr + 202941 203056  116  1  2  120   94    68 0.996  10.77
 4.09 Term + 203337 203351   15  2  0   99   49     3 0.807  -4.26
 4.10 PlyA + 203402 203407    6                               1.05

 5.03 PlyA - 203545 203540    6                               1.05
 5.02 Term - 203761 203696   66  1  0   87   43    71 0.857   0.44
 5.01 Intr - 203950 203861   90  1  0  118   65    55 0.404   6.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  62782  63198  417  0  0   49   47   209 0.865   9.21
S.002 Intr + 194862 195015  154  0  1   91   81   140 0.990  13.67
S.003 Intr + 196863 197157  295  2  1  122   93   238 0.963  24.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:38884071_39091515|GENSCAN_predicted_peptide_1|40_aa
MGEGTGTSQPISYQQAFDIVNPLKGPPQDPREGDEEEKAK

>gi568815576f:38884071_39091515|GENSCAN_predicted_CDS_1|120_bp
atgggggaaggaacaggaacctcacagccgatctcctatcagcaagcttttgatattgtc
aatccccttaaggggccccctcaagaccccagggaaggggatgaggaggaaaaggcaaag

>gi568815576f:38884071_39091515|GENSCAN_predicted_peptide_2|234_aa
MGWVWWLMAVIPALLEAKASWLVQEWMPDAGCTNQILSSGDLESRTSGYHRLALAVRPGR
KLSLASGAAYICRQRTLLPHGWACRFCGLVTLTCTEMRALGTTGTEPPGPGESKEGFLEE
EWQSGPLLLQQECPQLLLGKGLCWLGLPLGQSGWLLPLPSEGSSGESCHARTPFPIIPNP
LHWFLPADVKIKSWPDFPLSFRFYPCALRPPRSPLRLLICDLEEIARVLGGMPA

>gi568815576f:38884071_39091515|GENSCAN_predicted_CDS_2|705_bp
atgggctgggtgtggtggctcatggctgtaatcccagcccttttggaggccaaggcttcc
tggttggtccaggagtggatgcctgatgcaggctgcacaaatcagattctctcttctgga
gatttagaatcgagaaccagtggttatcataggttagccttggctgtcagacctggaagg
aagctctctttggcctctggagctgcgtatatctgtagacagcggaccctgctgccacat
ggctgggcctgccgcttctgtggcttggtcacactcacatgcacggaaatgagagcactg
gggactactggaacagaaccgcctggcccgggagaatccaaggaaggcttcctggaggag
gaatggcagagcggacctctgctactgcagcaggaatgcccccagctgctgctggggaag
ggcttatgttggctgggtttgcctttggggcagagtggctggctgctccccctgcccagc
gagggtagttctggggagagttgccacgctcggaccccatttcccatcatccccaaccct
ttgcattggttcttgccagcagatgtgaagataaaaagctggccagacttcccactctcc
ttccgcttctatccctgtgcgctgaggcctccgaggagccctctgaggcttctgatttgc
gacctggaggaaattgcccgtgtgctcggaggaatgcctgcatga

>gi568815576f:38884071_39091515|GENSCAN_predicted_peptide_3|1827_aa
MESSLDKRHTVERQPGKKMQDPNREHRMQQVPGSLDQAHPALATGEDTGVAVLEPALWTA
ECYSGSFSRGAKCGQLYIPIVTPQCLVVQWHKTYLCYEVERLDNGTSVKMDQHRGFLHNQ
VTDPAIRIRAGPFQSRDIHSTASVWRDQAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIY
RVTWFISWSPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYDPLYKEALQMLRDAGAQV
SIMTYDEFKHCWDTFVDHQGCPFQPWDGLDEHSQALSGRLRAILQATSLCPLSTLSPPAP
FNPPALPESGKLKDGPQSLRKAETWVEQQNKRSSSKKCKQTVHHHLQLLTDASKAVCSRS
KSSVRAVVATGDPGTFAPLELPHRMASGFQNTKSGQDSAYITLANALLAKGPVKVRLQDP
QATLEETLKLHFVTQESVPCRAWARNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKI
KRGRSNLLWDTGVFRGQVYFKPQYHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCV
AKLAEFLSEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYCWENFV
YNEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVE
RLDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFISW
SPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYDPLYKEALQMLRDAGAQVSIMTYDEF
EYCWDTFVYRQGCPFQPWDGLEEHSQALSGRLRAILQAQTKHIAQRSCMEATKMETQGLG
PSGRDLHTIIAPPTMPRVPNTSPPENFTFKRSASARPKYRLGSEERLGPAEVPSGKNPMK
AMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVDSETHCHAERC
FLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIFTARLYYFQYPC
YQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ
NPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGPVLPKRQSN
HRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKFLAEHPNV
TLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEDFAYCWENFVCNEGQPFMPWYKF
DDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTMEVTKHHSAVFR
KRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSWSPCPECAGEVAEFLAR
HSNVNLTIFTARLCYFWDTDYQEGLCSLSQEGASVKIMGYKDFVSCWKNFVYSDDEPFKP
WKGLQTNFRLLKRRLREILQNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTKGPSR
PRLDAKIFRGQVYSQPEHHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEF
LAEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVKIMDDEEFAYCWENFVYSEGQP
FMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESWLCFTMEVVK
HHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSWSPCPECAGE
VAEFLARHSNVNLTIFTARLYYFWDTDYQEGLRSLSQEGASVEIMGYKDFKYCWENFVYN
DDEPFKPWKGLKYNFLFLDSKLQEILE

>gi568815576f:38884071_39091515|GENSCAN_predicted_CDS_3|5484_bp
atggaatcttccctggacaagcgacataccgtggagagacagccagggaagaagatgcag
gaccctaacagagagcacaggatgcagcaggtgccagggagcctggaccaggcacatcct
gcactggccacaggggaggacacaggggtggctgtcctggagcctgctctctggaccgct
gagtgttattcagggtctttctccaggggagcaaagtgtggccagctctacatccccatt
gtcactccacagtgtctggtggttcagtggcataagacctacctgtgctacgaagtggag
cgcctggacaatggcacctcggtcaagatggaccagcacaggggctttctacacaaccag
gtgaccgacccagccatccgaatccgggcagggcccttccaatccagggacattcatagc
acagcctctgtctggagagaccaggctaagaatcttctctgtggcttttacggccgccat
gcggagctgcgcttcttggacctggttccttctttgcagttggacccggcccagatctac
agggtcacttggttcatctcctggagcccctgcttctcctggggctgtgccggggaagtg
cgtgcgttccttcaggagaacacacacgtgagactgcgtatcttcgctgcccgcatctat
gattacgaccccctatataaggaggcactgcaaatgctgcgggatgctggggcccaagtc
tccatcatgacctacgatgaatttaagcactgctgggacacctttgtggaccaccaggga
tgtcccttccagccctgggatggactagatgagcacagccaagccctgagtgggaggctg
cgggccattctccaggccacctccctgtgccctctttccactctctcacctcctgctcca
ttcaacccccctgctcttccagaatcagggaaactgaaggatgggcctcagtctctaagg
aaggcagagacctgggttgagcagcagaataaaagatcttcttccaagaaatgcaaacag
accgttcaccaccatctccagctgctcacagacgccagcaaagcagtatgctcccgatca
aaaagctcagtgagggctgtcgtggccactggcgacccaggcacatttgctccgcttgag
ctccctcaccgaatggcatctgggttccaaaataccaagtctggtcaagactctgcctac
atcacacttgctaatgccctgttggccaagggtcccgtgaaagtcaggttgcaggacccc
caggccacactggaggagaccctgaagctccactttgtcacccaggagtccgtgccttgc
agagcctgggccagaaatccgatggagcggatgtatcgagacacattctacgacaacttt
gaaaacgaacccatcctctatggtcggagctacacttggctgtgctatgaagtgaaaata
aagaggggccgctcaaatctcctttgggacacaggggtctttcgaggccaggtgtatttc
aagcctcagtaccacgcagaaatgtgcttcctctcttggttctgtggcaaccagctgcct
gcttacaagtgtttccagatcacctggtttgtatcctggaccccctgcccggactgtgtg
gcgaagctggccgaattcctgtctgagcaccccaatgtcaccctgaccatctctgccgcc
cgcctctactactactgggaaagagattaccgaagggcgctctgcaggctgagtcaggca
ggagcccgcgtgacgatcatggactatgaagaatttgcatactgctgggaaaactttgtg
tacaatgaaggtcagcaattcatgccttggtacaaattcgatgaaaattatgcattcctg
caccgcacgctaaaggagattctcagatacctgatggatccagacacattcactttcaac
tttaataatgaccctttggtccttcgacggcgccagacctacttgtgctatgaggtggag
cgcctggacaatggcacctgggtcctgatggaccagcacatgggctttctatgcaacgag
gctaagaatcttctctgtggcttttacggccgccatgcggagctgcgcttcttggacctg
gttccttctttgcagttggacccggcccagatctacagggtcacttggttcatctcctgg
agcccctgcttctcctggggctgtgccggggaagtgcgtgcgttccttcaggagaacaca
cacgtgagactgcgcatcttcgctgcccgcatctatgattacgaccccctatataaggag
gcgctgcaaatgctgcgggatgctggggcccaagtctccatcatgacctacgatgagttt
gagtactgctgggacacctttgtgtaccgccagggatgtcccttccagccctgggatgga
ctagaggagcacagccaagccctgagtgggaggctgcgggccattctccaggctcagacc
aaacacattgcacaacgcagttgcatggaagccacaaagatggaaacccagggcctgggg
ccctctgggagggacctacacacgatcatagcccccccgacaatgcctcgtgtgcccaac
accagcccaccagaaaatttcaccttcaagaggagcgcctcggcccggccgaagtaccgt
ctgggaagtgaggagcgcctcggcccggccgaagtaccgtctgggaaaaacccgatgaag
gcaatgtatccaggcacattctacttccaatttaaaaacctatgggaagccaacgatcgg
aacgaaacttggctgtgcttcaccgtggaaggtataaagcgccgctcagttgtctcctgg
aagacgggcgtcttccgaaaccaggtggattctgagacccattgtcatgcagaaaggtgc
ttcctctcttggttctgcgacgacatactgtctcctaacacaaagtaccaggtcacctgg
tacacatcttggagcccttgcccagactgtgcaggggaggtggccgagttcctggccagg
cacagcaacgtgaatctcaccatcttcaccgcccgcctctactacttccagtatccatgt
taccaggaggggctccgcagcctgagtcaggaaggggtcgctgtggagatcatggactat
gaagattttaaatattgttgggaaaactttgtgtacaatgataatgagccattcaagcct
tggaagggattaaaaaccaactttcgacttctgaaaagaaggctacgggagagtctccaa
aatccgatggagcggatgtatcgagacacattctacgacaactttgaaaacgaacccatc
ctctatggtcggagctacacttggctgtgctatgaagtgaaaataaagaggggccgctca
aatctcctttgggacacaggggtctttcgaggcccggtactacccaaacgtcagtcgaat
cacaggcaggaggtgtatttccggtttgagaaccacgcagaaatgtgcttcttatcttgg
ttctgtggcaaccgactgcctgctaacaggcgcttccagatcacctggtttgtatcatgg
aacccctgcctgccctgtgtggtgaaggtgaccaaattcttggctgagcaccccaatgtc
accctgaccatctctgccgcccgcctctactactaccgggatagagattggcggtgggtg
ctcctcaggctgcataaggcaggggcccgtgtgaagatcatggactatgaagactttgca
tactgctgggaaaactttgtgtgcaatgaaggtcagccattcatgccttggtacaaattc
gatgacaattatgcatccctgcaccgcacgctaaaggagattctcagaaacccgatggag
gcaatgtacccacacatattctacttccactttaaaaacctactgaaagcctgtggtcgg
aacgaaagctggctgtgcttcaccatggaagttacaaagcaccactcagctgtcttccgg
aagaggggcgtcttccgaaaccaggtggatcctgagacccattgtcatgcagaaaggtgc
ttcctctcttggttctgtgacgacatactgtctcctaacacaaactacgaggtcacctgg
tacacatcttggagcccttgcccagagtgtgcaggggaggtggccgagttcctggccagg
cacagcaacgtgaatctcaccatcttcaccgcccgcctctgctacttctgggatacagat
taccaggaggggctctgcagcctgagtcaggaaggggcctccgtgaagatcatgggctac
aaagattttgtatcttgttggaaaaactttgtgtacagtgatgatgagccattcaagcct
tggaagggactacaaaccaactttcgacttctgaaaagaaggctacgggagattctccaa
aacacagtggagcgaatgtatcgagacacattctcctacaacttttataatagacccatc
ctttctcgtcggaataccgtctggctgtgctacgaagtgaaaacaaagggtccctcaagg
ccccgtttggacgcaaagatctttcgaggccaggtgtattcccagcctgagcaccacgca
gaaatgtgcttcctctcttggttctgtggcaaccagctgcctgcttacaagtgtttccag
atcacctggtttgtatcctggaccccctgcccggactgtgtggcgaagctggccgaattc
ctggctgagcaccccaatgtcaccctgaccatctccgccgcccgcctctactactactgg
gaaagagattaccgaagggcgctctgcaggctgagtcaggcaggggcccgcgtgaagatt
atggacgatgaagaatttgcatactgctgggaaaactttgtgtacagtgaaggtcagcca
ttcatgccttggtacaaattcgatgacaattatgcattcctgcaccgcacgctaaaggag
attctcagaaacccgatggaggcaatgtatccacacatattctacttccactttaaaaac
ctacgcaaagcctatggtcggaacgaaagctggctgtgcttcaccatggaagttgtaaag
caccactcacctgtctcctggaagaggggcgtcttccgaaaccaggtggatcctgagacc
cattgtcatgcagaaaggtgcttcctctcttggttctgtgacgacatactgtctcctaac
acaaactacgaggtcacctggtacacatcttggagcccttgcccagagtgtgcaggggag
gtggccgagttcctggccaggcacagcaacgtgaatctcaccatcttcaccgcccgcctc
tactacttctgggatacagattaccaggaggggctccgcagcctgagtcaggaaggggcc
tccgtggagatcatgggctacaaagattttaaatattgttgggaaaactttgtgtacaat
gatgatgagccattcaagccttggaaaggactaaaatacaactttctattcctggacagc
aagctgcaggagattctcgagtga

>gi568815576f:38884071_39091515|GENSCAN_predicted_peptide_4|420_aa
MGEIKLKKDFGVKGDKPSVIRVNPETHSHAERCFLSWFCDNTLSPNKNYQTHPFSSEYRL
AVLRSENKGSLKAPFGRKDLSRPELALTLLLSQVYSELKYHPEMRFFHWFSKWRKLHRDQ
EYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIFVARLYYFWDPDYQEALRSLCQKRDG
PRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTF
NFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLD
VIPFWKLDLDQDYRVTCFTSWSPCFSCAQEMAKFISKNKHVSLCIFTARIYDDQGRCQEG
LRTLAEAGAKISIMTYSEFKHCWDTFVDHQGCPFQPWDGLDEHSQDLSGRLRAILQNQEN

>gi568815576f:38884071_39091515|GENSCAN_predicted_CDS_4|1263_bp
atgggagagattaaactgaagaaagattttggggtaaagggtgataaaccaagtgtgatc
agggtcaatcctgagacccatagtcatgcagaaaggtgcttcctctcttggttctgcgac
aacacactgtctcctaacaaaaactaccagacccatcctttctcgtcggaataccgtctg
gctgtgctacgaagtgaaaacaaagggtccctcaaggccccctttggacgcaaagatctt
tcgaggccagagcttgccctgaccctgctcctctcccaggtgtattccgaacttaagtac
cacccagagatgagattcttccactggttcagcaagtggaggaagctgcatcgtgaccag
gagtatgaggtcacctggtacatatcctggagcccctgcacaaagtgtacaagggatatg
gccacgttcctggccgaggacccgaaggttaccctgaccatctttgttgcccgcctctac
tacttctgggacccagattaccaggaggcgcttcgcagcctgtgtcagaaaagagacggt
ccgcgtgccaccatgaagatcatgaattatgacgaatttcagcactgttggagcaagttc
gtgtacagccaaagagagctatttgagccttggaataatctgcctaaatattatatatta
ctgcacatcatgctgggggagattctcagacactcgatggatccacccacattcactttc
aactttaacaatgaaccttgggtcagaggacggcatgagacttacctgtgttatgaggtg
gagcgcatgcacaatgacacctgggtcctgctgaaccagcgcaggggctttctatgcaac
caggctccacataaacacggtttccttgaaggccgccatgcagagctgtgcttcctggac
gtgattcccttttggaagctggacctggaccaggactacagggttacctgcttcacctcc
tggagcccctgcttcagctgtgcccaggaaatggctaaattcatttcaaaaaacaaacac
gtgagcctgtgcatcttcactgcccgcatctatgatgatcaaggaagatgtcaggagggg
ctgcgcaccctggccgaggctggggccaaaatttcaataatgacatacagtgaatttaag
cactgctgggacacctttgtggaccaccagggatgtcccttccagccctgggatggacta
gatgagcacagccaagacctgagtgggaggctgcgggccattctccagaatcaggaaaac
tga

>gi568815576f:38884071_39091515|GENSCAN_predicted_peptide_5|51_aa
GDWLLAVLLLLGLSLPISAQGSVSESGRGSVRYTRKDIYTFASVYGNTCNS

>gi568815576f:38884071_39091515|GENSCAN_predicted_CDS_5|156_bp
ggagactggctgctggctgtcctcttactgctgggcctgtcccttccaatcagcgcccag
ggctctgtgagtgagagtgggcggggctctgtccgctacacaaggaaagacatctacaca
tttgctagtgtttatggaaacacttgcaattcatga