GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:34:08

Sequence gi568815592r:143395710_143611550 : 215841 bp : 40.22% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8693   8846  154  1  1   81  110    49 0.413   6.61
 1.02 Term +  21917  22206  290  0  2  104   34   185 0.786   9.25
 1.03 PlyA +  22946  22951    6                               1.05

 2.00 Prom +  28419  28458   40                              -5.25
 2.01 Init +  30709  30789   81  0  0   77   52    64 0.424   2.72
 2.02 Term +  34425  34649  225  2  0   94   37   143 0.826   5.60
 2.03 PlyA +  34837  34842    6                               1.05

 3.00 Prom +  35162  35201   40                              -6.95
 3.01 Sngl +  40561  41001  441  0  0   74   43   376 0.409  27.70
 3.02 PlyA +  41145  41150    6                               1.05

 4.06 PlyA -  41295  41290    6                               1.05
 4.05 Term -  43026  42842  185  1  2   59   40    97 0.008  -1.28
 4.04 Intr -  50001  49951   51  1  0   53  111    61 0.022   2.86
 4.03 Intr -  54966  54838  129  1  0   52   61   153 0.174   8.75
 4.02 Intr -  55281  55080  202  0  1   25   69   264 0.283  16.14
 4.01 Init -  60888  60886    3  0  0   85   81     0 0.124  -0.95
 4.00 Prom -  62532  62493   40                              -7.55

 5.00 Prom +  63375  63414   40                              -7.25
 5.01 Init +  65301  65494  194  2  2   69   40    86 0.567   0.39
 5.02 Intr +  67207  67288   82  1  1  105   99   124 0.679  13.92
 5.03 Intr +  69856  69994  139  0  1  111   78   -43 0.429  -3.88
 5.04 Intr +  75252  75376  125  1  2   54   93   108 0.656   7.28
 5.05 Intr +  75674  75740   67  1  1   84   95    55 0.970   3.36
 5.06 Intr +  75848  75902   55  0  1   62  116    33 0.981   0.62
 5.07 Intr +  76451  76619  169  2  1  104   49    94 0.739   6.13
 5.08 Intr +  79077  79147   71  2  2   79   95    67 0.597   3.56
 5.09 Intr +  91259  91335   77  2  2   21   97    44 0.026  -3.16
 5.10 Term +  94741  94997  257  2  2   27   41   226 0.350   6.56
 5.11 PlyA +  97140  97145    6                               1.05

 6.08 PlyA -  97895  97890    6                               1.05
 6.07 Term - 100138  99998  141  1  0   71   39    84 0.619  -1.25
 6.06 Intr - 101788 101680  109  0  1   97   94    20 0.803   2.87
 6.05 Intr - 106413 106223  191  0  2   61  103   151 0.997  11.36
 6.04 Intr - 106856 106646  211  1  1   71   87   154 0.993  11.59
 6.03 Intr - 108543 108204  340  1  1   50   93   305 0.924  20.81
 6.02 Intr - 111715 111528  188  0  2  117  115    97 0.998  13.71
 6.01 Init - 115925 115702  224  2  2   79  105   298 0.999  26.68
 6.00 Prom - 117286 117247   40                             -12.03

 7.03 PlyA - 118045 118040    6                               1.05
 7.02 Term - 119643 119398  246  0  0   75   49   224 0.656  12.01
 7.01 Init - 120351 120217  135  0  0   59   93    47 0.954   2.51
 7.00 Prom - 122128 122089   40                              -7.35

 8.00 Prom + 126524 126563   40                              -7.35
 8.01 Init + 129203 129274   72  1  0   96   62    70 0.449   6.32
 8.02 Intr + 134582 134712  131  1  2   43   30   146 0.290   2.87
 8.03 Intr + 141208 141498  291  1  0   13  110   239 0.852  13.93
 8.04 Intr + 141696 141884  189  0  0   26   45   201 0.867   7.38
 8.05 Intr + 146526 146716  191  0  2  112   15   147 0.801   8.11
 8.06 Term + 152733 152944  212  1  2  102   54   115 0.703   5.97
 8.07 PlyA + 156296 156301    6                               1.05

 9.04 PlyA - 157950 157945    6                               1.05
 9.03 Term - 161404 161304  101  2  2   51   54    97 0.395  -0.09
 9.02 Intr - 161977 161843  135  2  0   13   84   168 0.291   8.52
 9.01 Init - 162353 162290   64  2  1   79   94    80 0.916   9.06
 9.00 Prom - 167089 167050   40                              -6.75

10.04 PlyA - 167405 167400    6                               1.05
10.03 Term - 169090 168963  128  2  2   37   52   153 0.383   4.06
10.02 Intr - 173475 173394   82  0  1  107   88     6 0.703   0.79
10.01 Init - 175002 174889  114  0  0   89   84    74 0.491   7.36
10.00 Prom - 178081 178042   40                              -6.85

11.00 Prom + 179067 179106   40                              -8.35
11.01 Init + 179735 179870  136  1  1   53   99    44 0.375   2.35
11.02 Term + 182929 183074  146  2  2 -127   48   396 0.705  11.19
11.03 PlyA + 183588 183593    6                               1.05

12.05 PlyA - 183607 183602    6                               1.05
12.04 Term - 193925 193803  123  2  0   22   32   177 0.388   2.90
12.03 Intr - 201421 201226  196  0  1   56  108    88 0.459   6.10
12.02 Intr - 209829 209793   37  1  1  120   98    31 0.365   3.70
12.01 Intr - 215393 215268  126  0  0   35   69   105 0.263   3.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_1|147_aa
MPRTKLLNISTKNANAWSQVGKEAEAHITAQATCQFPIGSARQTSILSGPPGNEIRREFK
NILSFLHGTTSSVLTSLPPYPPNVKERVFINVSLLWPPEAAETDQDIDYISHNATMSPNY
LAHLKSSQYVTRPSRVNKEHQMKIWPK

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_1|444_bp
atgccccgaacaaaactactgaacatttctaccaaaaatgctaacgcctggtctcaggtg
ggcaaggaggctgaagcccacattacagctcaggcaacgtgtcaatttcccattggctct
gcaagacaaactagcatcctttcagggcccccaggtaacgagatcaggagagaattcaag
aatatcctgtccttccttcatggaaccacttcctcagtccttacctctcttcctccatat
cctcccaatgttaaagagagggtcttcatcaatgtgtctttgttgtggcccccagaagct
gctgaaacagatcaggacattgattacatcagccacaatgccacgatgagccccaattat
cttgctcatctcaaatcctctcagtatgttacccgaccaagcagggtgaacaaagagcac
cagatgaaaatatggcccaaataa

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_2|101_aa
MEGTVPQSKHLGQIFDRLITEEGTKKMHFTILTTASQSRHSCPLLPAVLSAGAPQRSPST
PLAHSEPSPLTWDESSGSQPLLLRRTSWNSLETQHCWPPAF

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_2|306_bp
atggagggaactgtgccccagtccaaacatttaggacagattttcgataggttaattact
gaagagggaaccaaaaagatgcattttaccatcctaaccactgccagccagtcccgtcac
agctgccccctgcttcctgctgtgttaagtgctggagctccccagaggtccccctccact
ccactcgcacactcagagccctctcctcttacgtgggatgagagcagtggttctcaacca
ttgctgctcaggagaaccagttggaactctctggaaacacagcactgttggccccctgcc
ttctga

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_3|146_aa
MAAYDPQYGLYLTEAAIFRGWMSMREVEEKMLHVQTKNSSYFADWIPYNGKTAFWDIPSA
GLKTSTTFISNNTAIQELLRDVSEQFTAMFRCKAFLHWYMGEGKNEMEFTGAESNMNDLV
SEYQQYQDATVEEEGESEEWTEEMGA

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_3|441_bp
atggctgcctatgacccccagtatggcctctatctaacagaggctgccattttcaggggc
tggatgtccatgagggaggtagaggagaaaatgcttcatgtccaaaccaagaacagcagc
tactttgctgactggatcccctacaatgggaaaacagctttctgggacatcccatccgca
ggactaaaaacgtccaccactttcatcagtaacaacacggccatccaggagctgctcaga
gacgtctcagagcagttcacagccatgttcaggtgcaaggccttcctgcactggtacatg
ggcgagggcaagaatgagatggaattcactggggccgagagcaacatgaatgacctggta
tcagagtaccaacagtatcaggatgccactgttgaggaggagggagagtctgaggagtgg
actgaggaaatgggggcgtag

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_4|189_aa
METVSSSPGPDDPARAPSSRSVLSRNHKRDFPQRLSPERQGGVDYSKAATLSAAAAAQPP
PLTGPTSPAEWWLGMEAKAAPKPAASGACSVSAEETEKWMEEAMHMVRSPGGTYCLIDEI
KSGTDEKVVWSITSYIFHFVFQAKEALENTEVPVGCLMVYNNEVVGKGRNEVNQTKNVCE
LSSIIANIQ

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_4|570_bp
atggaaactgtttcttcgtcacctggcccagatgacccagcaagagcgccgtcgtcccgt
tctgtgctctcccgaaatcacaaacgtgactttccgcagagactgtcaccggagcggcaa
gggggcgtggactacagcaaagcagctacgctttctgccgctgccgccgcgcagccgccg
ccgctcactggtccgacaagcccagctgagtggtggctgggtatggaggcgaaggcggca
cccaagccagctgcaagcggcgcgtgctcggtgtcggcagaggagaccgaaaagtggatg
gaggaggcgatgcacatggtgaggagcccgggaggaacatactgtttaattgatgaaatt
aagtcaggcacagatgaaaaggttgtatggagtattacgtcttacattttccactttgtg
tttcaggccaaagaagccctcgaaaatactgaagttcctgttggctgtcttatggtctac
aacaatgaagttgtagggaaggggagaaatgaagttaaccaaaccaaaaatgtatgtgag
ttgagcagtataattgcaaacatacagtag

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_5|411_aa
MPVKDKGEGRQKKASEAFRPQCRSDICEGKKTRKEERLGKKSLRLQHSSKKALGDHEESA
SQSFCAVHASNTERGLNAATEFREPHSSAKKQNFYKKYLEIFLREHSKRDMSHLGYIKYL
TFELWLIPSLIRIHMTFCRFTRSTVAVYSTCMLVVLLRVQLNIIGGYIYLDNAAVGKNGT
TILAPPDVQQQYLSSIQHLLGDGLTELITVIKQAVQKVLGSVSLKHSLSLLDLEQKLKEI
RNLVEQHKSSSWINKDGSKPLLCHYMMPDEETPLAVQACGLSPRDITTIKLLNETRDMLE
RRQWIWPAGHTLLTPLVEDIRMVVGKGSGAIALTIDTSEPTGIVSEQKMAVPLTYANLGK
SAKHVFTKGYGFGLIKPDLKTKSENGLEFTSSGSANTETTKVTGSLETKYR

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_5|1236_bp
atgcctgtaaaggataaaggggaagggaggcagaagaaggcaagcgaagcatttaggcca
cagtgcaggtctgacatttgtgaagggaaaaagacaaggaaggaagaaagattaggcaag
aagagtctcagactacagcacagttctaagaaagcattgggagaccatgaagagtctgca
agccaaagtttctgtgctgtccatgcttccaacactgagagaggccttaatgcagcaact
gaattccgagagcctcacagctctgctaaaaaacagaatttttataagaagtacttagaa
atttttctgagggagcattcaaaaagagatatgagtcacttaggatacattaaatacctc
acctttgaattgtggttgattcccagtttaatccgaatacatatgacattctgccgtttc
acaagaagtactgtggctgtatacagtacctgtatgctggttgttcttttgcgggtccag
ttaaacataattggtggatatatttacctggataatgcagcagttggcaaaaatggcact
acaattcttgctcccccagatgtccaacagcagtatttatcaagtattcagcacctactt
ggagatggcctgacagaattgatcactgtcattaaacaagctgtgcagaaggttttagga
agtgtttctcttaaacattctttgtcccttttggacttggagcaaaaactaaaagaaatc
agaaatctcgttgagcagcataagtcttcttcttggattaataaagatggatccaaacct
ttattatgccattatatgatgccagatgaagaaactccattagcagtgcaggcctgtgga
ctttctcctcgagacattaccactattaaacttctcaatgaaactagagacatgttggaa
aggcggcagtggatttggcctgcaggccatactttgctgacccctttggtagaagatatt
aggatggtagttggtaaaggtagtggagccatagccctcaccatagacacctctgagccc
actggcattgtctctgagcagaagatggctgtgccactcacatatgctaatcttggcaaa
tctgccaagcatgtcttcaccaagggatatggatttggcttaataaaacctgatttgaaa
acaaaatctgagaatggattggaatttacaagctcaggctcagccaacactgagaccacc
aaagtgacaggcagtctggaaaccaagtacagatag

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_6|467_aa
MRPQELPRLAFPLLLLLLLLLPPPPCPAHSATRFDPTWESLDARQLPAWFDQAKFGIFIH
WGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWADI
FQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFGLY
YSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYWNS
TGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTIDKL
SWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQMGSWL
KVNGEAIYETHTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILGAT
EVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_6|1404_bp
atgcggccccaggagctccccaggctcgcgttcccgttgctgctgttgctgttgctgctg
ctgccgccgccgccgtgccctgcccacagcgccacgcgcttcgaccccacctgggagtcc
ctggacgcccgccagctgcccgcgtggtttgaccaggccaagttcggcatcttcatccac
tggggagtgttttccgtgcccagcttcggtagcgagtggttctggtggtattggcaaaag
gaaaagataccgaagtatgtggaatttatgaaagataattaccctcctagtttcaaatat
gaagattttggaccactatttacagcaaaattttttaatgccaaccagtgggcagatatt
tttcaggcctctggtgccaaatacattgtcttaacttccaaacatcatgaaggctttacc
ttgtgggggtcagaatattcgtggaactggaatgccatagatgaggggcccaagagggac
attgtcaaggaacttgaggtagccattaggaacagaactgacctgcgttttggactgtac
tattccctttttgaatggtttcatccgctcttccttgaggatgaatccagttcattccat
aagcggcaatttccagtttctaagacattgccagagctctatgagttagtgaacaactat
cagcctgaggttctgtggtcggatggtgacggaggagcaccggatcaatactggaacagc
acaggcttcttggcctggttatataatgaaagcccagttcggggcacagtagtcaccaat
gatcgttggggagctggtagcatctgtaagcatggtggcttctatacctgcagtgatcgt
tataacccaggacatcttttgccacataaatgggaaaactgcatgacaatagacaaactg
tcctggggctataggagggaagctggaatctctgactatcttacaattgaagaattggtg
aagcaacttgtagagacagtttcatgtggaggaaatcttttgatgaatattgggcccaca
ctagatggcaccatttctgtagtttttgaggagcgactgaggcaaatggggtcctggcta
aaagtcaatggagaagctatttatgaaacccatacctggcgatcccagaatgacactgtc
accccagatgtgtggtacacatccaagcctaaagaaaaattagtctatgccatttttctt
aaatggcccacatcaggacagctgttccttggccatcccaaagctattctgggggcaaca
gaggtgaaactactgggccatggacagccacttaactggatttctttggagcaaaatggc
attatggtagaactgccacagctaaccattcatcagatgccgtgtaaatggggctgggct
ctagccctgactaatgtgatctaa

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_7|126_aa
MAVNPRSLAQEALPQRTTQDRFPVRFLATSGSCLCHSNIKVTPGKRETALMPAARQMFHF
FHCPLQKNSGDCQDWLVAVIQLPYGDLSRMCPREPRSDISSTSAANLIPVDTHGKTASSG
SSKEFG

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_7|381_bp
atggctgtgaaccccaggtccttggcacaggaggccctgccacagagaaccactcaggac
agatttccagttaggttcttggctacttctgggtcctgtttgtgtcactcgaatatcaag
gtcaccccaggaaagagagagactgcattgatgccagctgccaggcaaatgtttcacttc
ttccactgtccacttcagaagaattctggggactgccaagactggctggtggcagtcatc
caactcccatatggagatctatccagaatgtgtcctcgagagcctcgctcagatatcagc
tccacctcggcagctaatcttatccctgtggacacacatggcaagactgcttcctctggc
agcagcaaggaatttggctag

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_8|361_aa
MPKVQGILSGRIQIQLTQRSLLAQHPEEPQVLKALSLQMTFLIRGCLCIADTTSSGKPSL
VLHPKRVRAGQQRAAQPSAPAPPGAPAPPPGADGRGGLAPRGVPGRSGRAGGGGPSPRAR
GARPAGGACPALASRGPEPPRPLTERPLVVLEQGPPAPGPHLRLRRKMLKAGDGLVRLAG
RVCSLGWGRERSLSAARRQAAPQLGFRGFSIRKRNILGLRRDFGPSEGESTLFFAQKPWA
SHLPRSLTAGQASFQKGHLLSDSYQVTIAADDSPVSELGTFEKLASPVKSGDFSDAVSIY
SPQISPIYMSSVKASREGFPWNPIPHTSLGQVAIPDTSYIRFVGQRPLGCILAGVESTVP
R

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_8|1086_bp
atgcccaaggttcagggtattctaagtgggagaattcagatccagctcactcagaggtct
ctgctggcccagcatccagaggagccccaggtcctcaaggctctgtccttgcagatgacc
ttcctcatccgggggtgcctttgcattgcagacaccacatcctctgggaagccttccctg
gttctccaccccaagcgggttagagccgggcagcagcgggcagcgcagccctcggcgccg
gccccacccggtgctccggccccgccgccgggagccgatggccgaggcgggctggcgccc
cgcggcgtccccggtcgatccggccgggcaggcggcggcggtcccagtccccgagcccga
ggcgcccgccccgccggcggcgcctgccctgcgctggcttcccggggacccgagcccccg
cggccgctcacagagcgacctctcgtcgtcctcgagcaggggccgcccgctccgggtcca
catctccggctcaggcggaagatgctaaaagcaggcgacggcttggtgcgccttgcgggg
cgagtgtgcagcctgggttggggtagggagcgctccctctcggctgcacggcgccaggca
gcaccgcagcttggcttccgaggcttttccatcagaaagaggaacattctcggtcttcgg
cgcgactttggtccctcggaaggggagagcacccttttctttgcacaaaagccatgggct
tctcatctacccaggagcctgaccgccggtcaggcttcattccagaaggggcatctccta
tctgattcttaccaagttaccatagcagctgatgacagtcctgtatcagagcttggcaca
tttgaaaaattggcttcacctgttaagtcaggagacttctctgatgctgtgagcatatat
tctcctcagatttcccctatttacatgtcttctgttaaagctagcagagagggcttccct
tggaatccaattccacacaccagcctgggccaggtagccattcctgacaccagctatatc
cgttttgttggacagagaccactggggtgtattcttgcaggtgtggagagcacagttcct
cgatga

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_9|99_aa
MWDGPKENVREKEFSLVSPEAGLATRPYGERSQPRKNDWSVEESGRAGPGPHGPVSSRKI
ETLCGEGKSLSNQVNPLAEPHGEHILERDGRRAPVALSN

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_9|300_bp
atgtgggatggacccaaggagaacgtcagagagaaagagttcagtctggtgtctcctgaa
gcaggactcgcgacaagaccttacggagaaagaagtcagccaaggaaaaacgactggagt
gtggaagagtctggaagagctggaccaggtccacatggtccggtcagttctcgaaaaatc
gagaccctgtgtggtgaggggaaaagtttgtcaaaccaagtgaatcctcttgccgaacca
catggagaacatattttggagagagatggtaggcgagctcctgtggccctctctaactga

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_10|107_aa
MAAVQRTDLHKKVWGLGKESKRGEGEPQESGEGPATPGPTLAGTNHIALPNCKGGQKVQH
FLMVTGTVTVCDFAIEILPKALFLISRPSENCRPISQWHLPGTFSGC

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_10|324_bp
atggctgctgtgcaaaggactgatcttcataaaaaggtgtggggattagggaaggagagc
aagcggggagagggagagccccaggagagtggggaggggcctgccacacctgggcccaca
ctggctggaaccaatcacatagccctgcctaactgcaagggaggccagaaagtgcaacac
tttctcatggtcacggggactgttactgtttgtgactttgccattgagattctgccaaag
gctctgtttctgatcagcaggccttccgagaactgccgtccgatcagccagtggcattta
ccaggcaccttctcagggtgctga

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_11|93_aa
MNYQNASLEDGFQKDGLGEKALPSCCFTVWTLDGAVVILISQCNTGEDEEEEEEEGEEED
MSAEEEDEDDYNNGEVDDEEDEEDLGEEERSQK

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_11|282_bp
atgaactatcaaaacgccagtttagaggatggctttcagaaagatggacttggagaaaaa
gcgctccctagctgctgttttaccgtgtggacactagatggtgcagtggtcatactgatc
agccaatgtaatactggagaggatgaggaggaggaggaggaggaaggagaagaggaggac
atgagtgcagaagaggaggatgaagacgattataacaatggagaggtagatgatgaagaa
gatgaagaagaccttggtgaagaagaaaggagtcagaagtga

>gi568815592r:143395710_143611550|GENSCAN_predicted_peptide_12|160_aa
XKKHTPINFASECLITVHSREPLDGGKEDSGTGRMLMMLVVTASEEGQCNREKAHVEREN
TEGKIHAKCRKIQSIEQKNVTLMCILGAETQEGRRQEGCAGSNRAVARHRQKADGAAGQQ
KRKFGHRDTGSEHTEKRPPEDTRRQLYQSVLALQELPETE

>gi568815592r:143395710_143611550|GENSCAN_predicted_CDS_12|483_bp
nataaaaaacacacaccaatcaactttgcctctgaatgtcttatcacagttcattcacga
gaaccacttgatggaggtaaggaggatagtggcactggcagaatgctgatgatgcttgta
gtcacagcatcagaggaagggcaatgcaaccgtgaaaaagcacatgtggagcgtgagaac
acagaggggaaaatacatgccaagtgtagaaagatccagagtatagagcagaagaatgtg
accttgatgtgcatactgggagcagaaacgcaggagggaaggaggcaggagggctgtgct
ggcagcaatagggctgttgcaagacacaggcagaaggcagatggggctgctggccagcaa
aagaggaaatttggacacagagacaccggaagtgaacacacagagaaaagaccacctgag
gatacaagacggcagttgtatcagtctgttctcgcattgcaagaactacctgagactgag
taa