GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:15:38 Sequence gi568815587r:10459114_10668532 : 209419 bp : 44.37% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2402 2627 226 0 1 86 76 403 0.805 35.94 1.02 Intr + 19413 19617 205 0 1 84 74 180 0.523 15.40 1.03 Intr + 22950 23112 163 2 1 115 80 197 0.979 21.15 1.04 Intr + 25707 25926 220 1 1 93 87 387 0.958 36.46 1.05 Intr + 28032 28251 220 0 1 84 100 148 0.800 13.90 1.06 Intr + 34236 34525 290 2 2 87 33 418 0.571 32.24 1.07 Intr + 35555 35717 163 2 1 80 25 125 0.647 5.38 1.08 Intr + 35786 35917 132 1 0 107 113 114 0.999 16.64 1.09 Intr + 36457 36620 164 0 2 67 83 279 0.996 24.07 1.10 Intr + 37699 37825 127 1 1 69 92 109 0.737 10.08 1.11 Intr + 40973 41136 164 1 2 60 96 294 0.998 26.17 1.12 Intr + 42357 42477 121 0 1 121 86 158 0.998 19.40 1.13 Intr + 43608 43781 174 2 0 73 116 100 0.950 11.44 1.14 Intr + 45436 45546 111 0 0 -52 80 181 0.655 3.88 1.15 Term + 46595 46771 177 1 0 94 42 181 0.999 11.79 1.16 PlyA + 47990 47995 6 1.05 2.12 PlyA - 50010 50005 6 1.05 2.11 Term - 52077 52011 67 2 1 91 38 48 0.242 -2.49 2.10 Intr - 60028 59921 108 0 0 48 115 66 0.427 4.70 2.09 Intr - 66260 66079 182 2 2 88 71 63 0.693 3.27 2.08 Intr - 71638 71530 109 0 1 131 100 32 0.733 8.99 2.07 Intr - 95739 95654 86 0 2 8 116 61 0.007 -0.48 2.06 Intr - 100184 100025 160 1 1 95 9 185 0.019 11.29 2.05 Intr - 100781 100703 79 0 1 116 97 11 0.971 3.41 2.04 Intr - 101687 101382 306 0 0 90 84 65 0.550 2.72 2.03 Intr - 104966 104827 140 1 2 84 105 20 0.975 3.41 2.02 Intr - 105261 105090 172 1 1 66 94 122 0.985 9.60 2.01 Init - 109419 109335 85 0 1 110 100 84 0.998 10.78 2.00 Prom - 109994 109955 40 -4.96 3.24 PlyA - 112144 112139 6 1.05 3.23 Term - 117462 117219 244 2 1 74 49 47 0.324 -5.23 3.22 Intr - 121476 121342 135 2 0 125 75 205 0.894 22.58 3.21 Intr - 122873 122754 120 1 0 94 84 124 0.972 12.21 3.20 Intr - 132499 132435 65 1 2 79 106 7 0.563 -0.88 3.19 Intr - 134486 134379 108 2 0 103 111 -7 0.778 3.58 3.18 Intr - 135082 135033 50 2 2 92 91 -2 0.755 -1.10 3.17 Intr - 141946 141805 142 1 1 126 95 188 0.952 23.23 3.16 Intr - 144138 144007 132 0 0 60 109 123 0.987 12.44 3.15 Intr - 145432 145274 159 1 0 102 48 127 0.599 10.28 3.14 Intr - 146573 146427 147 2 0 75 75 31 0.648 0.93 3.13 Intr - 147659 147629 31 1 1 91 107 20 0.964 2.33 3.12 Intr - 150714 150615 100 1 1 68 93 107 0.670 8.37 3.11 Intr - 164743 164665 79 1 1 49 100 60 0.586 2.42 3.10 Intr - 167470 166853 618 1 0 100 90 439 0.859 37.81 3.09 Intr - 167711 167565 147 2 0 44 93 45 0.508 1.03 3.08 Intr - 168650 168603 48 2 0 104 76 37 0.654 2.98 3.07 Intr - 170598 170492 107 1 2 112 97 253 0.699 28.53 3.06 Intr - 172948 172878 71 0 2 80 82 15 0.838 -1.07 3.05 Intr - 174958 174855 104 1 2 78 92 77 0.723 6.07 3.04 Intr - 176399 176213 187 1 1 16 47 90 0.093 -2.61 3.03 Intr - 182966 182843 124 0 1 90 78 62 0.252 5.04 3.02 Intr - 185769 185601 169 0 1 50 95 18 0.115 -1.78 3.01 Intr - 193069 192912 158 2 2 71 108 129 0.151 12.93 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 81949 82237 289 1 1 74 53 152 0.873 5.15 S.002 Term - 100184 99998 187 1 1 95 37 214 0.980 13.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:10459114_10668532|GENSCAN_predicted_peptide_1|885_aa XEMPRQFPKLNISEVDEQVRLLAEKVFAKVLREEDSKDALSLFTVPEDCPIGQKEAKERE LQKELAEQKSVETAKRKKSFKMIRSQSLSLQMPPQQDWKGPPAASPAMSPTTPVVTGATS LPTPAPYAMPEFQRVTISGDYCAGITLEDYEQAAKSLAKALMIREKYARLAYHRFPRITS QYLGHPRADTAPPEEGLPDFHPPPLPQEDPYCLDDAPPNLDYLVHMQGGILFVYDNKKML EHQEPHSLPYPDLETYTVDMSHILALITDGPTQGWPLQMLEASKLESSMRLNYGLSPSPT CRKTYCHRRLNFLESKFSLHEMLNEMSEFKELKSNPHRDFYNVRKVDTHIHAAACMNQKH LLRFIKHTYQTEPDRTVAEKRGRKITLRQVFDGLHMDPYDLTVDSLDVHAVSELLLQCRQ QTAALAGDSAPWKATMIVLAGRWGLCCSFTGGLAKGSQADRVRIHGPGAQEFLRFHARID IEDVFRCGHTEGRLVKGRQTFHRFDKFNSKYNPVGASELRDLYLKTENYLGGEYFARMVK EVARELEESKYQYSEPRLSIYGRSPEEWPNLAYWFIQHKVYSPNMRWIIQVPRIYDIFRS KKLLPNFGKMLENIFLPLFKATINPQDHRELHLFLKYVTGFDSVDDESKHSDHMFSDKSP NPDVWTSEQNPPYSYYLYYMYANIMVLNNLRRERGLSTFLFRPHCGEAGSITHLVSAFLT ADNISHGLLLKKSPVLQYLYYLAQIPIAMSPLSNNSLFLEYSKNPLREFLHKGLHVSLST DDPMQFHYTKEALMEEYAIAAQVWKLSTCDLCEIARNSVLQSGLSHQEKQKFLGQNYYKE GPEGNDIRKTNVAQIRMAFRYETLCNELSFLSDAMKSEEITALTN >gi568815587r:10459114_10668532|GENSCAN_predicted_CDS_1|2658_bp nctgagatgccgcggcagtttcccaagctgaacatctctgaagtggatgagcaagtccgg ctcctggcggagaaggtgtttgctaaagtgctccgagaagaggacagcaaagatgccctg tccctgttcactgtcccagaggactgccccatcgggcaaaaggaagccaaggagagggag ctgcagaaggagctggcagagcagaagtctgtggagaccgcaaaaagaaagaaaagtttc aagatgattcggtcccagtccctgtctctgcaaatgccgccacagcaagattggaagggc cccccggcagccagtccggccatgtctcccacaacccctgtggtcactggagccacttcc ctgcccacgccagcaccctatgccatgcctgagttccagcgggtcaccatcagcggagat tactgtgccgggatcactttggaggactatgagcaggcagccaagagtctggccaaggcc ctaatgatccgggagaagtatgcgcggctcgcctaccaccgcttcccgcggatcacatcc cagtacctgggtcatccgcgggcggatactgcacctccggaagagggccttccagacttc caccctcctccactgccccaggaagacccctactgcctggatgatgcaccccccaacctg gattacttggtccacatgcaggggggcatcctctttgtgtatgataacaagaagatgctg gagcaccaggagccgcacagcctaccctaccccgacctggagacctacacggtggacatg agccacatcctggctctcatcaccgatggccccacccagggttggcccctgcagatgctg gaggcctccaaactggagagctcgatgcgactcaactatggtctctccccatctccaact tgcaggaaaacctattgtcaccggcgactgaactttctggaatccaagttcagccttcat gagatgttaaacgaaatgtccgagttcaaagagttgaagagtaacccccaccgggacttc tataacgtgagaaaggtggacacacacatccatgcggccgcctgcatgaaccaaaagcat ctgctgcgcttcatcaagcacacataccagacggagcctgacaggactgtggcagagaag cggggccggaagatcaccctgcggcaggtgtttgacggcctgcacatggacccctacgac ctcactgtggactcactggatgtccacgcggtgagtgagcttctgctccagtgccgccag cagacagcagccctggctggggactcagccccctggaaagccaccatgattgtgcttgcc gggaggtggggcctctgctgttccttcacagggggccttgccaaaggatcccaagctgac cgagtgaggatccatggtcctggtgctcaggagtttctaaggtttcatgcaagaattgac atagaagatgtctttcggtgtggccatacagaaggccgcctcgtaaagggccggcagaca ttccaccgctttgacaagttcaactccaaatacaaccctgtgggggccagtgagctgcgt gacctgtatttgaaaactgaaaactatctgggaggagagtactttgctcggatggtcaag gaggttgcccgggagctggaggagagcaagtaccagtactcagagccacggctctccatc tacggccgcagtcctgaggagtggcccaacctggcctactggttcatccagcacaaggtc tactctcccaacatgcgctggatcatccaggtgccccggatttatgacatatttaggtca aagaagctgctgccaaactttgggaagatgctggagaacatcttcctgccccttttcaag gccactatcaacccccaagatcatcgagagcttcacctcttccttaaatatgtgacgggg tttgacagcgtggatgatgagtccaagcacagcgaccacatgttttccgacaagagccca aacccggacgtctggaccagtgagcagaacccaccctacagctactacctgtactacatg tatgccaacatcatggtgctcaacaacctccgcagggagcgcggcctgagcacgttcctg ttccggccgcactgtggggaagccggctccatcacccacctggtgtctgccttcctcact gctgacaacatttcccacgggctgctcctcaagaagagtccggtattgcagtatctctac taccttgctcagatccccattgccatgtctcctcttagcaacaacagtttgttcctcgaa tattccaagaaccctctgagggaattcctacacaagggactgcatgtttctctttccacc gatgaccccatgcagttccactacacgaaggaagcacttatggaagaatatgccattgca gctcaagtgtggaagctgagcacctgcgacctgtgtgagatcgccaggaacagcgtgctg cagagcggcctctcgcatcaggaaaagcaaaagtttctgggacaaaattattataaagaa ggacctgaaggaaatgatattcgaaagacaaatgtggctcagatccggatggcattccga tatgagaccttatgcaatgagctcagcttcctgtctgatgctatgaaatcagaagagatc accgccttgaccaactag >gi568815587r:10459114_10668532|GENSCAN_predicted_peptide_2|497_aa MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEA CRLLGLSLAGKDQVETALKASFETCSYGWVGDGFVVISRISPNPKCGKNGVGVLIWKVPV SRQFAAYCYNSSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPA PTTTPPAPASTSIPRRKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTAL LVLALLFFGAAAGLGFCYVKRYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKT DKNPEESKSPSKTTKNSRALFYLCSHTTTTVINSITKELNDKRTAKVASGQEKHLLFEVQ PGSDSSAFWKVVVRVVCTKINKSSGIVEASRIMNLYQFIQLYKDITSQAAGVLAQSSTSE EPDENSSSVTSCQASLWMGRVKQLTDEEECCICMDGRADLILPCAHSFCQKCIDKWQQTS TVHSPGVVDPYPNRQQK >gi568815587r:10459114_10668532|GENSCAN_predicted_CDS_2|1494_bp atggccaggtgcttcagcctggtgttgcttctcacttccatctggaccacgaggctcctg gtccaaggctctttgcgtgcagaagagctttccatccaggtgtcatgcagaattatgggg atcacccttgtgagcaaaaaggcgaaccagcagctgaatttcacagaagctaaggaggcc tgtaggctgctgggactaagtttggccggcaaggaccaagttgaaacagccttgaaagct agctttgaaacttgcagctatggctgggttggagatggattcgtggtcatctctaggatt agcccaaaccccaagtgtgggaaaaatggggtgggtgtcctgatttggaaggttccagtg agccgacagtttgcagcctattgttacaactcatctgatacttggactaactcgtgcatt ccagaaattatcaccaccaaagatcccatattcaacactcaaactgcaacacaaacaaca gaatttattgtcagtgacagtacctactcggtggcatccccttactctacaatacctgcc cctactactactcctcctgctccagcttccacttctattccacggagaaaaaaattgatt tgtgtcacagaagtttttatggaaactagcaccatgtctacagaaactgaaccatttgtt gaaaataaagcagcattcaagaatgaagctgctgggtttggaggtgtccccacggctctg ctagtgcttgctctcctcttctttggtgctgcagctggtcttggattttgctatgtcaaa aggtatgtgaaggccttcccttttacaaacaagaatcagcagaaggaaatgatcgaaacc aaagtagtaaaggaggagaaggccaatgatagcaaccctaatgaggaatcaaagaaaact gataaaaacccagaagagtccaagagtccaagcaaaactaccaaaaactcccgagccttg ttttacctctgctctcacaccaccacaacagtcatcaactcaataacaaaagaactcaat gacaaaagaacggctaaagtggcttctggccaggaaaaacatcttctctttgaggtacaa cctgggtctgattcctctgctttttggaaagtggttgtacgggtggtctgtaccaagatt aacaaaagcagtggcattgtggaggcatcacggatcatgaatttataccagtttattcaa ctttataaagatatcacaagtcaagcagcaggagtattggcacagagctccacctctgaa gaacctgatgaaaactcatcctctgtaacatcttgtcaggctagtctttggatgggaagg gtgaagcagctgaccgatgaggaggagtgttgtatctgtatggatgggcgggctgacctc atcctgccttgtgctcacagcttttgtcagaagtgtattgataaatggcagcagaccagt actgtccacagcccaggggttgtggacccctaccccaacagacagcaaaaataa >gi568815587r:10459114_10668532|GENSCAN_predicted_peptide_3|1081_aa XCGAQASWSIFGADAAEVPGTRGHSQQEAAMPHIPEDEEPPGEPQAAQSPAGQPCPHWPT ERFCLYLPGNSLGTVCCPTREKVCDPKDQLSVRNEMKRNQIKHVKHLEPAVCSALCWFEG FLEEEVPSVKESAAWKNKSVRGQQGTRNQGSGASGGENISYAGALEGFGPPGKVAIETVG THCFEAAGGSTNSGGGSNMLWDARRQAFGKGSQGPPAAGVSCSPTPTIVLTGDATSPEGE TDKNLANRVHSPHKRLSHRHLKVSTASLTSVDPAGHIIDLVNDQLPDISISEEDKKKNLA LLEEAKLQKGDEADVSSPHPGEPLLSLGKFLAGFGLQVPFSCLFDKPRAEALSFHIVHSY PIHTYTNGPLGKNVPKGLADRKQNDQRKVSQGRLAPRPPPVEKSKEIAIEQKENFDPLQY PETTPKGLAPVTNSSGKMALNSPQPGPVESELGKQLLKTGWEGSPLPRSPTQDAAGVGPP ASQGRGPAGEPMGPEAGSKAELPPTVSRPPLLRGLSWDSGPEEPGPRLQKVLAKLPLAEE EKRFAGKAGGKLAKAPGLKDFQIQVQPVRMQKLTKLREEHILMRNQNLVGLKLPDLSEAA EQEKAIEEEESKSGLDVMPNISDVLLRKLRVHRSLPGSAPPLTEKEVEMYMDTALGMVEI KQAFKSDSLRLNPGSVTPCHLEQVNDRSGLGLFTHKQNVFVQLSLAFRNDSYTLESRINQ AERERNLTEENTEKELENFKASITVIVEPDSSASLWHHCEHRETYQKLLEDIAVLHRLAA RLSSRAEVVGAVRQEKRMSKATEVMMQYVENLKRTYEKDHAELMEFKKLANQNSSRSCGP SEDGVPRTARSMSLTLGKNMPRRRVSVAVVPKFNALNLPGQTPSSSSIPSLPALSESPNG KGSLPVTSALPALLENGKTNGDPDCEASAPALTLSCLEELSQETKARMEEEAYSKGFQEG LKKTKELQDLKEEEEEQKSESPEEPEEVEETEEEEKGPRSSKLEELVHFLQVMYPKLCQH WQVIWMMAAVMLVLTVVLGLYNSYNSCAEQADGPLGRSTCSAAQRDSWWSSGLQHEQPTE Q >gi568815587r:10459114_10668532|GENSCAN_predicted_CDS_3|3246_bp ncctgtggagcccaggcctcttggagcatctttggggctgacgcagcggaggttccgggc acacgtggccactcccagcaggaggctgccatgccccacattcccgaggacgaggagccc cccggagagccacaggcagcccagagccctgccggccaaccctgtcctcactggccaaca gagaggttctgcctgtacctccctggaaactccttgggaaccgtgtgctgcccgactaga gaaaaggtgtgtgacccaaaggatcaattatctgtcaggaatgagatgaagaggaaccag ataaagcatgtaaaacacttggaacctgctgtgtgttcagccttgtgctggtttgagggc ttcctggaggaagaggtaccttctgtgaaggagagtgcagcttggaagaataagagtgtc agaggccagcagggcaccaggaaccaggggagtggggctagtggaggtgaaaacatttca tatgcaggggctctagaaggatttggcccaccagggaaagtggcaatagagacagtgggc acccactgctttgaagctgcgggaggcagcaccaatagtggaggtggcagcaacatgctc tgggacgctaggaggcaggcctttggcaaaggcagtcagggtcctcctgccgcaggagta tcttgcagtccaactcccacgattgtcctgactggggatgccacttcaccagaaggagaa accgacaaaaacctggccaacagagttcacagtccccacaagaggctttctcaccgacac ttgaaggtgtccactgcctccctgacatctgtggaccccgcggggcacatcattgacctg gtgaatgaccagctgccagacatcagcatctcagaggaggacaagaagaaaaacctggcg ctgctggaagaagccaagttgcagaagggggatgaggccgacgtctcttcacctcaccct ggcgagcctctcctatcccttgggaaattcttggctggttttggattgcaggttccattt tcctgtctgtttgacaaacctcgggcagaagccctctccttccacatcgtccactcatac cccatacacacttatacaaatgggccccttggcaagaacgtccccaaagggctagctgac aggaagcagaatgaccagaggaaagtgtctcagggcaggctggctcctcgtcctcctcca gttgagaagtccaaagagattgcaatagaacaaaaggaaaacttcgatcccctccagtac cccgagaccacacccaaaggcctagctcctgttacaaacagcagtgggaaaatggccctg aacagccctcagcctggccccgtggagagcgagctggggaagcagctcttgaaaacgggc tgggagggcagccctctgccgagaagtccaacccaggatgcggcaggagtgggtccccca gcctcccaggggagaggcccagctggagagccgatggggcccgaggctggctccaaagct gagcttccacccactgtgtcccggcccccgctgctgcgagggctctcctgggacagtggc cctgaagaacctggcccccggctgcagaaagtgcttgccaagctgccactggcagaggaa gaaaagcgttttgcaggcaaggccggcggcaagctggccaaggcccctggtctcaaagac tttcagatacaagtgcagcccgtgcggatgcagaaactgaccaagctccgagaggagcac atcctgatgagaaatcagaacttagtggggctcaagcttccagaccttagtgaagcagct gagcaggaaaaagctattgaggaagaagagtcaaagagtggcttagatgtcatgcctaat atttctgatgtgctgctgcgcaaactgcgggtccacaggagtctccctggaagtgcccct ccactcactgaaaaggaagttgagatgtatatggacacagctctcggcatggtggaaata aaacaggctttcaaatcagacagtctgaggttaaatcctggctccgttactccgtgtcac cttgaacaagttaatgaccgttctggtcttggcctctttacccataaacagaacgtgttt gtgcaactgtccttggcctttagaaatgacagctacactctggaatctagaattaaccag gctgaaagggaacgcaacctgacagaggagaacactgagaaagaactggaaaacttcaaa gcttccattacggtaattgtggagcctgattcctcagcttcactctggcaccactgtgag caccgggaaacctaccagaagttgctggaggacatcgctgtcctgcaccgcctggctgcc cgcctctccagccgagctgaggtggtaggcgccgtccgccaggaaaagcgcatgtcgaaa gcaacggaagtgatgatgcagtatgtggagaatctaaagaggacgtatgagaaggaccat gcggagctcatggagtttaaaaagcttgcaaatcagaattcaagccgcagctgtggcccc tctgaagatggggtccctcgcacggcacggtccatgtccctcacgctgggaaagaatatg cctcgccggagggtcagcgttgctgtggttcctaagtttaatgccctgaatctgcctggc caaactcccagctcatcatccattccctccttaccagccttgtcggaatcacccaatggg aaaggcagcctacctgtcacttcagcactgcctgcacttttggaaaatggaaagacaaat ggggacccagattgtgaagcctctgctcctgcgctgaccctgagctgcctggaggagctt agtcaggagaccaaggccaggatggaggaagaagcctacagcaagggattccaagaaggt ctaaagaagaccaaagaacttcaagacctgaaggaggaggaggaagaacagaagagtgag agtcctgaggaacctgaagaggtagaagaaactgaggaagaggaaaagggcccaagaagc agcaaacttgaagaattggtccatttcttacaagtcatgtatcccaaactgtgtcagcac tggcaagtgatctggatgatggctgcagtgatgctggtcttgactgttgtgctggggctc tacaattcctataactcttgtgcagagcaggctgatgggccccttggaagatccacttgc tcggcagcccagagggactcctggtggagctcaggactccagcatgagcagcctacagag cagtag