GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:23:06 Sequence gi568815592f:86915335_87116429 : 201095 bp : 38.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4243 4282 40 -3.65 1.01 Sngl + 6867 7199 333 2 0 79 50 299 0.935 20.97 1.02 PlyA + 7280 7285 6 1.05 2.00 Prom + 8003 8042 40 -10.45 2.01 Init + 8119 9408 1290 0 0 66 41 547 0.505 39.82 2.02 Intr + 9502 10669 1168 0 1 -20 34 393 0.072 10.41 2.03 Intr + 18344 18446 103 0 1 66 80 73 0.238 2.61 2.04 Intr + 21749 21841 93 2 0 89 105 41 0.283 4.06 2.05 Intr + 22274 22489 216 2 0 79 100 133 0.382 10.30 2.06 Intr + 25606 25729 124 1 1 70 -37 145 0.242 -0.13 2.07 Term + 33330 33515 186 2 0 72 49 170 0.587 8.01 2.08 PlyA + 34760 34765 6 1.05 3.00 Prom + 50768 50807 40 -1.55 3.01 Sngl + 55087 55434 348 0 0 55 48 322 0.382 20.69 3.02 PlyA + 55606 55611 6 1.05 4.00 Prom + 61291 61330 40 -3.85 4.01 Sngl + 64459 64632 174 0 0 71 36 232 0.562 10.94 4.02 PlyA + 66336 66341 6 1.05 5.00 Prom + 77396 77435 40 -3.55 5.01 Init + 85763 85961 199 1 1 56 89 55 0.305 1.51 5.02 Term + 86822 87042 221 0 2 42 49 139 0.246 1.62 5.03 PlyA + 87808 87813 6 1.05 6.00 Prom + 93349 93388 40 -3.65 6.01 Sngl + 100001 101098 1098 1 0 93 38 968 0.772 89.00 6.02 PlyA + 101322 101327 6 1.05 7.02 PlyA - 101457 101452 6 1.05 7.01 Sngl - 119819 119220 600 2 0 58 37 270 0.720 14.84 7.00 Prom - 120054 120015 40 -11.04 8.02 PlyA - 120164 120159 6 1.05 8.01 Sngl - 121471 120815 657 1 0 49 36 261 0.520 12.92 8.00 Prom - 121564 121525 40 -8.75 9.05 PlyA - 121735 121730 6 1.05 9.04 Term - 122815 122285 531 1 0 -70 49 460 0.041 19.56 9.03 Intr - 129302 129183 120 2 0 91 75 48 0.058 3.57 9.02 Intr - 144919 144729 191 2 2 81 58 83 0.001 2.98 9.01 Init - 160178 159872 307 2 1 96 64 152 0.445 11.10 9.00 Prom - 163011 162972 40 -7.25 10.05 PlyA - 163281 163276 6 1.05 10.04 Term - 163960 163826 135 1 0 6 54 150 0.097 0.44 10.03 Intr - 167885 167800 86 0 2 62 68 31 0.206 -2.88 10.02 Intr - 168026 167944 83 1 2 62 93 132 0.417 9.46 10.01 Init - 170109 170018 92 0 2 37 67 104 0.388 3.21 10.00 Prom - 189162 189123 40 -3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_1|110_aa MRKKQNRKTGNSKKQSASPPPKIRSSSPATEQSWTENDFDELREEGFRRSNYSKLQEEIQ TKGKEVENFEKNLDEGITRITNTEKCLKELMELKAKARELREECRRLRSR >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_1|333_bp atgaggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaagatacgcagctcctctccagcaacggaacaaagctggacggagaatgactttgac gagttgagagaagaaggcttcagaagatcaaactactccaagctacaggaggaaattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagacgaaggtataactagaata accaatacagagaagtgcctaaaggagctgatggagctgaaagccaaggctcgagagcta cgtgaagaatgcagaagactcaggagccgatga >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_2|1059_aa MEEDLPSKWKTKKAGVAILVSDKTDFKPTKIKRDKEGHYIMVKGSIHQEELTILNIYAPN TGAPRFIKQVLSDLQRDLDSHTIIMGDFNTPLSTLDRSMRQKVNKDTQEFNSALHQADLI DIYRTLHPKSTEYTFFSAPYHTYSKIDHIVQSKALLSKCKRSEIITNCLSDHSAIKLDLR IKKLTQNHSTTWKLNNLLLNNYWVHNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGI FIALNAHKRKQETSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQN INESRSWFFERINKIDRPLARLIKKKREKKQIDTIKNDKGDITTDPTEIQTTIREYYKHL YTNKLKNLEEMDKFLDTYTLPRLNQEEVESLNTPITGSEIVAIINSLPTKKSPGPDGFTA EFYQRYKEELPGRDTTKKENFRPISLMNIDAKILNKILANQIRQHIKKLIHHDQVGFIPG MQGWLNIHKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKI IRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQ LGKEEVKLSLFADDMIVYLENPIVSAQNLKLISNFSKVSGYKINVQKSQAFLYTNNRQTE SQIMSELPFTIASKRIKYLGIQLTRDGKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRI NIMKMAILPKVIYRFNAIPIKLPMTFFTELEKTTFKFIWNQKRAHIAKSILSQKNKAGGI TLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPLPLEKGPSPISLSNFQVGGVQ EIVIENVSTISTHELATEEGSRVCPLVVSIFPFSARAVAGKGLRRFCLAAPRRALARART LAGRTPWAQRTPESNVERSSALIQLRRKRSGFRVRLLEPAGRAGLPSAARLHAPSTRFLL HVKLYEHKPQGNIEDECAVTSDEGTDEQGREMGQQHFRVTLNAPLAPRVTWDPGPSKHLV LQGTVIGSGIGRDPSRPIGINEAQLRGVQVPSPTLDLEI >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_2|3180_bp atggaggaagatctaccaagcaaatggaaaacaaaaaaggcaggggttgcaatcctagtc tctgataaaacagactttaaaccaacaaagatcaaaagagacaaagaaggccattacata atggtaaagggatcaattcaccaagaagagctaactatcctgaatatatatgcacccaat acaggagcacccagattcattaagcaagtcctgagtgacctacaaagagacttagactcc cacaccataataatgggagactttaacaccccactgtcaacattagacagatcaatgaga cagaaagttaacaaggatacccaagaattcaactcagctctgcaccaagcggacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccatac cacacctattccaaaattgaccacatagttcaaagtaaagctctcctcagcaaatgtaaa agatcagaaattataacaaactgtctctcagaccacagtgcaatcaaactagacctcagg attaagaaactcactcaaaaccactcaactacgtggaaactgaacaacctgctcctgaat aactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgtagagggata tttatagcactaaatgcccacaagagaaagcaggagacatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaac attaatgaatccaggagctggttttttgaaaggatcaacaaaattgacagaccgctagca agactaataaagaagaaaagagaaaagaagcaaatagacacaataaaaaatgataaaggg gatatcaccactgatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacacaaataaactaaaaaatctagaagaaatggataaattcctcgacacatacaccctc ccaagactaaaccaggaagaagttgaatctctgaatacaccaataacaggctctgaaatt gtggcaataatcaatagcttaccaacaaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactgccaggcagagacacaaccaaaaaagagaat tttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaac caaatccggcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctggg atgcaaggctggttgaatatacacaaatcaataaatgtaatccagcatataaacagaacc aaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaa caacccttcatgctaaaaactctcaataaattaggtattgatgggacatatctcaaaata ataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaagactggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccattgtctcagcccaaaatctcaagctgataagcaacttcagcaaagtctcagga tacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagag agccaaatcatgagtgaactgccattcacaattgcttcaaaaagaataaaatacctagga atccaacttacaagggacgggaaggacctcttcaaggagaactacaaaccactgctcaat gaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatc aatatcatgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatc aagctaccaatgactttcttcacagaattggaaaaaactactttcaagttcatatggaac caaaaaagagcccacattgccaagtcaatcctaagccaaaagaacaaagctggaggcatc acactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccactt cctttagagaagggaccaagtcctatatctctttcaaacttccaggttggaggtgttcaa gaaatagttatcgaaaatgtatcaactatttctactcatgagctcgctacagaagaaggg agtagggtctgcccccttgttgtatctattttccccttctccgccagggctgtggccggc aaggggctcaggaggttctgtctcgccgcaccccggcgggcactggcgcgggcaaggacg ctggcggggagaacgccctgggctcaacgtactccagaatcgaatgttgagagaagcagt gctctgatccagctcaggagaaaaaggagcgggttccgagtgagacttctggagccagct ggacgtgccggtttgcccagtgcggcgcggctgcacgcaccgtccacaagatttttatta catgtgaaattatatgaacataaaccccagggaaacattgaagatgagtgtgccgtcaca agtgacgaggggacagatgaacagggaagagaaatggggcagcagcactttagagtcaca ctgaatgcacccttggctccaagagtgacatgggacccaggcccatccaaacatcttgtt ctccagggcactgtgattggttcagggataggccgtgatccaagcagaccaattggaatc aatgaggctcagttgagaggtgttcaggtgccatcacctacattggatttggagatttaa >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_3|115_aa MVMIPSFIAGTKECAKEKKVPALPKTLKKKQRSFAELKIKHLRKKFAQKVLVKARRKLIC EKLKHFHKKYTQMDRMEIRMARMARKAGNLYIPAESKLAFVIRIRSSNGVHTKVF >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_3|348_bp atggtgatgataccctcttttattgccggaaccaaggagtgtgccaaagagaagaaggtt cctgctttgccaaaaaccctcaagaaaaagcaaagaagttttgcagagctgaagatcaag cacctgagaaagaagtttgcccaaaaggtgcttgtaaaggcaaggagaaagcttatctgt gaaaaactgaagcactttcacaagaaatatacacagatggacagaatggagattcgaatg gccagaatggcaagaaaagctggcaacctctatatacctgcagaatccaaactggcattt gtcatcaggatcagaagttccaatggtgtgcacacaaaggtgttctga >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_4|57_aa MVKEPRIDNRRSHYRFYAEASLLRVMVVEGCLTGAMIIGGRLQKRSTCSTTGKEPRK >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_4|174_bp atggtgaaagagcctaggattgacaacaggagaagccattaccgcttctatgccgaagct agcctattgagagtcatggtggtagaaggatgcctgacaggagctatgatcataggtgga aggttacagaaacggtccacctgcagcaccacaggaaaggagccccggaaataa >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_5|139_aa MGCKGLFASLMVTKNQKIYNKDTKKKTHKIKTYHLRKSRLPKGRQKEKKEGKEDQKATQN KSKNGKRMKPWTFTVSVTVLKDGVSGVCYFRCSDVSGVSYFWWVRGLADFRSEAMDLVAS VTALNGGVDPKSEQQQDLL >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_5|420_bp atgggctgtaagggattatttgccagcctcatggtaaccaaaaatcaaaaaatatataac aaagacacaaaaaagaaaacacataaaattaaaacataccacctgaggaaatcacgtttg ccaaaaggaagacagaaagaaaagaaggaaggaaaagaagaccagaaagcaacccaaaac aaatcaaaaaatggcaagagaatgaagccatggaccttcacagtgagtgttacagttctt aaagatggtgtgtccggagtttgttacttcagatgttcagatgtgtctggagtttcttac ttctggtgggttcgtggtcttgctgacttcaggagcgaagccatggacttggtggcgagt gttacagctcttaatggtggtgtggacccaaagagtgagcaacagcaagatttattgtga >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_6|365_aa MNITNCTTEASMAIRPKTITEKMLICMTLVVITTLTTLLNLAVIMAIGTTKKLHQPANYL ICSLAVTDLLVAVLVMPLSIIYIVMDRWKLGYFLCEVWLSVDMTCCTCSILHLCVIALDR YWAITNAIEYARKRTAKRAALMILTVWTISIFISMPPLFWRSHRRLSPPPSQCTIQHDHV IYTIYSTLGAFYIPLTLILILYYRIYHAAKSLYQKRGSSRHLSNRSTDSQNSFASCKLTQ TFCVSDFSTSDPTTEFEKFHASIRIPPFDNDLDHPGERQQISSTRERKAARILGLILGAF ILSWLPFFIKELIVGLSIYTVSSEVADFLTWLGYVNSLINPLLYTSFNEDFKLAFKKLIR CREHT >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_6|1098_bp atgaacatcacaaactgtaccacagaggccagcatggctataagacccaagaccatcact gagaagatgctcatttgcatgactctggtggtcatcaccaccctcaccacgttgctgaac ttggctgtgatcatggctattggcaccaccaagaagctccaccagcctgccaactaccta atctgttctctggccgtgacggacctcctggtggcagtgctcgtcatgcccctgagcatc atctacattgtcatggatcgctggaagcttgggtacttcctctgtgaggtgtggctgagt gtggacatgacctgctgcacctgctccatcctccacctctgtgtcattgccctggacagg tactgggccatcaccaatgctattgaatacgccaggaagaggacggccaagagggccgcg ctgatgatccttaccgtctggaccatctccattttcatctccatgccccctctgttctgg agaagccaccgccgcctaagccctccccctagtcagtgcaccatccagcacgaccatgtt atctacaccatttactccacgctgggtgcgttttatatccccttgactttgatactgatt ctctattaccggatttaccacgcggccaagagcctttaccagaaaaggggatcaagtcgg cacttaagcaacagaagcacagatagccagaattcttttgcaagttgtaaacttacacag actttctgtgtgtctgacttctccacctcagaccctaccacagagtttgaaaagttccat gcctccatcaggatcccccccttcgacaatgatctagatcacccaggagaacgtcagcag atctctagcaccagggaacggaaggcagcacgcatcctggggctgattctgggtgcattc attttatcctggctgccatttttcatcaaagagttgattgtgggtctgagcatctacacc gtgtcctcggaagtggccgactttctgacgtggctcggttatgtgaattctctgatcaac cctctgctctatacgagttttaatgaagactttaagctggcttttaaaaagctcattaga tgccgagagcatacttag >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_7|199_aa MIVYLENAIVSAQNLLKLISNFSKVSGYKINVPKSQAFLYTNNRQTESQIMSELPFAIAS KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIMIMAILSKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNQKRASIAKTILSQKNKVGGIMLPDFKLCYKATV TKTALYWYQNNGTEQSPEK >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_7|600_bp atgattgtatatttagaaaatgccatcgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaacgtgccaaaatcacaagcattcttatac accaataacagacaaacagagagccaaattatgagtgaactcccatttgcaattgcttca aagagaataaaatacctaggaatccaacttacaagggacgtgaaggatctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca tgctcatggataggaagaatcaatatcatgataatggccatactgtccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actctaaagttcatatggaaccaaaaaagagccagcattgccaagacaatcctaagccaa aagaacaaagttggaggcatcatgctacccgacttcaaactatgctacaaagctacagta accaaaacagcattgtactggtaccaaaacaatggaacagaacagagccctgagaaataa >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_8|218_aa MGDFNTPLSTLDRSIRQKVNKDIQELNSALHQVDLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNCLSDHSAFKLELRVKKLTQSHSTIWKLNDLLLNDYWV HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERLKIDTLTSQLK ELEKQEQIHSKATRRQEITKIIAELKETETQETLQKIN >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_8|657_bp atgggagactttaacaccccactgtcaacattagacagatcaataagacagaaagttaac aaggatatccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga actctccatcccaaatcaacagaatatacattcttctcagcaccacatcacacttattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcattcaaactagaactcagggttaagaaactc actcaaagccactcaactatatggaaactgaatgacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca acataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagattgaaaattgacaccctaacatcacaattaaaa gaactagagaagcaagagcaaatacattcaaaagctaccagaaggcaagaaataactaag atcatagcagaactgaaggagacagagacacaagaaacccttcaaaaaatcaattaa >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_9|382_aa MGESALPAVGGWDRDHFMEVEMSTLGFDRSHDFTRLKRRTKDIPVRWNSSEAWSEGGSSL GLCTGVQCCGLGTVGVMAKSRCETQVTKGLVCHSKESFAVNLSPTLAVLEQPSSLPLHCG SPSWGWPRPEPAPSACREVWRERQGQQPGLCTVLVVQRQFWVGMGLWEGVYEQMSAGTVV RECGNQPTTFGWQQVQILCRPHGSIQLKEEVRTQCKEANNIEKRLDEWLTRITSVEKSLN DLMELKTMARELRDKCTSFSSRFDQLEERVSVIEDQMNEMKREEKFREKRVKRNKQSLQE IWNYVKRPNLRLIGVTESDGENGTKLENTLQDITRENFPNLARQANIQIQEIQRTPQRYS SRRATPRHIIVRFTKVEMKKKC >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_9|1149_bp atgggggaatcagctctccctgcagtaggaggttgggacagggaccactttatggaagtg gaaatgtctacgctgggctttgaccgttcacatgatttcacaaggttgaaaaggagaacc aaggacattccagtgagatggaacagctcagaggcatggagtgagggaggaagtagtcta ggactctgcacaggggttcagtgctgtgggctgggaactgtgggagtcatggctaaatca agatgtgagacccaggtgacaaaggggctggtgtgccactctaaggaaagttttgctgtc aatctatcgcccactctggctgtgcttgagcagccctccagcctgccgctgcactgtggg agcccctcttggggctggccgaggccggagccggctccctctgcttgcagggaggtgtgg agggagaggcaagggcagcaaccggggctgtgcacagtgctcgtggtccagcgccagttc tgggtgggcatgggattgtgggagggagtgtatgagcaaatgagtgcaggaactgtagtg agagagtgtgggaaccagccaaccacttttgggtggcagcaggtgcaaattctgtgcagg ccccatggcagcatccagctaaaggaggaagttcgaacccaatgcaaagaagctaacaac attgaaaaaagattagacgaatggctaactagaataaccagtgtagagaagtccttaaat gacctgatggagctgaaaaccatggcacgagaactacgtgacaaatgcacaagcttcagt agccgattcgaccaactggaagaaagggtatcagtgattgaagatcaaatgaatgaaatg aagcgagaagagaagtttagagaaaaaagagtaaaaagaaacaaacaaagcctccaagaa atatggaactatgtgaaaagaccaaatctacgtctgattggtgtaactgaaagtgacggg gagaatggaaccaagttggaaaacactctgcaggatattacccgggagaactttcccaac ctagcaaggcaggccaatattcaaattcaggaaatacagagaacgccacaacgatactcc tcaagaagagcaactccaagacatataattgtcagattcactaaagttgaaatgaagaaa aaatgttaa >gi568815592f:86915335_87116429|GENSCAN_predicted_peptide_10|131_aa MEESGPLGELGPNVFGDLTLESLAVTTPSSVKNGRDTRGVKREDRDIPVSIQLNQVNPGM VLVQRRPRGIRHRPCPEAIYSLVQKGSRVKLSVAKEKPSLTASYGFLQSAAELTAAPCGQ VVHGASGRVSG >gi568815592f:86915335_87116429|GENSCAN_predicted_CDS_10|396_bp atggaagagagtgggccattgggtgagttggggccaaatgtgtttggtgacctgacattg gagagccttgctgttaccacacctagcagcgtaaagaatggaagggacactcgaggagta aagagagaggacagggacatccccgtgtccattcagctcaaccaggtcaaccctggcatg gtgctggtccagcgacggccaagaggaataagacacagaccctgccctgaagcaatttac agtctagtgcagaagggcagccgtgtgaaactgagcgtggccaaagagaagccctccctc actgccagctatggatttcttcagtccgctgcagaactcactgctgccccttgtggacaa gtggtccatggagcttctggaagagtcagtgggtga